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1967 represented the 50th yr. of publication of this Journal. A reading of 10 
complete volumes on a quinquennial cycle starting in 1917 suggests that the 
classic problems of applied psychology were approximately adumbrated by 
about 1922, except for the area of biomechanics. Many of today’s fashionable 
concepts, such as moderator variables, cross-validation, question format, are 
easily discernible in early volumes. The disappearance of several early tests 
as the technology improved is noticeable. Judging by bibliographic citation 
patterns, applied psychologists share psychologists’ generally cavalier attitude 
toward history. Thus this excursion into the history of 1 journal may com- 
mend itself to other amateur historians as an interesting and possibly worth- 


while didactic exercise. 


The time was March 1917. The United 
States was engaged in the “war to end war”; 
it came to be known later as World War I. 
Volume 1, Number 1, of the Journal of Ap- 
plied Psychology, Florence Chandler of Wor- 
cester, publisher, went to a subscription list 
of now unknown size. Its foreword by G. 
Stanley Hall, John Wallace Baird, and L. R. 
Geissler described its goals: 


The past few years have witnessed an unprecedented 
interest in the extension of the application of psy- 
chology to various fields of human activity... . 
Thence the attention of the applied psychologist 
turned to the more comprehensive and fundamental 
problem of vocational selection ...and it must 
appeal to every human being who is interested in 
increasing human efficiency and human happiness by 
the more direct method of decreasing the number of 
cases where a square peg is condemned to a life of 
fruitless endeavor to fit itself comfortably into a 
round hole . . . none of the existing journals devote 
themselves to the task of gathering together the 
results of workers in the various fields of applied 
psychology, or of bringing these results into relation 
with pure psychology....%In view of all these 
circumstances the undersigned have decided, not 
without hesitation, to launch a Journal of Applied 
Psychology,—their hesitation being of financial ori- 
gin because they have no pecuniary resources in 
hand or in prospect, excepting their own... [pp. 
5-7]. 


1 


The masthead identified the cooperating 
editors; these were the first of many psycholo- 
gists who served effectively in this anonymous 
but crucial gate-keeping role: W. V. Bing- 
ham, June E. Downey, Robert H. Gault, Dr. 
William Healy, H. L. Hollingworth, Joseph 
Jastrow, Herbert S. Langfeld, James H. 
Leuba, Henry C. McComas, Eleanor Harris 
Rowland, Walter D. Scott, C. E. Seashore, 
Daniel Starch, Edward K. Strong, Jr., L. M. 
Terman, E. L. Thorndike, J. E. Wallace Wal- 
lin, G. M. Whipple, Helen Thompson Wool- 
ley. The masthead goes on further to say: 


The field of the Journat will include the follow- 
ing: 

(a) The application of psychology to vocational 
activities, such as law, art, public speaking, indus- 
trial and commercial work, and problems of business 
appeal. 

(b) Studies of individual mentalities, such as types 
of character, special talents, genius, and individual 
differences, including the problems of mental diag- 
nosis and vocational prognosis. 

(c) The influence of general environmental condi- 
tions, such as climate, weather, humidity, tempera- 
ture; also such conditions as nutrition, fatigue, etc. 

(d) The psychology of everyday activities, such as 
reading, writing, speaking, singing, playing games or 
musical instruments, sports, and the like. 

In reply to several inquiries the editors wish to 
state that those mental tests which aim to deal with 
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purely educational problems or abnormal minds do 
not as a rule fall within the scope of the JouRNAL 
[pp. 1-2]. 

The contributors were expected to pay cer- 
tain costs but “as soon as the JouRNAL be- 
comes self-supporting, or as soon as the pres- 
ent excessive prices of paper and printing 
abate, the above terms will be made more 
liberal [p. 2].” The record indicates that the 
excessive prices did not abate. 

In his autobiography, Hall (1923) makes 
only the most casual reference to this new 
Journal. Writing about the evolution of the 
testing movement as an outgrowth and exten- 
sion of one of his major loves—child study— 
he says: 


It is in the great stress now laid upon these tests, 
too, that psychology in general has enormously in- 
creased its utility, but not without grave danger of 
lapsing from culture to Kultur, since the later meth- 
ods seek practical results with individuals and have 
added little to our knowledge of child nature in 
general. In this latter direction, however, Clark Uni- 
versity has made it a point for the last decade or 
more to have some competent authority to apply 
tests gratis to all children from the city sent or 
brought to it, and in recent years Dr. L. R. Geissler 
gave his entire time to making such tests and in- 
structing others to do so here, and we founded here 
the Journal of Applied Psychology, an organ for the 
development of this work which, as I write, is in its 
sixth volume [p. 399]. 


This is the only mention of Geissler in the 
autobiography; there is no mention whatever 
of Baird, the other coeditor. Baird had been 
at Clark since 1910; he had been in charge 
of the laboratory there and in 1918 served as 
president of the American Psychological As- 
sociation. He died in 1919, having served as 
coeditor for only 2 years. 

Hall was far more enamored of the Ameri- 
can Journal of Psychology, which he founded 
in 1889, and the Pedagogical Seminary, which 
he founded in 1893. In another place in his 
autobiography, he speaks more harshly of 
psychometrics: 

We have no method of testing testers, whose compe- 
tence varies vastly from mere mechanical accumula- 
tion of data and the busy work of computing corre- 
lations up to real insight and sagacity. .. . I would 
cast no discredit upon this work, much as I deplore 
its present monopoly of psychological endeavor, but 
to my mind no one has made it sufficiently clear 
just what even the most used of them really tests, 
and still less justified for either psychology or for 
life the value of the powers they activate. All of 
them together have done a great work in applying 
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psychology to life and industry but have added 
scarcely a scintilla to our knowledge of the human 
soul [p. 450]. 


Editors may propose, but authors and the 
availability of manuscripts dispose. There 
were 35 articles, 37 authors, and 399 pages 
in Volume 1. The volume also included book 
reviews and notices, brief notes and summa- 
ries, and a section on psychology and mili- 
tary affairs covering the deployment of psy- 
chologists, the work of special committees, 
and other special activities related to World 
War I, signed by Robert M. Yerkes, Major, 
Sanitary Corps, and Chairman Psychology 
Committee, National Research Council. 

Using the editors’ four fields for the new 
Journal, these 35 articles are classified in the 
1917 column of Table 2. It is evident that 
contributions were accepted under less than 
rigorous criteria in the first year. The essays 
ranged from Hall’s own APA twenty-fifth an- 
niversary speech on psychology and the war 
through two philosophic articles by A. A. 
Roback, in one of which he attacks Munster- 
berg’s avoidance of moral and ethical issues 
in applied psychology. The doubtful felicity 
of this critique lies in the fact that William 
Stern’s memorial tribute on the occasion of 
Munsterberg’s death is found in the same 
volume. Other essays deal with applications 
of psychology in a wide range of endeavor— 
with its utility to business men, its promise 
of solutions to the problems of labor turnovef, 
its value in increasing the efficiency of work- 
ers. In the main, these are characterized by a 
high moral tone, by hope rather than evi- 
dence, and by a very conservative economic 
outlook. 

By today’s standards, statistical analyses 
were primitive and often missing. A few arti- 
cles, however, are sophisticated in design, 
analysis, or presentation. Baird’s study of the 
legibility of telephone directories in various 
type faces; Burtt’s analyses of Munsterberg’s 
tests applied to various samples; Miner’s cor- 
relational study of rating methods; Pintner 
and Toops’ test analyses of unemployed men; 
Scott’s presentation of test results of appli- 
cants versus successfully employed as a vali- 
dating method; Terman’s use of psychological 
tests in selecting policemen and firemen; 
Thorndike’s study of decrements in work out- 
put and work satisfyingness; Yerkes’ com- 
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parison of Binet and point scale measures of 
intelligence—each of these is a delight to read. 
Wallin, in a long case history of a stutterer, 
takes a direct slap at Freudian theory: 


But to maintain that all the words in a custom- 
made list which are associated with delayed re- 
sponses reveal an inner conflict or emotional com- 
plex, may suffice for the unsophisticated devotee of 
a cult but surely cannot satisfy the critical experi- 
mental scientist who demands facts capable of ob- 
jective verification [pp. 358-359]. 


Garrison, Burke, and Hollingworth present a 
case report on a superior child first tested at 
8 years and 4 months of age, with a Stanford- 
Binet of 187; follow-up reports on this child 
appear with pleasant regularity in Volumes 6 
and 11 of the Journal. 

And so the new journal was launched. It 
took its place in brilliant company: Psycho- 
logical Review was, in 1917, 23 years old and 
its monograph supplements first appeared in 
1896; the Journal of Experimental Psychology 
had been established in 1916; the Psycho- 
logical Bulletin was 13 years old; the Psycho- 
logical Index had been in existence since 
1894; the Journal of Abnormal Psychology 
first appeared in 1906; the Journal of Educa- 
tional Psychology was first published in 1910; 
Psychological Monographs first appeared in 
1910; G. Stanley Hall’s first love, the Ameri- 
can Journal of Psychology, started publica- 
tion in 1889, the year in which Hall opened, 
as president, the doors of Clark University. 

To see how the journal evolved, I have read 
Volumes 1, 6, 11, 16, 21, 26, 31, 36, 41, and 
46 in their entirety, thus sampling almost a 
half-century of the Journal’s life. Some of 
the material I have tabled. (See Tables 1, 2, 
and 3.) At other points I have commented on 
trends or items of interest—to me at least.t 

The chronology of editors and ownership 
deserves brief comment. In 1921, James P. 
Porter, Clark University, and William F. 
Book, Indiana University, appeared as editors. 
Halli had apparently sold the Journal in 1920 
or 1921; his ardor for it, certainly, had cooled. 
Porter left Clark to go to Ohio University; in 
1922 his new affiliation first appeared. By 
1928, Porter was the sole editor; Book ap- 


1 The structure of this review owes much to David 
Cohn’s (1940) The Good Old Days, a fascinating and 
exciting treatment of our social history. I regret my 
lack of time and skill to duplicate Cohn’s analysis. 


TABLET 1 


NUMBERS OF ARTICLES, PAGES, AND AUTHORS 
IN SELECTED VOLUMES OF THE JOURNAL 
OF APPLIED PsYCHOLOGY 














No. 
ven Volume No. No, No. early 
number | articles | pages | authors | publi- 
cations 
1917 1 35 399 37 aad 
1922 6 a0) 433 37 a 
1927 11 45 544 51 == 
1932 16 68 Fld 81 a 
1937 21 59 735 66 = 
1942 26 81 887 102 — 
1947 31 80 604 96 Pil 
1952 36 97 433 132 Di 
1957 41 89 418 131 4 
1962 46 85 432 141 i 








Note.—Book reviews, notes and news, materials received 
appeared through Volume 39. Contemporary Psychology, estab- 
lished in 1956, removed these pages from the Journal of Applied 
Psychology. 


parently withdrew in 1927. The Psychological 
Bulletin for 1943 (Volume 40, page 652), re- 
porting the annual business meeting of the 
APA, noted that the Journal was purchased 
from James Porter that year. Volume 27, 
1943, was edited by Donald G. Paterson and 
published by the APA with the cooperation 
of the American Association for Applied Psy- 
chology. In 1955, Volume 39, I started a 6- 
year term as editor. In 1961, Volume 45, Ken- 
neth E. Clark became the editor and con- 
tinues today in that position. Porter’s tenure 
lasted 22 years; Paterson held the editorship 
for 12 years. The years of the Journal’s ex- 
istence have been the years of psychology’s 
most rapid growth. New journals have come 
into being in the area of counseling psychol- 
ogy, in the areas of psychometrics and test- 
ing, the area of human engineering, and the 
area of personnel management, to mention 
some of the fields for which the Journal of 
Applied Psychology has provided a publica- 
tion outlet. 

Let us go back, now, to our quinquennial 
sampling of the volumes. Volume 6 covers the 
year 1922. There were 33 articles, 433 pages, 
and 37 authors. The number of dataless es- 
says has dropped to two; four articles that 
must be classified as statistical or methodo- 
logical now appear: Toops described a method 
of computing a matrix of intercorrelations by 
using an adding machine; Hull described a 
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TABLE 2 

PERCENTAGES OF ARTICLES IN VARIOUS CATEGORIES OR FIELDS 

Applica-" | studies of | Tiuence Psychology Statistical/ 
Year Volume Gs ee individual of environ- of everyday Essays Case methodo- 

number vocational oe mental ean. studies : 
ihe mentalities a activities logical 

activities conditions 
1917 fi 20 26 0 6 36 9 3 
1922 6 27 49 0 3 6 3 12 
1927 11 22 47 0 13 4 7 i 
1932 16 26 47 2 13 0 2 10 
1937 21 26 34 3 20 i 3 7 
1942 26 31 oe 2 12 5 4 14 
1947 31 44 21 5 9 7 0 14 
1952 36 50 18 10 7 8 0 7 
1957 41 27 25 26 16 0 0 6 
1962 46 34 25 27 1 0 0 13 














Note.—It was immediately evident in Volume 1 that the editors’ original four fields were insufficient ; thus these other categories 
of articles had to be introduced in the classification scheme. Additionally, I have no evidence on the reliability or interjudge con- 
sistency of the categories, since I was the sole judge of each article’s place in the scheme, 


method of deriving sigma scores, and a method 
of computing Pearsonian r’s from ranked 
data; Thorndike discussed methods of deriv- 
ing equivalent scores on intelligence tests of 
varying degrees of correlation, means, and 
dispersals; Chapman discussed the problem of 
known and guessed scores in true-false exami- 
nations. 

Ream, in an article describing the develop- 
ment of a test to measure Thorndike’s “social 
intelligence,’ mentions socially acceptable 
and unacceptable items, foreshadowing the 
later spate of literature on item format. Pof- 
fenberger concludes an article on setting cut- 
ting scores with the following statement: 


From the psychologist’s point of view it might not 
be difficult to decide upon a critical intelligence score 


most institutions where tests are being established 
the keenest interest centers upon the individual case. 
If one or a few good students be lost, the test may 
thereby be condemned. “If a man have a hundred 
sheep and one of them be gone astray, doth he 
not leave the ninety and nine, and goeth into the 
mountains, and seeketh that which is gone astray? 
And if so be that he finds it, . . . he rejoiceth more 
of that sheep than of the ninety and nine which 
went not astray.” [p. 160] 


So much for selection ratios, institutional de- 
cisions, and predictive validity. Poffenberger, 
in another article, uses peer ratings as an 
independent variable in predicting the cri- 
terion of expert ranks of letters of applica- 
tion. Max Freyd introduces the methods later 
used by Strong in the measurement of voca- 
tional interests. Root, in one of the two es- 














for the purposes of this school.... But here as in says, gives a sophisticated discussion of the 
TABLE 3 
PERCENTAGES OF AUTHORS IN VARIOUS TYPES OF EMPLOYMENT 

Vear Volume | Consulting | University | Military | State/city Federal Private Not 

number | orindustry | or college service service service practice known 
1917 1 14 66 0 8 0 6 6 
1922 6 3 73 0 21 0 0 3 
1927 11 6 77 0 11 0 0 6 
1932 16 ey 72 0 15 3 3 6 
1937 21 9 76 0 15 0 0 0 
1942 26 19 60 4 13 0 0 4 
1947 31 30 62 4 2 1 1 0 
1952 36 10 79 5 2 2, 2 0 
1957 41 16 61 20 1 1 0 1 
1962 46 18 59 20 1 2 0) 0 
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clinical versus the statistical problems under- 
lying the battle over the constancy of the IQ. 
Goldsmith reports the complete standardiza- 
tion of a weighted application blank for use 
in predicting success of life insurance sales- 
men. Naccarati and Lewy-Guinzberg, writing 
on the relation between hormones and intelli- 
gence, display various correlations between 
height-weight ratios and _ intelligence test 
scores, explaining away the low correlations 
by sampling errors that led to the inclusion 
of too many normosplanchnics in the student 
samples, at the expense of microsplanchnic 
and macrosplanchnics. J. McKeen Cattell an- 
nounces the availability of shares of stock in 
the newly organized Psychological Corpora- 
tion at $10 per share. Knight shows an amaz- 
ing mean difference between teacher volun- 
teers who signed their names and those who 
did not in scores on a scholastic ability test 
in a program introducing them to the idea of 
mental testing. Viteles compares the inter- 
correlations and differential predictive power 
of three different intelligence tests, using col- 
lege grades as criterion measures. His findings 
indicate that tests of presumably the same 
functional domain do not show high inter- 
correlations or equal predictive power. One 
gets the feeling, from Volume 6, that quite 
a few of the classic problems of applied psy- 
chology had been opened up fairly early. 

In Volume 11, 1927, three methodological 
articles discuss partial: correlations, weight- 
ing and scaling methods for mental tests, and 
theorems concerning the arithmetic mean and 
standard deviation. Kornhauser, standardiz- 
ing a questionnaire to differentiate between 
good and poor students at the University of 
Chicago, clearly describes the procedure of 
cross-validation, although he does not so 
name it: 


The unreliability of the questionnaire results became 
further apparent through the following comparison. 
In order to check the results obtained from our 
sample of 45 questionnaires, another sample of 30 

. was taken. Of the 30 questions which gave 
the best differentiation in the first sample, only 
eight gave even fairly good results in the new sam- 
ple. The correlation of total scores on the 30 items, 
with average marks, was 0.20 in this group of 30 
people, as contrasted with the 0.73 in the original 
sample [p. 93]. 


Reporting on two case histories of a pair of 
sisters placed in an orphanage setting, Flor- 


On 


ence Teagarden deals with the nature-nurture 
controversy as follows: 


By way of conclusion it may be said that the im- 
provement in moral, physical, and domestic habits 
and skills seems to justify the move that was in- 
augurated for these two children. Their quality of 
intelligence, however, has remained unchanged dur- 
ing six years. They were feeble-minded in the old 
environment and they are feeble-minded in the new 
environment. Finally it may be said that their old 
environment is probably only an index of the intelli- 
gence of the stock from which the children come 
[p. 296]. 


So much for nurture! 

In two articles, South deals with psycho- 
logical aspects of committee work: ‘The com- 
mittee is a perquisite of democracy .. . to- 
day a goodly part of the world’s deliberations 
takes place in committees [p. 348].” His re- 
search designs deal with committee size, type 
of problem, sex of committee members, role 
of intelligence, and time available for com- 
mittee activities. Considering that these ques- 
tions are still not clearly understood by our 
colleagues in social psychology, it is interest- 
ing to see such an early applied approach to 
these problems. The moral tone still sounds 
occasionally in Volume 11. In a job analysis 
of athletes, Charles Homer Bean writes: 


Educators seem to be no longer in doubt about the 
value of games and other athletic sports, not only 
for health and for the consumption of surplus en- 
ergy that would otherwise bother the teacher, but 
also for the education of both body and mind. The 
boy or girl who enjoys few, or no opportunities for 
vigorous athletic training seems to grow to adult- 
hood unable to use, without confusing amounts of 
static, the various psychoneural hook-ups that are 
as all-important to clear thinking as to the efficient 
converting of thought into action [p. 369]. 


The notes and news columns of Volume 11 
contain the following two items: 


“Motion picture theaters are too dark,” Guy A. 
Henry, of New York, General-Director of the Eye 
Sight Conservation Council of America, declared in 
a recent statement. “People are unnecessarily sub- 
jected to eye strain,” he asserted, “in poorly lighted 
auditoriums.” Investigations reveal that managers 
of motion picture theaters have no method of 
determining the effectiveness of the lighting, said 
Mr. Henry, who urged the framing of a special code 
of illumination for motion picture auditoriums, fol- 
lowing a scientific study of the problem. He be- 
lives that this is too important a matter to be left 
to the judgment of individuals [pp. 165-166]. 


Defective eyesight contributes to juvenile delin- 
quency, according to Guy A. Henry, General-Direc- 
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tor of the Eye Sight Conservation Council of Amer- 
ica. Child Health Day, general observance of which 
has been asked in proclamations by mayors of 
cities and other officials throughout the country, 
should be made an event of moral and physical sig- 
nificance in every household where there are chil- 
dren. 

Investigation by the Eye Sight Council has shown 
that “bad eyes make bad boys,” and that the ex- 
perience of Juvenile Courts prove that defective 
vision makes children truants. Each year more than 
200,000 children come before these courts, and eye 
conservation in the home and in the schools should 
be employed as a factor in checking criminal tend- 
encies. 

“Juvenile Courts are finding that bad eyesight 
leads to inattention in school, to unfair competition, 
and to disrespect for authority. Bad eyes lead to 
truancy and the truant child is a criminal in em- 
bryo.” [p. 319] 


The editorial standards for notes and news 
were, apparently, not stringent. 

Volume 16 showed noticeable increases in 
total pages, number of articles, and number 
of authors. However the proportions of arti- 
cles in various fields of applied psychology 
did not change markedly. 

Two items from the Notes and News sec- 
tions of Volume 16 are interesting: 


According to a recent report by the New York 
Commission on Ventilation there are still twenty 
states in the Union which retain statutes concerning 
ventilating devices which are based on disproved 
or antiquated theories. Dr. C.-E. A. Winslow, chair- 
man of the Commission, estimates that an unneces- 
sary expenditure of nearly $2,500,000 annually is 
made in an attempt to conform to obsolete regu- 
lations requiring the installation and maintenance in 
school buildings of a system of mechanical venti- 
lation which science has proved to be not only 
unnecessary but even a menace to health [p. 97]. 


Syracuse University announces the establishment of 
an independent department of psychology, with Dr. 
Ernst Thelin as head of the department. Until the 
present time, psychology and philosophy have been 
operating as a single department at Syracuse, de- 
spite the fact that in nearly all other universities 
they have established separate organizations. At 
Syracuse such a change was deferred, owing to the 
satisfactory operation of both these branches of 
teaching under the head of the Department of Phi- 
losophy. But in recent years psychology has grown 
at Syracuse from a simple department in Philosophy 
to a group including 16 sections of Introductory 
Psychology and 18 advanced courses, with a total 
enrollment of 1,000 students [p. 98]. 


The only case study reported in Volume 16 
is S. H. Britt’s “Examination of Miss Gene 
Dennis, ‘Psychic’.”” Miss Dennis was appar- 
ently laying them in the aisles in St, Louis in 
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her theatrical performances as “a real seer 
[p. 691].” Dr. Britt, having examined her at 
her own request, states: 


The experiment seems to justify the following con- 
clusions with regard to Miss Dennis: 

1. In terms of IQ, she was slightly below average. 

2. In the solution of a learning problem, she was 
considerably less efficient than the average college 
student. 

3. As to “mind-reading” ability, she failed to dem- 
onstrate any. 

4. In no part of the experiment was there evi- 
dence of “psychic” powers [p. 694]. 


There is no evidence of the effect of these 
conclusions on Miss Dennis’ subsequent ca- 
reer. 

Volume 16 carries an excellent study of 
faking of vocational interest test scores by H. 
C. Steinmetz; among his suggestions: ‘Further 
experimentation of this sort with personality 
tests is suggested for measurement of social 
insight and the extent to which purpose may 
distort test results [p. 130].” 

An interesting observation in Volume 16 
on something akin to socialized medicine is 
found in the following introduction to an 
article by Keller and Weber: 


At present all persons seeking to prepare for the 
dental profession in Germany must take and suc- 
cessfully pass a battery of searching tests. Before 
proceeding to an account of the tests it will be well 
to say something regarding the training and status 
of the dentist in Germany. The dentist is not to be 
confused with his academic competitor, the Zahn- 
arzt. The Zahnarszt is fully educated in the academic 
sense and holds a full medical degree. He is a medi- 
cal doctor, perhaps also a surgeon, who has special- 
ized in diseases and troubles of the teeth. The den- 
tist on the other hand is a practical though perhaps 
highly skilled technician. We are speaking here of 
the Staatlichgeprueften Dentisten who form the Na- 
tional Association of Dentists (Reichsverband 
Deutscher Dentisten) with head offices in Berlin. 
Among others these dentists are permitted to take 
as patients the large number of people who are 
insured in the Krankenkasse, one of the great social 
institutions of Germany. Certain classes of people, 
particularly laborers, are required to pay small fees 
to the Krankenkasse as a premium for medical serv- 
ices to which they are entitled when need arises. 
This system has been found successful in combating 
the tendency which is world-wide to neglect ills— 
naturally one takes advantage of medical services 
for which one has already paid [p. 465]. 


Their evidence on validation is given briefly: 
“Tt has never occurred that a candidate who 
was found ‘suitable’ in the test was after- 
wards found unsuitable in college work, ex- 
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cept in one case in which, however, a psy- 
chosis was involved [p. 474].” 

Volume 16 seems to have a report on the 
first use of subliminal stimulation affecting 
consumer behavior. Entitled “How the Con- 
sumer Estimates Quality by Subconscious Sen- 
sory Impressions,” the article describes a field 
test of four identical pairs of women’s stock- 
ings treated with traces of four different 
scents. The author blandly quotes from a 
local newspaper the reason for discontinuing 
the field work: 


Police learned the efforts were made at the 
suggestion of Prof. Donald Laird, professor of 
psychology at Colgate. They reported that the 
experimenters carried a box of three pairs of stock- 
ings which were perfumed and another with stock- 
ings that were without the artificial odor and their 
errand was to learn whether the housewives would 
consider the perfumed article more attractive than 
those without perfume [p. 245]. 


Laird’s own conclusions: 


Two things are plainly indicated by this experiment: 
1. Subconsciously perceived sensory impressions re- 
ceived through the olfactory apparatus are potent 
in determining the housewife’s judgment of quality 
in silk hosiery. 2. Scents of one type are more influ- 
ential in determining this judgment of quality than 
are scents of another type [p. 246]. 


Applications of psychophysical methods 
are not missing in Volume 16. A modest study 
by G. R. Wendt, then of the Yale Institute 
of Human Relations, presents a modest sum- 
mary: 


The Method of Constant Stimuli was applied to 
two problems in the measurement of psychological 
factors in industrial products. Using this method it 
was shown that a rubber tile floor composed of alter- 
nate light and dark squares will appear brighter 
when its pattern is made up of larger units. The 
amount of this apparent brightness difference was 
measured. In a second application of the same 
method it was found that the character of cover 
markings of golf balls has no effect on their apparent 
size. 

Such applications of experimental psychological 
methods seem to the writer to offer a wide and as 
yet untouched field for work in applied psychology. 
The applications of psychology to industry have in- 
cluded personnel studies, studies relating to the 
adjustment of the worker to his job and studies in 
the psychology of advertising. Except for an occa- 
sional study of consumers’ preferences little measure- 
ment of psychological factors in industrial products 
has been attempted. The field is one which recom- 
mends itself not only because of its range of possi- 
bilities but also because of the relative ease with 
which its problems can be solved [p. 276]. 


Volume 21 finally contains an article clearly 
falling in the psychology of everyday life; 
Henry Adams provides a statistical analysis 
of golf. Several laws derive from this study: 


Other things being equal, the average number of 
strokes needed to play a hole is a straight line func- 
tion of the length of the hole. ... The average 
effective length of the drive of the composite tourna- 
ment winner is 228.96 yards. . . . The effective length 
of the full second shot as played by the tournament 
winners is 228.96 yards. ... The humanly perfect 
shot is one having a deviation of approximately 5° 
on both sides of the intended line of flight of the 
ball [p. 384]. 


These “Jaws” are cold comfort to weekend 
golfers. 

Three articles in this volume deal with 
automobile driving and traffic problems, the 
first such articles in this sampling of the 
Journal. Andrew presents the first factor 
analysis seen in this quinquennial review, in 
her report on the standardization of the Min- 
nesota Vocational Test for Clerical Workers. 

In a study of the job satisfaction of psy- 
chologists, Hoppock sampled the APA mem- 
bership in vocational or industrial psychology; 
out of 203 job satisfaction questionnaires 
mailed, 66 psychologists responded. ‘During 
the four years from 1932 to 1936 the average 
salary had decreased $157 a year to a 1936 
average of $3,261 [p. 301].” The good old 
days! 

The formation of the new American Asso- 
ciation for Applied Psychology is mentioned 
in the Notes and News columns and in a 
more extended article by Fryer, following 
upon the APA meeting in Minneapolis in 
1937. 

Another excellent example of cross-valida- 
tion, though not so called, appears in the arti- 
cle by Helen Messenger entitled: “An Illus- 
tration of New Techniques in Test Construc- 
tion [p. 311].” A little-known and, I must 
admit, long-forgotten article by Darley (page 
485) bears a fair resemblance to what later 
came to be called studies of moderator varia- 
bles; a similar study of moderator variables, 
also affecting the prediction of academic suc- 
cess, is presented in a study by Wolf (page 
304). 

Volume 26, 1942, marked the last year of 
Porter’s editorship. It contained 81 articles 
and 887 pages. The first appearance of the 
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analysis of variance, in this quinquennial re- 
view, is found in two articles; one, by Brown, 
Fisk, and Torkelson, reports on recovery from 
glare blindness under conditions of changes in 
dietary intake of Vitamin A; the other, by 
Jackson, Jerome, and Schoenfeld, reports a 
study of the effectiveness of deodorant creams! 

World War II makes its presence felt in 
Volume 26 in the Notes and News columns; 
in one article on Naval recruit testing by 
Lewinski (page 316); in an essay by English 
on the applications of psychology to problems 
of military training (page 3); in Link’s re- 
port of attitude surveys of workers in a war 
economy; in Ruch and Young’s analysis of 
the effectiveness of Axis propaganda; and in 
an early and favorable book review of Tol- 
man’s Drives Toward War. Psychology’s re- 
sponse to World War II reached journal pub- 
lication somewhat more rapidly than was the 
case in World War I. 

Another excellent example of cross-valida- 
tion is found in the article by Ryan and 
Johnson (page 543) on the use of interest 
measures in the selection of salesmen and 
servicemen. 

Volume 31, 1947, is the first volume sam- 
pled here under Paterson’s editorship. As 
Table 1 will show, 27 articles (or 34% of all 
articles in the volume) represented early pub- 
lication paid for by the authors. One gets the 
feeling the option was somewhat overused. 

The first extended discussion of the field of 
human engineering or biomechanics appears 
in Volume 31; Dunlap’s presidential address 
to the Division of Consulting Psychology at 
the Denver APA meetings (1947) describes 
the wartime emergence of the field and pre- 
dicts additional areas of application in forth- 
coming years. As will be seen in Table 2, the 
little used category of “influence of environ- 
mental conditions,” in Volume 31 and subse- 
quent volumes under review, becomes heavily 
loaded with human engineering articles. 

Clark, in the midst of many favorable re- 
ports of military applications of psychology 
in these volume years, takes from Navy re- 
ports examples of projects that were of lim- 
ited value or were total failures in contribu- 
ting to military efficiency (page 94). This 
article will be required reading at the start of 
World War ITI. 

Volume 36, 1952, gives a dramatic illustra- 
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tion of editorial power; as Table 1 indicates, 
it contained 97 articles, the largest number in 
the volumes reviewed here but packed within 
only 433 pages that also included book re- 
views and other materials. Since the propor- 
tion of these articles in the various areas of 
applied psychology did not differ markedly 
from other years, readers were merely exposed 
to more of the same diet, with the notable ex- 
ception of a decrease in articles about adver- 
tising research. One of these articles (Twedt, 
page 207), however, applies factor-analytic 
methods of mechanical and content variables 
in ads as predictors of a readership criterion; 
it is the most effective theoretical study of 
advertising to be found in the volumes re- 
viewed. 

A pleasant air of controversy appears in 
Volume 36; Eells and Darley exchanged 
rather brisk comments on Darley’s unfavora- 
ble review of Eell’s coauthored volume on in- 
telligence and cultural differences; Farr, Jen- 
kins, Paterson, and England exchange bland 
rebuttals with Klare and with Flesch regard- 
ing their earlier article on the simplification 
of Flesch’s reading ease formulas. Bernard 
criticizes Rock’s “Sales Situation Test”; 
Rock answers Bernard’s criticism, and Pater- 
son replies to Bernard’s passing comment 
that he shouldn’t have published Rock’s origi- 
nal article in the first place! With regard to 
industrial applications of psychology, this 
volume contains examples of more sophisti- 
cated studies of industrial organizations, as 
distinct from prediction studies of perform- 
ance of personnel. An article by C. G. Brown 
and B. J. Neitzel, entitled ““Communication, 
Supervision, and Morale,” illustrates this 
shift in emphasis (page 86). 

In Volume 41, there is a noticeable increase 
in the number of articles classified as human 
engineering (in the area, “influence of the 
environment’). I was serving my third year 
as editor; APA policies had set an allowance 
of approximately 400 pages per volume year 
for this Journal. I seemed to have continued 
Paterson’s emphasis on skort articles, or at 
least contributors had continued to submit 
shorter articles after Paterson’s terms as edi- 
tor. A somewhat increased level of sophistica- 
tion in choice of problems and applications of 
methods seems to appear in this volume, but I 
hasten to disclaim any editorial credit for this; 
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sooner or later the field simply ad to become 
more sophisticated! 

By Volume 46, 1962, Kenneth E. Clark 
had assumed the editorship of the Journal. In 
Volume 41 and again in Volume 46, an in- 
crease in the proportions of authors from 
other than academic employment may be seen, 
reflecting in part the changing nature of em- 
ployment patterns in psychology. (See Table 
oo) 

My colleagues in history find themselves ill 
at ease in the years of the present; they can 
neither identify trends nor evaluate factors 
producing change. I find that I share their 
malaise as I move to the present years of the 
Journal of Applied Psychology. It would be 
well, therefore, to draw the curtain of this 
review with Volume 46, 1962. 

But what of the past and what, in broad 
outline, can be learned from it? Several gen- 
eral observations come to mind as I return 
these 10 dusty volumes to their dusty shelves. 

Plus ca change, plus c’est la méme chose. 
There is obviously no domain of knowledge 
or unitary field of psychology that can be 
subsumed under the word “applied”; rather 
there are many ways and places to apply psy- 
chology. In large part, these were nearly all 
foreshadowed in the early volumes of this 
Journal. Today’s concepts of cross-validation, 
moderator variables, and question format, for 
example, are easily discernible in early arti- 
cles. As Xhignesse and Osgood (1967) have 
recently pointed out in studying the time di- 
mension of journal citations, 


. . . the time perspective of psychology is short and 
getting shorter. If this is true, then it would seem 
that psychology as a science is in danger of “for- 
getting where it has been,” of repeatedly rediscover- 
ing facts and theories that have been well worked 
in the past [p. 790]. 


Certainly applied psychologists seem to have 
a cavalier attitude toward their own history. 
And, as Santayana pointed out, those who for- 
get their own history seem doomed to relive 
it; there is much that is repetitious in the 
enterprise. 

At the same time, there was and still is an 
air of excitement and discovery as one follows 
the Journal of Applied Psychology over time, 
seeing on occasion a new application, an inter- 
esting application, or an exciting methodologi- 


cal solution to a persistent problem in human 
affairs. That such articles are rare or infre- 
quent is not, I submit, characteristic of this 
Journal alone; the truly seminal or important 
articles in any area of psychology are infre- 
quent events. 

Another question: where are the tests of 
yesteryear? Many tests appear in early arti- 
cles that by now have disappeared into well- 
deserved oblivion, as the psychotechnology 
and standards of test construction have be- 
come more sophisticated and rigorous. Hall 
did not foresee this, nor did he foresee the 
contribution that test developments and sta- 
tistics would make to our understanding of 
the organization of behavior. The Journal of 
Applied Psychology has always been an out- 
let for this area, for strong and weak articles. 

In the main, authors in this Journal have 
been writing about performance differences 
related to age, sex, race, special group mem- 
bership, or (rarely) special conditions set by 
an experimenter. These studies, in Cronbach’s 
(1957) definition of the two sciences of 
psychology, were concerned with events that 
nature herself had defined and created; they 
also, in more modern terms, dealt both with 
construct validity and predictive validity. 
In any new science, many such studies are 
needed to clear away the underbrush of “com- 
mon sense,” to test assumptions of causality 
or covariation, to develop methods and experi- 
mental expertise, and to open up new areas 
of study in the field and in the laboratory. 
The pages of the Journal of Applied Psychol- 
ogy reflect this evolution, with both its 
strength and its weaknesses, its redundancy, 
and its infrequent points of high discovery. 
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LATERAL VISUAL FIELD AS RELATED TO AGE AND SEX’ 


ALBERT BURG 


Institute of Transportation and Traffic Engineering 
University of California, Los Angeles 


Lateral nasal and temporal visual field measurements were obtained by means 
of a screening perimeter for nearly 17300 Ss, ages 16-92. The major findings 
are: (a) temporal and total fields are maximum to about age 35, after which 
field constricts progressively with advancing age, (b) nasal fields increase to a 
maximum occurring about age 35 or 40, after which a progressive decline 
takes place, and (c) females consistently demonstrate slightly larger visual 
fields than men. Possible interpretations for these and other findings are pre- 
sented, and additional research is suggested to explain some of the relationships 


obtained in the study. 


From 1961 through 1966, the author di- 
rected a large-scale research project investi- 
gating the relationships between vision and 
driving. The major conclusions from this study 
are presented in a recent report (Burg, 
1967b). 

Apart from data relating vision to driving, 
the study produced a very great amount of 
information concerning visual performance as 
a function of age, sex, and other variables. 
The present paper is one of a series of reports 
(including Burg, 1966 and 1967a) presenting 
normative data on vision that should be of 
interest to others working in this area. 


METHOD 
Apparatus 


The device used for testing lateral visual field was 
an American Optical Company screening perimeter, 
fixed in a horizontal position (Figure 1). The perim- 
eter has a 30-cm.-radius black mat arc, with re- 
cessed 4-mm. white targets located at 5° intervals 
around the arc. Each of the targets is activated by 
depressing a spring-loaded plunger on the back side 
of the arc, and the target disappears from view as 
soon as the plunger is released. Each target subtends 
approximately 46 min. of arc. At each end of the 
arc is a sighting device for obtaining the correct eye 
level and 30 cm. distance from the cornea to the 
fixation point (a 4-mm. white target in the center 
of the arc). An adjustable double chin rest is used 
to obtain this proper positioning of S’s head for 
monocular testing. The arc is evenly illuminated by 
a 50-w., 120-v. incandescent bulb mounted above 


‘Data collection and analysis costs were partially 
borne by the United States Public Health Service 
(Grant AC-00015), and the State of California and 
United States Bureau of Public Roads (California 
Standard Agreement 13600). 
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S’s head on the axis of curvature of the arc.2 The 
targets are approximately 69 cm. from the center 
of the light source. A black cloth is suspended from 
the lower edge of the arc to screen out E’s arm 
movements when he is manipulating the target 
plungers. 

Three testing units were in simultaneous operation 
during the course of the data-collection phase of 


*Target illumination is 7.5 ftc. measured per- 
pendicular to the light source-to-target axis. When 
measured perpendicular to the subject-to-target line 
of sight, target illumination is approximately 2 ftc. 





Fic. 1. Lateral visual field screening apparatus, 
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the study. To insure comparable scores, the perim- 
eter test setup was identical in every respect in all 
situations. All testing was conducted in light-free 
rooms, with the perimeter arc solely illuminated by 
the aforementioned 50-w. bulb. (Needless to say, 
the several test operators employed during the 32-mo. 
data-collection period were all highly trained to 
administer the test in an identical manner.) 


Procedure 


After S was seated at the perimeter (on an ad- 
justable-height chair), he rested his chin on the 
right side of the chin rest (covered by a tissue), 
placing his left eye in line with the fixation target 
in the center of the arc. Using the sighting devices 
at each end of the arc, E adjusted the chin rest and 
the tilt of S’s head until the latter’s eye was at the 
proper level and distance from the fixation target. 

The S then was asked to hold a folded tissue over 
his right eye. Care was taken that the left-eye nasal 
field was not intruded upon, either by the tissue or 
by S’s hand. Standing behind the perimeter, £ 
asked S to fixate with his left eye upon the fixation 
target (EZ monitored S’s adherence to this instruc- 
tion throughout the test). Using a modified method 
of limits, E then established the (lateral) nasal and 
temporal limits of S’s left-eye field, by recording 
the position of the furthest target perceived by S, 
both nasally and temporally. If it appeared neces- 
sary, Measurements were repeated to insure that S’s 
responses were accurate and that his eyeglass frames, 
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Fic. 2. Left nasal visual field by age and sex. 
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Fic. 3. Right nasal visual field by age and sex. 


if any, were not obscuring the targets. The S then 
placed his chin in the left side of the chin rest, and 
his right-eye nasal and temporal fields were simi- 
larly measured. 


Subjects 


The Ss were California drivers who were volun- 
tarily participating in the vision and driving study 
mentioned earlier. A total of 17,479 Ss were involved 
in the program, 62.8% of whom were male and 
37.2% female. The age range was from 16 to 92, 
and (corrected) static acuity ranged from 20/13 to 
20/200. The Ss were tested at branch offices of the 
California Department of Motor Vehicles scattered 
throughout the State. 


RESULTS 


Table 1 presents a summary, by age and 
sex, of mean monocular nasal and temporal 
fields, as well as total monocular and binocu- 
lar fields. Figures 2 through 8 present graphi- 
cal depictions of these data. 

The tabular and graphical data are based 
on somewhat less than 17,300 Ss; approxi- 
mately 200 Ss were not included in the pres- 
ent analyses, either because they did not have 


3 The Ss were tested with corrected vision, if they 
indicated that they wore corrective lenses to drive. 
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Fic. 5, Right temporal visual field by age and sex. Fic. 7. Right-eye visual field by age and sex. 
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valid perimeter scores (e.g., they failed to 
maintain steady fixation upon the central 
target), or because they were blind in one 
eye. (Because of their special nature, one- 
eyed Ss are being analyzed separately, and 
will be the subject of a future report.) 

Inconsistencies among the cell sample sizes 
evident in Table 1 are due to the fact that 
different computer runs provided data for dif- 
ferent sections of the table, and a few Ss had 
invalid scores for some, but not all, of the 
variables under consideration. 

Upon inspection of the above table and 
graphs, the following general statements may 
be made: 


1. Field versus Age: 


(a) For both sexes, total visual field and 
both temporal fields are at their maximums 
from age 16 up to about age 35, after which 
point field decreases progressively with ad- 
vancing age. 

(6) For both sexes, nasal fields and (there- 
fore) eye fields increase up to a maximum 


TABLE 1 


SUMMARY OF MEAN VISUAL FIELD SCORES BY AGE AND SEX 


























Left nasal Right nasal Left tem- Right tem- Left-eye Right-eye P 3 
field field poral field poral field visual field visual field more visual ned 
(in degrees) | (in degrees) | (in degrees) | (in degrees) (in degrees) (in degrees) IuAGeRLaes 

Age Sex 
N Xs N s N 2 N xX N xX N xX N x SD 
16-19 M 1181 | 54.9 1181 | 55.1 1181 | 87.5 1181 | 87.0 1181 | 142.4 1181 | 142.1 1182 | 174.5 9.1 
F 732 | 58.3 132A SSeS: 732 | 88.7 732 | 87.8 732 | 147.0 732 | 146.1 732 | 176.4 5.5 
20-24 M 1380 | 56.2 1381 | 56.0 1380 | 87.8 1381 | 87.3 1380 | 144.0 1381 | 143.3 1379 | 175.2 7.7 
F 775 | 58.5 776 | 58.9 775 | 88.4 776 | 88.0 775 | 146.9 776 | 146.9 776 | 176.1 8.8 
25-29 M 1159 | 56.5 1159 | 56.8 1159 | 87.6 1159 | 87.2 1159 | 144.1 1159 | 144.0 1158 | 174.9 io 
F 664 | 58.7 664 | 59.0 664 | 88.3 664 | 87.7 664 | 147.0 664 | 146.7 665 | 176.0 9.0 
30-34 M 1118 | 57.2 L017 57.3 1118 | 87.2 1117 | 87.3 1118 | 144.4 1117 | 144.6 1116 | 174.6 lah 
F 629 | 59.3 629 | 59.5 629 | 88.0 629 | 87.7 629 | 147.3 629 | 147.2 631 | 175.6 eee 
35-39 M 1118 | 56.7 1114 | 57.2 1118 | 86.9 1114 | 86.4 1118 | 143.6 1114 | 143.6 1114 | 173.4 9.8 
F 691 | 59.5 692 | 60.0 691 | 87.9 692 | 87.4 691 | 147.4 692 | 147.4 690 | 175.1 6.7 
A0-44 M 1121 | 56.6 1120 | 56.9 1121 | 86.2 1120 | 86.0 1121 | 142.8 1120 | 142.9 1119] 172.5 9.2 
F 718 | 59.2 717 | 59.7 718 | 86.7 717 | 86.6 718 | 145.9 717 | 146.3 i aac 7.6 
45-49 M 937 | 56.5 937 | 56.9 937 | 85.6 937 | 85.5 937 | 142.1 937 | 142.4 935 | 171.2 8.6 
F 640 | 58.1 640 | 58.6 640 | 85.6 640 | 86.0 640 | 143.7 640 | 144.6 640 | 171.8} 10.0 
50-54 M 852 | 56.6 852 | 56.0 852 | 84.2 852 | 84.3 852 | 140.8 852 | 140.3 849 | 168.4 | 11.5 
F Soles 576 | 57.7 577 | 85.9 576} 85.5 577 | 143.2 576 | 143.2 SAOn| Utes 9.7 
55-59 M 633 | 56.2 629 | 55.9 633 | 82.4 629 | 83.5 633 | 138.6 629 | 139.4 629 | 165.9 | 13.4 
F 374 | 57.0 375 | 57.4 374 | 84.3 375 | 83.9 374 | 141.3 SSeS: 374 | 168.0} 10.3 
60-64 M 488 | 54.6 488 | 54.5 488 | 80.4 488 | 80.9 488 | 135.0 488 | 135.4 487 | 161.5 | 16.9 
F 265 | 56.3 265 | 56.4 265 | 81.7 265 | 82.5 265 | 138.0 265 | 138.9 265 | 163.7 | 14.4 
65-692 M 883 | 52.8 883 | 53.1 883 | 75.3 883 | 77.6 883 | 128.1 883 | 130.7 401 | 158.0 | 16.5 
F SOM SSL 350 | 57.9 350 | 78.1 350 | 81.2 350 | 136.2 350 | 139.1 205 | 160.6 | 12.5 
70-74 M 266 | 152.8 | 16.3 
F 95 | 157.6} 13.8 
75-79 M 136 | 147.7 | 21.2 
F 36 | 150.0 | 20.1 
80 -+over M 76 | 139.5 | 21.4 
F LS) LSS-59 2255: 
All M 10870 | 55.6 | 10861 | 55.7 | 10870 | 85.3 | 10861 | 85.2 | 10870 | 140.9 | 10861 | 140.9 | 10849 | 170.8 | 12.4 
F 6415 | 57.8 6416 | 58.1 6415 | 86.6 6416 | 86.2 6415 | 144.4] 6416] 144.3) 6409 | 172.9} 10.3 
All Both | 17285 | 56.4 | 17277 | 56.6 | 17285 | 85.8 | 17277 | 85.6 | 17285 | 142.2 | 17277 | 142.2 | 17249 | 171.6] 11.7 
































® Values given are for ages 65 and over for all measures except fofal visual field. 
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occurring about age 35-40, followed there- 
after by progressive decreases with increasing 
age, with the exception of an increase for fe- 
males age 65 and over. 

(c) For both sexes, right-temporal and 
right-eye visual fields are smaller than their 
left counterparts for the younger age groups, 
while the reverse is true for the older age 
groups. This effect is not exhibited for the 
nasal fields. 


2. Field versus Sex: 


(a) With the exception of the (relatively 
small) over-80 age group, females consistently 
demonstrate larger nasal and temporal visual 
fields (and, consequently, larger eye fields 
and total fields) than do men. The greatest 
difference is evident in the nasal fields. 

(6) For both nasal and eye fields, females 
achieve their maximum values between the 
ages of 35 and 40, while for males this maxi- 
mum occurs between ages 30 and 35. 

An additional finding is that the product- 
moment correlation between total visual field 
and age is slightly but significantly less for 
females (—.420) than it is for males (—.462). 


DISCUSSION * 
Field versus Age 


The results of the present study, showing 
a progressive decline in visual field with in- 
creasing age, are consistent with most previ- 
ous research studies of a similar nature. How- 
ever, the interpretations of findings of this 
type have not been consistent. As Weale 
(1963) points out, one reason for this lack of 
consistency is the fact that senile degeneration 
of the visual mechanism takes several forms, 
some of which may intrude upon visual-field 
measurements and consequently lead to mis- 
taken conclusions as to causal relationships. 
For example, the assumption usually made 
states that visual field constricts with age due 
to peripheral degeneration of the retina. How- 
ever, it is well known that sensitivity to light 
diminishes with increasing age, possibly due 
to the gradual yellowing of the crystalline 
lens and/or because of reduction in pupil 


*The assistance of Frank A. Brazelton (Optome- 
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diameter (senile miosis), and that visual 
sensitivity decreases as the peripheral retina 
is approached. As a consequence, visual-field 
measurements using perimeters whose arcs 
are evenly illuminated at a fixed intensity, 
Weale continues, may show decreasing field 
with increasing age because of decreased light 
sensitivity and not necessarily because of a 
true peripheral degeneration of the retina. 
Weale concludes that if stimulus intensity 
were increased to compensate for these effects, 
restrictions in visual field would be shown to 
be caused primarily by physical changes in 
the ocular system, rather than by retinal 
decay. 

Regardless of the validity of Weale’s view- 
point, from a practical standpoint the prob- 
lem he raises is not particularly germane to 
the present study. The research herein de- 
scribed is concerned with investigating the 
functional limits of visual field by means of a 
measurement technique that is of sufficient 
accuracy and ease of administration to permit 
its use as a high-volume testing device (e.g., 
for driver-license applicants). Thus, the em- 
phasis is on rapid visual screening rather than 
on detailed clinical evaluation. Practically 
speaking, if measurement conditions are iden- 
tical for all Ss, the fact that visual field 
constricts with increasing age is of importance 
regardless of the physiological reasons under- 
lying this change. In the performance of a 
task such as driving, in which visual-field re- 
strictions do appear to act as a handicap 
(Burg, 1967b), establishing the fact of these 
restrictions is a proper and important function 
of the licensing agency, while etiological diag- 
nosis and possible remedial measures (if any) 
are the responsibility of the clinician. 

One additional point needs clarification. 
The visual-field values derived in the present 
study are of importance primarily as indica- 
tions of relative size as a function of age and 
sex. It is not intended that the obtained mean 
scores be considered completely accurate in 
any absolute sense, for at least two reasons: 


1. The screening test used provides scores 
that may underestimate the true values, be- 
cause the targets are spaced at 5° intervals. 
From a statistical standpoint, a closer ap- 
proximation of the true field values could be 
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obtained by adding 2.5° to each mean nasal- 
or temporal-field score, and 5.0° to each mean 
eye-field or total-field score. 

2. The target illumination used was some- 
what lower than is commonly found in peri- 
metric investigations and, therefore, may 
possibly have slightly penalized the older S 
because of the light-sensitivity decline men- 
tioned earlier. If such a penalty was in fact 
imposed, however, in all likelihood its magni- 
tude was minor, because of the substantial 
size and reflectance of the white perimeter 
targets used. 


In contrast to the temporal fields, the nasal 
fields initially increase in size before com- 
mencing their progressive decline with age. 
There is no obvious explanation for this find- 
ing; conceivably, it may be due to a physiog- 
nomic change in eye-nose relationships oc- 
curring up to ages 35-40. If such is the case, 
this change reverses direction in the older age 
groups, where gradual recession of the eye 
(relative enophthalmos) causes the nose in- 
creasingly to intrude upon the nasal field 
(Weale, 1963). Both nasal and temporal 
fields show marked declines after ages 35—40; 
if anything, however, the temporal fields ex- 
hibit a greater decline (both absolute and 
relative) than do the nasal fields. 

Similarly, at present no readily acceptable 
explanation can be offered for the reversal in 
relative size that seems to occur, with age, be- 
tween the right temporal and monocular fields, 
and the corresponding fields for the left eye. 
As yet unpublished monocular visual acuity 
data for these Ss do not exhibit correspond- 
ing changes with age. As is possible with any 
research study, this finding (as well as other 
results) may be the consequence of an arti- 
fact in the data; however, in view of the 
large sample size and the extreme care taken 
to maintain standardized experimental condi- 
tions, the likelihood of experimental artifact 
providing a reasonable explanation for this 
outcome appears quite small. 


Field versus Sex 


No obvious explanations can be offered 
either for the slight but significant superior- 


ity in visual-field size demonstrated by fe- 
males or for the fact that nasal and (conse- 
quently) eye fields peak at a later age for 
females than for males. The fact that visual 
field correlates with age slightly less for fe- 
males than for males can be explained, how- 
ever, by the smaller sample size for females, 
which results in a narrower range of scores 
(see the standard deviations presented in Ta- 
ble 1) and, consequently, a statistical limita- 
tion on the size of the correlation obtainable. 

In summary, it may be said that several of 
the study findings pose interesting questions 
of interpretation. Additional research obvi- 
ously is necessary if reasonable answers to 
these questions are to be found. For example, 
it would be desirable to obtain field measure- 
ments on a large number of Ss with higher 
levels of target illumination, physiognomic 
evaluation, pupil-size measurements, and so 
on. Data collected in the present study are 
currently being subjected to more detailed 
analysis to determine whether, despite the 
precautions taken, those Ss who were tested 
wearing glasses (some 32% of the total) 
were in fact experiencing restrictions in their 
field while being measured that could in some 
way account for some of the findings. Other 
analyses in the planning stages include the 
effects of smoking and fatigue (lack of sleep) 
upon visual performance; however the likeli- 
hood is small that either of these analyses 
will provide satisfactory explanations for any 
of the findings described above. 
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367 adults were given short passages to study for limited periods of time. 
They then had to answer open-ended questions on the content. A separate- 
groups design with groups of about 50 adults and a Greco-Latin-square design 
using 96 adults appeared to have about equal potential power statistically, but 
the differences between comparable conditions were almost twice as large 
with separate groups. Text typed in pica combined upper- and lower-case 
letters was comprehended on the average about 13% more rapidly (p < .02) 
than text in pica or Siemens all upper-case letters. There was little difference 
(p > .05) between the Siemens all upper case, a dyeline copy of the Siemens, 
and pica or elite all upper case; and little difference also (p> .05) between 
pica and elite all lower case, which gave intermediate rates of comprehension. 
The lettering of the question sheets did not affect the results. 


Telex and telegram services employ only 
26 letter codes, which are printed out by the 
teleprinter at the receiving end in capitals or 
upper-case letters. Transmitting messages 
coded in combined upper- and lower-case 
letters would be more expensive, since it 
would require an extra binary unit of informa- 
tion to distinguish between the upper and 
lower case. However, a printout of combined 
upper- and lower-case letters might be easier 
for the receiver to read. This was one of the 
questions investigated in the two experiments 
to be reported. 

Curiously enough, very little experimental 
work has been published comparing the type- 
faces of teleprinters and typewriters, although 
large amounts of material are being produced 
in this way and read every day. The only 
experiments appear to have been by Fox 
(1963). He compared material typed in 
standard elite with the same material typed 
in the smaller-sized capitals of Gothic elite. 
Gothic elite has two sizes of capitals instead 
of upper- and lower-case letters. If the smaller 
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size is used for text, and the larger size is 
reserved exclusively for headings, use of the 
typewriter shift key is virtually eliminated. 
This makes typing easier and hence less ex- 
pensive. In comparing passages of prose typed 
in the two styles, Fox used as his criterion 
the speed of reading without measuring com- 
prehension. He found that text typed in small 
capitals was read 18% more slowly than the 
standard elite text, but only when read first. 
On changing to the other style the readers 
did not change their reading speed appreci- 
ably, presumably because they knew that 
their level of comprehension was not being 
measured. Similar transfer effects have been 
reported in other situations (Poulton & Free- 
man, 1966). Clearly, more work is required 
in order to assess the size of the likely gain 
to the reader, if any, of converting teleprinters 
from all capital letters to combined upper- and 
lower-case letters. 

The classical experiment of Tinker and 
Paterson (1928) suggests that text typed in 
combined upper- and lower-case letters ought 
to be easier for the reader to comprehend 
than text typed all in capitals. These authors 
used the Chapman-Cook speed of reading 
test printed in 10-point “old style” type, 
presumably without leading. This method 
also measures speed of reading, but demands 
a certain level of comprehension since in- 
congruous words in the passage have to be 
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crossed out. The all-capitals print was found 
to take 13.4% longer to read. The only other 
experiment giving statistically reliable re- 
sults which is in any way relevant is by 
Paterson and Tinker (1932). They found 
text typed in American Typewriter elite type- 
face slower to read than text printed in any 
of eight printers’ typefaces in common use. 

A second aim of the experiments was to 
compare text typed all in capitals with text 
typed all in lower-case letters. Words in lower 
case have typical shapes as a result of the 
ascenders and descenders (Tinker, 1963, 
p. 60). The shapes might be expected to help 
a reader even in the absence of capital letters 
at the start of sentences and for proper names. 
Apart from tradition, the only reason for 
using all capital letters rather than all lower- 
case letters for teleprinters is that capitals 
are said to perform better when teleprinters 
are left for prolonged periods without main- 
tenance. However, reasonably large, well-de- 
signed, lower-case letters should be equally 
good in this respect, and might be quicker and 
easier to read. This point also seemed worth 
investigating. 

A third aim was to compare two different 
experimental designs, to see which was the 
more powerful statistically. A separate-groups 
design avoids transfer effects between experi- 
mental conditions like those found by Fox 
(1963). But its power is reduced by the 
large sizes of the individual differences if the 
samples of individuals are at all representa- 
tive (Poulton, 1965a). In contrast, a Greco- 
Latin-square design produces unmeasurable 
transfer effects, which may reduce the sizes 
oi the differences between conditions (Poul- 
ton & Freeman, 1966), but large differences 
between individuals can be eliminated from 
the statistical comparisons of conditions. 
Clearly it is useful to know which design is 
the more powerful, in order to increase the 
sensitivity of experimental comparisons in this 
research area (Poulton, 1965b). 

A fourth aim was to see whether the style 
of the question sheets affected the scores for 
comprehension. In experiments comparing all 
capitals with mixed upper- and lower-case 
newspaper headlines, it was found to make no 
difference to the experimental comparisons 
whether the question sheets were typed in 


I.5.M. 72 PICA ALL UPPER CASE 


I.B.M.72 Pica combined Upper and Lower case 


i.b.m.72 pica all lower case 

smith-corona pica 1 all lower case 
Smith-Corona Elite 66 combined Upper and Lower case. 
smith-corona elite 66 all lower case. 

SMITH CORONA ELITE 66 ALL UPPER CASE. 

SIEMENS ALL UPPER CASE 


Olivetti Lexikon 80 Elite combined Upper and 


Lower case (for stencilled question sheets only) 


Fic. 1. Carbon or dyeline copies of text typed 
in the top eight styles were compared. (The bottom 
style was used for the stenciled question sheets only.) 


upper-case or mixed upper- and lower-case 
letters (Poulton, 1967). However in a pilot 
experiment (not previously reported) practice 
pieces of newspaper with all the headlines in 
capitals were found to favor reliably (p< .01) 
test newspaper sheets with all the headlines in 
capitals. This suggests that although there 
may be transfer effects between the styles in 
which the text is set, the style of type used 
for the question sheets may make no differ- 
ence. If so, it could save a lot of effort, espe- 
cially with Greco-Latin-square designs, since 
all the question sheets do not then have to 
be produced in all the styles of type to be 
compared. 

The separate-groups experiment compared 
the existing Siemens teleprinter typeface, 
which consists of relatively large capitals 
without serifs (Figure 1), with possible alter- 
native typefaces. To represent the output of 
a more complex teleprinter system with both 
upper- and lower-case letters, the pica type- 
face of the IBM 72 electric typewriter was 
chosen. To represent the output of a simpler 
existing system, but with all lower-case letters, 
the lower-case pica letters of the IBM 72 
electric typewriter were used (see Table 1). 
IBM 72 pica capitals were included as a pos- 
sible straight alternative to the Siemens 
capitals. Figure 1 shows that they are serifed, 
as compared with the Siemens capitals which 
do not have serifs, but they are not quite so 
tall. Here the stencils for the question sheets 
were typed like the text in IBM 72 pica 
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capitals, to see whether this helped the reader. 
A separate group was also run which read the 
passages in Siemens capitals, and had dyelined 
question sheets also in Siemens capitals. For 
the remaining groups the stencils of the ques- 
tion sheets were typed in the standard Olivetti 
Lexikon 80 elite combined upper- and lower- 
case letters used by the laboratory typing 
pool. 

The Greco-Latin-square experiment com- 
pared the Siemens teleprinter typeface with 
a dyeline copy of the teleprinter output, and 
with capitals from an elite 66 Smith-Corona 
electric typewriter (see Figure 2). Two sets of 
lower-case letters were also compared, pica 1 
and elite 66 from Smith-Corona electric type- 
writers. Combined upper and lower case from 
the Smith-Corona elite 66 electric typewriter 
was included as a sixth condition to provide 
a base line. The stencils for all the question 
sheets were typed in the standard Olivetti 
Lexikon 80 elite combined upper and lower 
case. 


METHOD 


This followed that of previous experiments (Poul- 
ton, 1965a). Simple test passages of about 450 words 
on widespread infectious diseases were typed in the 
styles shown in Tables 1 and 2 and Figure 1. The 
same standard teleprinter layout was used through- 
out: an average line of about 6.5 in. long, triple 
spacing between lines, no indentations at the begin- 
nings of paragraphs, and about 20 lines to a page. 
All typing was on four-ply teleprinter paper rolls 
which have carbon paper interleaved. The first 
carbon copy on pink paper, the second carbon 
copy on yellow paper, and occasionally the bottom 
copy on green paper, were used in the experiment, 
each person having the same color throughout. The 


TABLE 1 


RESULTS OF SEPARATE-GROUPS EXPERIMENT 











Average | Average 

B ? No. compre- rate of 

Style of type readers hension reading 

%) (wpm) 

IBM 72 pica combined 

upper and lower case® 47 55.1¢ 213 
IBM 72 pica all lower case 49 51.9 254 
Siemens all upper case* 50 49.3 258 
Siemens all upper case> De 48.6 265 
IBM 72 pica all upper case> 73 46.9 262 





8 The question sheets were in the standard Olivetti Lexikon 
80 elite upper- and lower-case letters. 

b The question sheets had the same style of type as the test 
passages. 

¢ Pica combined upper and lower case was reliably better 
than the three all-upper-case conditions (p < .02). 


TABLE 2 


RESULTS OF GRECO-LATIN-SQUARE EXPERIMENT 








| Average com- 





Style of type | prehension 

| (Ze) 

Smith-Corona elite 66 combined | 
upper and lower case | 62.9 
Smith-Corona pica 1 all lower case 60.5 
Smith-Corona elite 66 all lower case 60.3 
Siemens all upper case 59.2 
Smith-Corona elite 66 all upper case 58.7 
Dyeline Siemens all upper case 58.5 


a The three upper-case styles were reliably worse than the 
three remaining styles (p < .05). 


dyeline copies of the Siemens were made from the 
top sheet which consisted of tracing paper. 

There were 10 open-ended questions on each 
passage to test for comprehension. Normally the 
questions were typed in upper- and _ lower-case 
Olivetti Lexikon 80 elite letters on a wax stencil, 
and stenciled copies run off. But for the two ex- 
perimental conditions indicated by Footnote b of 
Table 1, the lettering of the question sheets cor- 
responded to the lettering of the test passages. The 
question sheets for the IBM 72 Pica all-upper-case 
passages were stenciled copies. The question sheets 
for the Siemens all-upper-case passages were dyeline 
copies. The questions were spread evenly over the 
text. There were also passages of about 2,000 words 
typed in the styles of Table 2. These were used in 
the Greco-Latin-square experiment to familiarize the 
readers with the styles before they were given the 
test passages. The Ss were tested in groups of about 
a dozen. 

Separate-groups experiment. Each experimental S 
was allocated to one of the styles listed in Table 1. 
All the passages were typed in all the styles and 
were read in the same order by all Ss. After the 
purpose of the experiment had been explained to 
them, Ss were handed two passages each of about 
450 words, comparable to those upon which they 
were to be tested, and told to look them over for 
3 min. This was followed by a practice passage 
which had to be studied for only 75 sec. When time 
was up, each S had to mark with a pencil the point 
in the passage up to which he had read, and then 
to answer 10 practice questions on the content of 
the passage. Unlimited time was available for 
answering, but Ss seldom took longer than 4 min. 
Finally a test passage was read under conditions 
identical to the practice passage, and 10 test questions 
on it were answered. The whole experiment took 
about 25 min. The 18 Ss who answered correctly 1 
or less of the 10 test questions were discarded. 

Greco-latin-square experiment. The 96 Ss read 
passages typed in each of the six styles listed in 
Table 2. The six passages, each of which was typed 
in the six styles, and the six orders of passage- 
style combinations were arranged in a Greco-Latin 
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square. Before each test passage the S spent 5 min. 
looking at a passage of about 2,000 words typed in 
the same style as the test passage. Ninety seconds 
were allowed for studying each test passage, and a 
further 4 min. for answering the 10 questions on it. 
There was a break of about 15 min. between the 
first three and the last three tests. The whole experi- 
ment took just under 2 hr. to administer. Three Ss 
who failed to follow the correct procedure had to be 
replaced by substitutes. 

Experimental subjects. Of the 271 Ss in the separate- 
groups experiment, half were men, and of these 
about one-third were young, enlisted, naval ratings. 
Overall the ages ranged from 17 to 82 yr. Each of 
the five experimental groups had about equal propor- 
tions of each kind of S. Only 8 of the 96 Ss in 
the Greco-Latin-square experiment were men. They 
were allocated randomly to the different experi- 
mental conditions. Here the ages ranged from 21 
to 59 yr. 


RESULTS 


Separate groups. The results of the sepa- 
‘rate-groups experiment are given in Table 1. 
The upper of the two Siemens groups had 
stenciled question sheets typed in the standard 
elite combined upper- and lower-case letters. 
If the lettering of the question sheets affected 
the score for comprehension, this is the group 
most likely to show the effect. For the elite 
combined upper and lower case of the question 
sheets is more different from the san-serif 
Siemens all upper-case text than from any of 
the other styles of text in the table. The lower 
of the two Siemens groups in the table 
(with the superscript b) had dyeline copies 
of the question sheets produced from the 
Siemens teleprinter. Here the lettering of the 
text and question sheets was in the same 
style. Comparing the two groups, it will be 
seen that the lettering of the question sheets 
made little difference (p > .05). If anything, 
the Siemens text with the dyeline Siemens 
question sheets was comprehended slightly 
less well than the Siemens text with the 
stenciled upper- and lower-case question 
sheets, 48.6% compared to 49.3%. 

The pica all-upper-case passages and their 
stenciled question sheets were made on the 
same pica electric typewriter in upper-case 
letters. This may have been even less of a 
change for the reader than switching from 
Siemens text to dyeline Siemens questions. 
The table shows that the pica all-upper-case 
text was comprehended if anything slightly 


less well than the Siemens text; the average 
score for comprehension being 46.9%. Thus 
in comparing all capitals with combined upper 
and lower case, the lettering of the question 
sheets can be neglected. 

None of the three all-upper-case versions 
of the text differed reliably (p > .05), so 
their results can be pooled. Analysis of vari- 
ance showed that the level of comprehension 
of the group reading the pica combined upper- 
and lower-case text was reliably above the 
average level of the three groups reading the 
text typed all in capitals (p < .02). Table 1 
shows that the difference in average compre- 
hension between the combined upper- and 
lower-case text and the various all-upper-case 
texts ranged from 5.8 to 8.2%, which become 
10.5 to 14.9% when expressed as percentages 
of the average comprehension of the combined 
upper- and lower-case text. The level of com- 
prehension of the all-lower-case text lay inter- 
mediately and was not reliably different from 
either. The last column of Table 1 shows that 
the average rate of reading was a little faster 
for the combined upper- and lower-case text, 
but here none of the differences was reliable. 

Greco-latin square. The results of this ex- 
periment are given in Table 2. Analysis of 
variance showed that the average level of 
comprehension of the text typed in the three 
upper-case styles was reliably below the 
average level of the text typed in the three 
remaining styles (p< .05). Here Table 2 
shows that the difference in average compre- 
hension between the combined upper- and 
lower-case text and the all-upper-case texts 
ranged from 3.7 to 4.4%, which become 5.9 
to 7.0% when expressed as percentages of 
the average comprehension of the combined 
upper- and lower-case text. The all-lower-case 
text again lay intermediately. 


DIscUSSION 


The rank orders of the results for com- 
prehension in Tables 1 and 2 are similar. The 
combined upper- and lower-case text came off 
best, whether it was typed in pica or elite. 
The all-upper-case text came off worst, 
whether the letters were pica or elite and 
hence serifed, or Siemens or dyeline Siemens 
without serifs. The all-lower-case text fell 
between, whether typed in pica or elite. There 


20 E. C. PouLtton AND C. HELEN BROWN 


was little difference between text typed in 
the various styles of all-upper-case letters. 
The pica and elite all-lower-case texts were 
also about equal. The differences between the 
combined upper- and lower-case texts and 
the all-upper-case texts are in line with the 
results of Tinker and Paterson (1928) re- 
ferred to in the introduction for passages 
printed in 10-point “old style.” 

These results for comprehension suggest 
that if teleprinters are to have new typefaces, 
not a great deal is to be gained by changing 
from the existing all upper-case Siemens 
letters to the pica or elite all-lower-case letters 
used in this experiment, as long as the ma- 
terial has to be read and understood. The 
greatest gain from the point of view of the 
reader, who has got to understand and take in 
the information presented, would be a change 
to combined upper- and lower-case letters. 
This unfortunately demands a rather more 
elaborate form of coding for automatic trans- 
mission, and hence some additional expense. 

Comparing the results of the two different 
experimental designs, the means for compre- 
hension are rather higher in Table 2 because 
90 sec. were allowed for reading each passage 
instead of 75 sec., and because Ss had be- 
come more practiced at reading and answer- 
ing the questions by the time they had done 
so six times over, as required by the Greco- 
Latin-square design. The differences between 
the means of the combined upper and lower 
case and the all upper case in Table 2, 3.7— 
4.4, are rather smaller than the corresponding 
differences of 5.8-8.2 in Table 1. This is un- 
likely to have been due to the use of pica 
combined upper and lower case in Table 1, 
and of elite in Table 2. For the pica and elite 
all lower case gave almost identical results in 
Table 2. And the small difference between 
pica all upper case and Siemens in Table 1 
in favor of Siemens, corresponds to the small 
difference between elite all upper case and 
Siemens in Table 2, again in favor of Siemens. 
Thus the pica and elite typefaces appear to be 
about equally effective. The larger differences 
between conditions in the separate-groups 
experiment of Table 1, compared with the 
Greco-Latin-square experiment of Table 2, 
could have resulted from transfer effects be- 
tween conditions in the Greco-Latin-square 


experiment reducing the average sizes of the 
differences (Poulton & Freeman, 1966). 

However, the potential power of the two 
alternative experimental designs to discrimi- 
nate statistically between combined upper and 
lower case and all upper case is probably of 
about the same order. The Greco-Latin-square 
experiment did not make full use of the po- 
tential power of a factorial design. Its original 
purpose was to make comparisons between 
three upper-case alternatives, and between two 
all-lower-case alternatives. A single com- 
bined upper- and lower-case condition was 
included simply as a base line. If there had 
been two combined upper- and lower-case 
conditions, and only two all-upper-case con- 
ditions, it would have been possible to have 
tested directly between three pairs of condi- 
tions. A level of confidence at around p < .02 
might then have been achieved, as in the 
separate-groups experiment. It took 96 readers 
for the Greco-Latin square, compared with 
five groups each of about half the size (a 
total of 271 readers) in the separate-groups 
experiment. Increasing the numbers of readers 
much further becomes a relatively inefficient 
method of increasing statistical power. It is 
probably more effective to stratify readers ac- 
cording to reading ability, and to use sub- 
groups of equal size drawn from each of a 
number of ability ranges. This is the tech- 
nique used by Zachrisson (1965, Experiment 
4) on children starting to read. It would seem 
to offer the best hope for increasing statistical 
power without sacrificing the representative- 
ness of the readers. 

The relatively small and nonsignificant dif- 
ferences in rates of reading in Table 1 are 
contrary to the results of Fox (1963) quoted 
in the introduction. This is probably because 
our readers knew that they were to be tested 
for comprehension, while Fox’s readers knew 
that they were not to be tested. 
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ORGANIZATIONAL CORRELATES OF EXECUTIVE ROLES? 
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AND JOHN M. HOGAN ? 


New York University 


Relationships between each of 9 executive role dimensions and various features 
of the organizational setting were studied. The sample consisted of 194 
middle-management personnel of the U. S. Army. The roles, as described on 
questionnaires by the executives themselves, were found to vary with differences 
in the work setting. Organizational features most prominently associated with 
variations in role included the mission and level of the organization, the job 
family of the executive, and his span of control. The role dimensions which 
were most prone to vary with the situation were staffing, controlling, and 
time spent with others. Executives whose roles emphasized administrative 
more than technical features received higher average performance ratings 


from their superiors. 


It is a well-known fact of social psychology 
that the behavior of members of a social 
system is shaped by characteristics of the 
system as well as of the individuals. The con- 
cept frequently invoked to link the system 
and the individual is that of role, that is, a 
set of activities and responsibilities associated 
with a particular position or job. 

The notion that a given job role may vary 
in different organizations has important im- 
plications for personnel management. Among 
them is the prospect that the personnel re- 
quirements for a job are likely to depend 
on contextual considerations. Failure to pay 
due attention to situational factors could 
therefore result in defective personnel pro- 
cedures, such as invalid selection techniques, 
ineffective training programs, or inappropriate 
supervision. 

While the applicability of these concepts 
to executive personnel management is gen- 
eral recognized, they have received relatively 
little systematic or detailed attention. It would 
seem important not only to know in what 
ways executive roles may differ among organ- 
izational settings, but also whether there are 
stable relationships between given role dimen- 
sions and specific organizational variables such 


1 This research was conducted by contract to the 
Department of the Army under sponsorship of the 
United States Army Personnel Research Office. 

2 The opinions expressed in this paper are those 
of the authors and do not necessarily reflect official 
Department of the Army policy. 
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as size or span of control. The present paper 
reports research on this topic, undertaken 
as an early stage of a program of improved 
personnel management of civilian executives 
in the Department of the Army. 

More specifically, the study sought to 
determine whether differences in the ways in 
which executives perceive their roles are as- 
sociated with differences in variables char- 
acterizing their organizational settings. As 
subsidiary objectives, the study also sought 
evidence of interaction between organizational 
and personal variables in their relations with 
these role dimensions, and sought data bearing 
on the relationships between role behaviors 
and executive effectiveness. 


MeETHOD 
Subjects 


The data were collected on 194 male civilian ex- 
ecutives of the Department of the Army. With the 
exception of seven who were in General Service 
(GS) Grades 16 and 17, they were classified in 
GS Grades 13 through 15, roughly equivalent to 
“middle management.” To assure a sample broadly 
representative of such positions, Ss were selected 
from six Army organizations with different missions, 
levels, sizes, and geographical locations. The Ss’ 
mean age was 46.7 yr. (o=8.1), and had a mean 
supervisory experience of 15.2 yr. (¢ = 8.6). The jobs 
covered a variety of titles in such fields as finance, 
research and development, technical specialties, and 
personnel management; relatively few could be con- 
sidered typically “line management,” since such 
positions are usually occupied by military officers. 
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Measures of Job Behavior 


This paper is limited to the study of job be- 
haviors and responsibilities as perceived and reported 
by Ss themselves, using the following questionnaires: 


(a) Executive Position Description: an adaptation of 
85 items from the Executive Position Description: 
Form IV developed by Hemphill (1960). Respond- 
ents indicated on a 4-point scale the extent to which 
an activity or responsibility described by an item 
was a part of their position. The following items are 
typical: 


“Schedule work so that 
steadily.” 
“Reduction of costs.” 


it flows evenly and 


(b) Work Analysis Form: an adaptation of one 
by the same name described by Stogdill and Shartle 
(1955). The respondent estimated the percentage of 
time which he spends in contact with others and 
in individual effort. He then reported the detailed 
breakdown of the time spent in these two major 
categories of duties, and also the time spent on 
duties included in a list of major responsibilities. 
The following items are typical of each of the 
categories, respectively: 


“Attending committee meetings and conferences.” 
“Preparing and writing reports, orders, etc.” 
“Professional consultation—giving technical advice.” 


(c) Performance Style Questionnaire: 40 items 
listing the work patterns, habits, and interpersonal 
relationships a person may emphasize in his job 
following the approach described by Barrett (1961). 
The respondent rated the items in a forced-normal 
distribution according to the extent to which they 
characterize his job behavior. Typical items are: 


“Stays out of arguments.” 
“Gives attention to matters immediately as they 
arise.” 


Measures of Organizational Characteristics 


Information about the organization as a whole, 
such as the number of personnel or type of mission, 
was obtained by the researchers from records and 
interviews with top-level executives. Descriptions 
of certain organizational variables immediate to 
each S’s job, such as his GS level, were obtained 
from him by use of a background information sheet. 
A third type of contextual information concerned 
the supervisory climate in which each S$ works. 
Some of this was obtained by questionnaire from 
the supervisor himself, including such items as his 
age, education, amount of supervisory experience, 
etc. To provide data on the supervisor’s behavior, 
each S filled out the Leader Behavior Description 
Questionnaire (Stogdill & Coons, 1957) describing 
his perceptions of the extent to which his supervisor 
manifests consideration and initiation of structure. 


Other Measures 


One of the subsidiary objectives of the study was 
to see whether personal characteristics interact with 
organizational characteristics in their relations with 
job behavior. For this purpose, data on certain 
personal variables, such as age and education, were 
obtained on the background information sheet filled 
out by each S. Another subsidiary objective con- 
cerned possible relationships between executives’ job 
behavior and ratings of their effectiveness. For the 
latter purpose, a simple three-scale rating form was 
filled out by the immediate supervisor of each S. 
One of the scales concerned a comparison of the job 
performance of S with others at his level, a second 
entailed a comparison of his performance with 
standards, and a third asked for the supervisor’s 
willingness to hire the man if he had his choice. 
This instrument was used in another study of ex- 
ecutives by one of the authors, in which the sum of 
the three scales was found to have an interrater 
reliability of .77. 


Data Analysis 


In order to condense the descriptions of job be- 
haviors and responsibilities into a limited number of 
meaningful role dimensions, a centroid factor analysis 
was performed on the three questionnaires describing 
job activities. Excluded were those items in the 
Executive Position Description where more than 
50% of the respondents reported that the activity 
or responsibility was not relevant to their position. 
This yielded a total of 90 items which entered into 
the intercorrelation matrix. Eight orthogonal fac- 
tors emerged: 


1. Long-Range Planning: planning for future 
activity, including forecasting trends or events, con- 
ducting pilot projects, being concerned with legisla- 
tion which might affect the organization, and estab- 
lishing long-range objectives for the organization. 

2. Staffing: personnel activity characterized by 
involvement in selection, placement, training, and 
advancement of employees. 

3. Technical Consultation: gathering, consolidating, 
interpreting, and providing others with facts and 
information. 

4. Budgeting: preparation and defense of budgets, 
and accounting for expenditures. 

5. Shared versus Individual Responsibility: the 
shared pole being characterized by emphasis on 
keeping others informed of present states of affairs, 
whereas the individual pole emphasizes personal 
responsibility for taking action. 

6. Operational versus Professional Concerns: the 
operational pole being characterized by concern for 
the efficiency of present organizational operations and 
personal professional development, whereas the pro- 
fessional pole is oriented toward advising others on 
technical matters. 

7. Technical versus Administrative Activity: the 
technical pole being characterized by the use of pro- 
fessional tools and techniques, including writing 
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TABLE 1 
RoLe Dimensions AS FUNCTIONS OF ORGANIZATIONAL VARIABLES 
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gineering 


Construction & equipment en- 
Technical specialties 


Research & development 
Finance & data processing 


Type of executive job 
Personnel 





technical reports, whereas the administrative pole 
involves coordination of activity, communication, 
and recommendations. 

8. Controlling: relating to formulation of limited 
plans or specifications in regard to resources and 
finances of the organization, including reducing costs. 


Only a few of the items from the Performance 
Style Questionnaire were found to have appreciable 
loadings on any of these factors, so for reasons of 
efficiency it was decided to exclude it from the 
measurement of job behavior. Scores on the eight 
factors were computed for each S on the basis of 
specification equations, each including between 5 
and 10 items from the other two questionnaires. 
The percentage of time on the job that the in- 
cumbent reported spending in contact with other 
individuals was considered as an additional role 
variable, because of its intrinsic interest. Thus, a 
total of nine variables constituted executive “job 
behavior” or “roles” as discussed below. 

In passing, it may be of interest to compare the 8 
factors described above with the 10 which Hemphill 
(1960) identified in his study of executives. Long- 
Range Planning is essentially duplicated in both 
studies, and there is some similarity between our 
Controlling factor and his of Business Control. The 
other factors are not parallel in the two studies, 
perhaps because of differences in the jobs, in Ss, in 
the settings, or some combination of such para- 
meters. 

Analysis of variance was used to determine 
whether each of the nine role dimensions was differ- 
entially associated with organizational character- 
istics. In instances where organizational variables 
yielded continuous distributions, their association 
with the nine role dimensions was determined in- 
stead by product-moment coefficients of correlation. 
The N on which these analyses were based was 
usually somewhat smaller than the total of 194, 
because data were not complete for all Ss on all 
variables; no systematic basis was discerned for the 
loss of cases. 


RESULTS 


Role Dimensions in Relation to Organizational 
Variables 


Table 1 presents, in terms of analysis of 
variance, the comparison of role-dimension 
scores associated with various organizational 
and job settings. Table 2 reports product- 
moment correlation coefficients between role- 
dimension scores and certain continuously 
distributed organizational variables. In both 
tables, significance levels are reported on the 
basis of two-tailed tests. 

Tables 1 and 2 show that there were a 
number of significant differences in roles of 
the executives studied, depending on the 
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organizational setting in which they operated. 
Organizational features associated with these 
differences included the organizational mis- 
sion, level, size, stability, and whether urban 
or rural in location. The executive’s job 
family, GS level, and span of control also 
were influential. The role dimensions most 
frequently varying with organizational char- 
acteristics were Staffing, Controlling, and Time 
Spent with Others. These relationships are 
described in more detail below. 

As shown in Table 1, the roles of those 
executives who worked in an organization 
having a mission of administration entailed 
relatively little emphasis on Long-Range 
Planning and relatively strong emphasis on 
Staffing and on Controlling. Roles in organiza- 
tions having a major mission in the area of 
manufacturing, engineering, or materials 
handling, tended to be relatively high on 
Long-Range Planning, on Staffing, and on 
Time Spent with Others, and were average 
on Controlling. Those in the laboratory and 
research functions were moderately high on 
Long-Range Planning and low on Staffing, 
Controlling, and Time Spent with Others. 

The level of the organization was also 
related to the roles reported by the in- 
cumbents. The higher the level, the less 
prominent were Staffing and Controlling di- 
mensions. 

Those executives who worked in the Wash- 
ington headquarters gave less emphasis to 
Staffing, Technical Activity, and Controlling. 
These results are similar to those comparing 
urban and rural locations, partially because 
Washington was one of the major urban 
settings studied. 
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Executives in the smaller organizations 
placed more emphasis on Long-Range Plan- 
ning, possibly because small organizations 
may receive less attention and direction from 
headquarters and may therefore be more 
autonomous with respect to plans. These ex- 
ecutives also spent less time with others, 
which may support the common observation 
that larger organizations require more com- 
munication. Not surprisingly, executive roles 
in expanding organizations were relatively 
high in Staffing responsibilities. 

The job family of the executive’s position 
was found to be related to his role: those in 
research and development work were less in- 
volved in Long-Range Planning, those in 
construction and equipment engineering con- 
cerned more with Budgeting and less with 
Operational considerations, personnel special- 
ists more concerned with Long-Range Plan- 
ning but less with Budgeting, and financial 
people more concerned with Long-Range Plan- 
ning and Budgeting. 

As shown in Table 2, the more people 
supervised directly or indirectly by the ex- 
ecutive, the more role emphasis there was on 
Long-Range Planning, Staffing, Budgeting, 
Controlling, Administration, and Time with 
Others. 

The GS levels covered in this study were 
largely limited to Grades 13, 14, and 15. 
This restriction of range accounts in part for 
the relatively small correlations found in 
Table 2 between GS level and role character- 
istics. Nevertheless, the higher-level people 
were found to place significantly less stress 
on Controlling and more on Administrative 
activity rather than Technical activity or 


TABLE 2 


COEFFICIENTS OF CORRELATION BETWEEN ROLE DIMENSIONS AND CERTAIN ORGANIZATIONAL VARIABLES 





Role dimension 





Organizational 





variable Long- Technical Shared Operational| Technical vs. : 
ae eee consulta- ee gear she vs. profes- | administrative | Controlling Be! 
ning tion eftont sional activity 
No. persons reporting 
directly 08 60% —.02 eda .07 10 -O1 .19 .32** 
No. persons reporting 
directly or indirectly | .31** 3D —.08 24% .06 10 —.20* aa .18* 
GS level a5: rll —.16* ako — .03 —.11 — 29% —.22* .16* 





Note.—N = 145, 
Mo 05s 
wep <= .00: 
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TABLE 3 


COEFFICIENTS OF CORRELATION BETWEEN ROLE DIMENSIONS AND SUPERVISORS’ CHARACTERISTICS 








Supervisors’ 


Role dimension 




















Sot? Long- : Shared : pe a 
characteristic & Technical ~ «4. | Operational] Technical vs. ns 
plate oe consulta- aoe aie vs. profes- administrative Controlling eee 
nie tion nerart sional activity 5 
Initiation of structure | —.17* -0S —.15 —.01 =—15 —.08 .08 —.07 14 
Consideration 02 .03 —.05 07 —.26** .06 —.07 an 00 
Percentage of S’s 
duties supervisor 
can perform .O7 .04 —.02 .14 .09 .30** —.12 —.06 Eis 
Time S served under 
supervisor —.22** -04 -03 —.02 .02 —.01 .06 .08 .06 
Time in present 
position —.19* .06 (il —.06 =i) —.05 .08 .00 = 02 
Age —.07 —.01 —.08 —.11 —.08 —.07 —.09 04 = 07; 
Amount of education —.17* | —.05 —.14 —.03 —.12 —.19* .04 —.26** .07 
Academic standing aS —.11 .07 04 — .03 .07 O01 10 —.04 
Closeness of contact 
with S 15 —.06 .00 .09 -06 11 —.14 —.01 .03 
Length of face-to-face 
contact —,.21** .05 00 .05 10 —.09 16** nL .08 
Note.—N = 136-146. 
e5T< .05! 


*KD < 01. 


Consultation. There was some trend also for 
them to engage in more Long-Range Planning 
and to spend more Time with Others. 

Table 3 reports the findings bearing on the 
supervisory aspect of the organizational cli- 
mate within which Ss operated. The table 
lists correlation coefficients between char- 
acteristics of Ss’ supervisors and Ss’ own role 
perceptions. The coefficients in Table 3 are 
small, and the number of significant ones is 
not much beyond chance expectancy. There 
is some indication that the supervisors’ formal 
education was appreciably associated with the 
roles of their executive subordinates, the 
more highly educated supervisors having sub- 
ordinates whose roles involved less emphasis 
on Long-Range Planning and on Controlling, 
and greater emphasis on Professional rather 
than Operational Concerns. This finding may 
be as much a function of the technical nature 
of the jobs held by subordinates of highly 
educated supervisors as it is of the personal 
impact of the supervisor. On comparing the 
columns of Table 3, it appears that Long- 
Range Planning is the role dimension most 
frequently associated with characteristics of 
the supervisor; executives whose roles more 
heavily featured this characteristic were those 
whose supervisors did less Initiation of Struc- 
ture, had less extensive face-to-face contact 
with their subordinates, showed evidence of 


greater mobility in jobs, and had less educa- 
tion. 


Interaction Effects of Personal and Organiza- 
tional Variables 


In a complete nomology of executive role 
behavior, one would expect that organizational 
influences would interact with personal dis- 
positions. The present study was not designed 
to delve deeply into personal correlates of 
executive roles, but it did include the col- 
lection of certain biographical data which 
could be examined in this light. The personal 
variables found to be more strongly associated 
with role have to do with educational back- 
ground. For example, Ss with more formal 
education described their jobs as entailing 
less Long-Range Planning, Staffing, Budget- 
ing, and Controlling, but more Professional 
Concerns. However, most of the personal 
variables were found to have only slight rela- 
tionships, if any, with the nine role dimen- 
sions; this statement covers variables like 
age, amount of supervisory experience, and 
number of dependents. Perhaps for this reason, 
when two-way analyses of variance were 
performed of role-dimension scores against 
both personal and organizational variables, 
the number of significant interaction effects 
was only about chance, and yielded no mean- 
ingful overall picture. 
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Relation of Role to Performance Ratings 


One of the nine role dimensions turned 
out to have low but statistically significant 
relationships to the three ratings (median r = 
.19), to wit, executives whose roles more 
prominently featured Administrative as con- 
trasted with Technical Activity received 
higher ratings from their superiors. This was 
not due to a tendency to rate higher those 
who occupy higher-level jobs, since the cor- 
relations of ratings and GS level were approxi- 
mately zero. It may signify that administra- 
tive behavior is perceived as a more critical 
requirement of executive jobs of the types 
studied. 


DIscUSSION 


In line with psychological theory, executive 
role was found to be associated with features 
of the organizational and work setting. In this 
sample of positions and organizations, dif- 
ferences in executive roles were especially 
marked between organizations of different 
missions and levels, between jobs designated 
as general administration compared to those 
characterized as scientific or fiscal, and be- 
tween positions showing differences in span of 
control. The role dimensions of Staffing, Con- 
trolling, and Time with Others were par- 
ticularly prone to vary with organizational 
characteristics. 

These results underscore the potential 
fruitfulness of taking organizational settings 
more fully into account when designing ex- 
ecutive personnel management practices, such 


as techniques of selection, training, or super- 
vision. For example, it would be reasonable 
to expect that critical requirements for ex- 
ecutives having large spans of control, which 
were found here to be characterized by greater 
role emphasis on administrative, planning, and 
personnel matters, would be different from 
those having smaller spans of control. The 
logical next step would be to see whether 
improvements in executive personnel manage- 
ment can indeed be effected when organiza- 
tional variables are included as moderating 
parameters in investigations of the relation- 
ships between management methods and 
criteria of effectiveness. The present study 
indicates the relative potency of a number of 
organizational variables in this regard; by 
specifying how role behavior is associated 
with the more influential variables, it sug- 
gests which aspects of management method 
may be fruitfully examined for their impact. 
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UsSEVORIWMIGH SCHOOLy DATAXTOUPREDICT 
WORK PERFORMANCE’ 
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For a sample of high school graduates employed in an aircraft plant, teachers’ 
work habits and cooperation ratings, absenteeism, and grade-point data were 
obtained from high school transcripts and related to work-performance criteria 


of supervisory ratings, absenteeism, and tardiness records. 


Significant rela- 


tionships were obtained between the high school predictors and the work- 


performance criteria. 


For the great majority of industrial test- 
validation studies, the obtained validities are 
disappointingly low, indicating that the in- 
dustrial researcher must search for additional, 
nontest predictors to be used in conjunction 
with the test information. One of the factors 
that is frequently considered important is 
that of “attitude toward work.” Attitude is 
not frequently utilized, though, because psy- 
chologists have found it extremely difficult to 
obtain useful measures of attitude in an 
employment setting. However, a meaningful 
measure of attitude might be obtained from 
an earlier situation similar in nature to the 
work situation, one in which the § had an 
opportunity to express similar attitudes. This 
study was directed toward the investigation of 
one source of previous evidence of attitude— 
data concerning behavior in the high school. 


PROCEDURE 


The Ss used in this study were 100 employees of 
the Lockheed-California Company who (a) gradu- 
ated from a high school in the Los Angeles City 
School Districts, (b) attended that school for at 
least 2 yr., (c) were employed by the Lockheed- 
California Company within 1 yr. of graduation, and 
(d) were employed by the Lockheed-California Com- 
pany for at least 1 yr. 

The data for the first seven variables (the pre- 
dictors) in Table 1 were obtained from Ss’ high 
school transcripts, while the data for the next five 

1 Extensive assistance was received in connection 
with this study from many members of the Los 
Angeles City School Districts. It was a pleasure to 
work with Muriel Sheldon, Clyde Sorensen, and Rose 
O’Connor of the Secondary Guidance and Counseling 
Section who provided invaluable advice on the de- 
sign of the study and obtained all of the predictor 
data used in it. 
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(the criteria) were gathered from supervisory ratings 
and departmental records. 

The teachers’ work habits and cooperation ratings 
Were given on a 3-point scale, and the supervisory 


ratings on a 5-point scale. The “sex” variable was 


scored 0 for females and 1 for males. 


RESULTS 


The means, the standard deviations, and 
the intercorrelation matrix are presented in 
Table 1. 


Reference to Table 1 reveals the following: 


1. The standard deviations of both the 
predictor and the criterion variables are large 
enough to provide adequate discrimination. 

2. The standard deviations of the absen- 
teeism and tardiness data are much larger 
than would be expected if the distributions 
were normal. This indicates that a relatively 
large percentage of Ss had either a very low 
number of absences or a very large number. 
Plotting of the data revealed this to be the 
case. 

3. The teachers’ ratings are very highly 
related to each other and to students’ grade 
point. 

4, The teachers’ ratings, the students’ grade 
point, and school absences are significantly 
related to supervisory ratings. Ten of these 
12 correlations are significant at the .01 
level. 

5. The teachers’ ratings and school ab- 
sences are related to work absences at the 
.01 level. 

6. Grade point and rank in class are ex- 
tremely highly correlated, indicating that a 
given grade-point average would be in the 
same relative rank position in each of the 12 
schools furnishing data. 
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TABLE 1 


MEANS, STANDARD DEVIATIONS, AND INTERCORRELATIONS OF VARIABLES 








0 | 11 | 12 | 13 























Variable | M|SD 1 | Bs | 3 | 4 | 5 | 6 | 7 | 8 | 9 
1. School absences | 21.5 | 19.6 40 —11 —14 —28 —27 —32 —25 —31 —21 30 20 06 
2. Unexcused | 
school 
absences LESH S53 40 —04 —06 —18 —16 —25 —11 —31 —09 13 07 17 
3. Three-year 
grade point 2, .6|—11 —04 96 78 67 73 36 37 34 —16 10 —58 
4. Rank in class 
(inverted) 42.7|27.4| —14 —06 96 73 63 Te 33 34 30 —16 10 —56 
5. Teacher’s work 
habits rating Dol .4|—28 —18 78 73 85 75 34 44 41 —26 —02 —5Sl 
6. Teacher’s coop- 
erationrating | 2.4) .3|}—27 —16 67 63 85 64 30 45 39 —29 —02 —42 
7. Grade point last 4d 
two semesters | 2.4 6} —32 —25 a) 72 75 64 30 38 26 —16 O7 —41 
8. Supervisor’s 
ability rating | 3.4 9|—25 —11 36 33 34 30 30 61 72 —14 —05 —23 
9. Supervisor’s 
conduct rating) 3.5 9|—31 —31 31 34 44 45 38 61 73 —31 —15 —22 
10. Supervisor’s 
production 
rating 3.4 9}—21 —09 34 30 41 39 26 72 73 —23 —14 —29 
11. On-the-job 
absences LEDER 30 13 —16 —16 —26 —29 —16 —14 —31 —23 37 —03 
12. On-the-job 
tardies 3.6] 4.1 20 07 10 10 —02 —02 07 —05 —15 —14 37 —22 
13. Sex 06 17 —58 —56 —51 —42 —41 —23 —22 —29 —03 —22 
Note.—A correlation of .20 is necessary for significance at the .05 level. A correlation of .26 is necessary for significance at 
the .01 level. Decimals have been omitted from the correlation part of the table. 


DIscussION 


These results indicate the following: 


1. The high school data collected in this 
study have adequate discrimination to serve 
as predictor data. The teachers are making 
meaningful discriminations of work habits 
and cooperation behavior. 

2. The high school data are significantly 
related to performance on the job as evalu- 
ated by the immediate supervisor. Of even 
greater significance, the degree of relation- 
ship between the high school data, especially 
the teachers’ ratings, and the work-perform- 
ance data is greater than that normally ob- 
tained with aptitude tests (e.g., see Ghiselli 


& Brown, 1955, p. 227). In other words, 
whatever it is that determines a supervisor’s 
evaluation of work performance, the high 
school teacher is able to (and does) measure 
it more successfully than aptitude tests. 

3. Absenteeism on the job can be pre- 
dicted to a statistically significant extent by 
absenteeism at school and by the teachers’ 
ratings. 


REFERENCE 


GHISELLI, E. E., & Brown, C. W. Personnel and 
industrial psychology. New York: McGraw-Hill, 
1955. 


(Received February 6, 1967) 


Journal of Applied Har hory 
1968, Vol. 52, No. 1, 31-35 


VOCATIONAL INTERESTS OF FEMALE 
COMPUTER PROGRAMMERS 


DALLIS K. PERRY ann WILLIAM M. CANNON 


System Development Corporation, Santa Monica, California 


Scores of 293 female computer programmers on the SVIB for men were found 
to be quite similar to those of male programmers, although interests of the 
women were somewhat higher in aesthetic and scientific fields and lower in 


technical and technical-supervision occupations. 


Female programmers indicate 


interest in all forms of mathematics and lack of interest in people. A key com- 
posed of items with a 12% or larger difference between female programmers 
and other professional women showed a 20% overlapping of professional 


women in general with a cross-validation group of 


female programmers. 


Programmers who are dissatisfied with their occupation score significantly 
lower on the key than do satisfied programmers. 


In the process of measuring the vocational 
interests of computer programmers and de- 
veloping a Strong Vocational Interest Blank 
(SVIB) scale to represent these interests, a 
substantial number of completed SVIBs for 
men were obtained from female programmers. 
The following points became clear through 
analysis of SVIB scores for the female pro- 
grammers: 


(a) The vocational interests of female pro- 
grammers are highly similar to those of male 
programmers. 

(6) The differences in interests that char- 
acterize men and women in general charac- 
terize male and female programmers as well, 
with the result that the male programmer key 
with male norms does not adequately repre- 
sent the interests of female programmers 
(Perry & Cannon, 1967). 


Because there is considerable overlapping 
of items between the SVIBs for men and for 
women, it appeared feasible to use the re- 
sponses of female programmers on the SVIB 
for men to develop a computer programmer 
key for women that could be used with the 
women’s form of the blank. At the same time 
it seemed desirable to provide normative data 
to facilitate the interpretation of women’s 
scores on the male programmer key in those 
instances in which the men’s SVIB form is 
used with women. The present study, there- 
fore, was intended to (a) develop a computer 
programmer key for the SVIB for women, 


hil 


and (0) develop female norms for the pro- 
grammer key for the SVIB for men. 


METHOD 


Subjects. The 1,378 computer programmers par- 
ticipating in the original study included 186 females. 
This group was augmented by 107 additional females 
from six of the large programming organizations in 
the original study. The same data-collection and re- 
porting procedures were again followed to assure 
anonymity of participants (Perry & Cannon, 1967). 
The original data were gathered in 1964 and the 
additional data in 1966. 

The total group of 293 female programmers was 
reduced for purposes of key construction and valida- 
tion by dropping 31 members who indicated that 
they (a) were dissatisfied with programming (“prob- 
ably” or “definitely” would prefer another field) ; 
(b) were primarily responsible for computer opera- 
tion, programmer instruction, or computer installa- 
tion management; or (c) had less than 1 yr. of 
experience in programming. With the exception of 
the last restriction, these constraints are the same as 
those applied to the male programmers. Male pro- 
grammers with less than 2-yr. experience were 
found to have vocational interests nearly identical 
with those of more experienced programmers. Con- 
sequently, it appeared justifiable to lower the ex- 
perience requirement from 2 yr. to 1 yr. in the in- 
terest of maintaining sample size. The resulting 
primary sample of 262 female programmers was con- 
sidered adequate for key construction and valida- 
tion, although it was smaller than desired. 

Sample characteristics, The Ss in the primary sam- 
ple ranged in age from 21 to 59, with a median of 
29.9, mean of 31.6, and standard deviation of 7.3. 
Almost 90% were 40 or younger. This group is thus 
younger than all the present SVIB female criterion 
groups except secretary-stenographer (Strong & 
Campbell, 1966). Like male programmers, more than 
90% of the females had attended college, but more 
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were college graduates and fewer held advanced de- 
grees. A much larger proportion of the females (60%) 
reported majors in mathematics, with no large 
grouping in any other field. All but 5% of the 
sample reported some kind of formal training in 
programming, the average amount being 160 hr. As 
might be expected, the proportion of females with 
machine-operations experience (20%) was much 
smaller than that for men (37%), and the average 
amount of operations experience was much less. 
Mean programming experience (5.3 yr.) of the 
female sample, however, was nearly identical to that 
of the male sample, despite the inclusion of a few 
1-yr. programmers in the female group. 
Employment . characteristics of the female pro- 
grammers were substantially different from those of 
the male programmers. Greater proportions of the 
females were in government and research and de- 
velopment organizations, and smaller proportions 
were in business and industrial firms. More of the 
females were on large programming staffs, and fewer 
were doing business programming. Fewer females 


Median monthly salaries of the women ($757) aver- 
aged about $75 less than the men, despite the fact 
that experience of the two groups was about the 
same and about one-third of the women were re- 
sponding 2 yr. later than the men. Examination of 
similar data for the portion of the female sample 
obtained in 1964 indicated that the above differ- 
ences between male and female programmers were 
not due to differences in sampling or dates of col- 
lection. Apparently there are differences in the kinds 
of programming employment entered by, or available 
to, female programmers. 

Procedure. For construction of interest keys the 
primary sample was divided randomly into two parts: 
a criterion group of 200 cases, and a validation 
group of 62 cases. The reference group represented 
an equal sampling from each of the 26 female SVIB 
occupational criterion groups.+ 

Items available for use in a new key for women 
included only those that: 


1 Reference-group response percentages were ob- 


were involved in requirements analysis, program tained from the Center for Interest Measurement Re- 
system design and installation, and supervision. search, University of Minnesota. 
TABLE 1 


MEANS AND STANDARD DEVIATIONS OF FEMALE PROGRAMMERS ON SVIB Krys FoR MEN 




















Key M SD Key M SD 
1 Optometrist 37.53 11 SI 3L Soc: Science Teacher, 27.89 10.58 
2 Physical Therapist 31.00 11.36 | 32 Sch. Superintendent 28.62 11.60 
3 Dentist 32.71 10.59 | 33 Minister 26.23 10.99 
4 Osteopath 28.80 09.65 | 34 Librarian 40.50 10.38 
5 Veterinarian 18.59 09.71 | 35 Artist 31.24 10.31 
6 Physician 37.85 11.64 | 36 Art Teacher 28.74 11.06 
7 Psychiatrist 36.74 10.30 | 37 Musician Performer 44.38 09.24 
8 Psychologist 37.10 10.46 | 38 Music Teacher 34.90 10.48 
9 Biologist 38.84 11.19 | 39 C.P.A. Owner 37.61 09.10 
10 Architect 36.81 11.12 | 40 Senior C.P.A. 40.32 08.84 
11 Mathematician 36.00 10.73 | 41 Accountant 37.80 10.06 
12 Physicist 29.51 13.31 | 42 Office Worker 35.64 09.98 
13 Chemist 41,22 11.56 | 43 Credit Manager S252, 11.00 
14 Engineer 39.43 10.74 | 44 Chamber of Commerce Executive 31.10 10.37 
15 Production Manager 34.27 07.46 | 45 Bus. Educ. Teacher 30.91 ul LP) 
16 Army Officer 26.89 12.15 | 46 Purchasing Agent 26.11 09.36 
17 Airplane Pilot 28.96 10.69 | 47 Banker Dail 09.09 
18 Carpenter 20.99 11.86 | 48 Pharmacist 29.95 08.45 
19 Forest Service Man 2 2elil 12.32 | 49 Mortician 26.63 08.53 
20 Farmer 29.60 09.49 | 50 Sales Manager 23.93 09.05 
21 Indus. Arts Teacher 25.54 10.03 | 51 Real Estate Salesman 28.59 07.08 
22 Math-Science Teacher 40.17 08.86 | 52 Life Ins. Salesman 22.30 09.20 
23 Printer 30.49 08.95 | 53 Advertising Man 31.61 08.88 
24 Policeman 24.95 07.75 | 54 Lawyer 33.62 08.48 
25 YMCA Phys. Director | 21.81 11.64 | 55 Author-Journalist 33.24 07.91 
26 Personnel Director 31.64 11.78 | 56 Pres.-Mfg. Concern 20.02 09.20 
27 Public Administrator 36.32 09.58 | 57 Specialization Level 44.33 09.66 
28 Rehabilitation Couns. 31.30 10.19 | 58 Occupational Level 59.84 06.52 
29 YMCA Secretary 20.85 11.93 | 59 Masculinity—Femininity 37.99 08.85 
30. Social Worker 29.46 11.98 | 60 Computer Programmer 20.34 10.17 
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Fic. 1. SVIB profile for male and female computer programmers. 


(a) had women-in-general response counts avail- 
able (ie., were in the 1946 edition of the SVIB for 
women) ; and 

(6) appeared in the revised SVIBs for both men 
and women in identical or equivalent forms. Judg- 
ments of equivalence were rather conservative, and 
only a few items were not identical. These included 
6 that involved direct changes in gender (e.g., actor- 


actress) and 18 that differed slightly in wording 
(e.g, member of supreme court—supreme court 
justice; president of society—president of society or 
club; author of “best seller”’—author of best-selling 
novel). 
There were 197 items that met these requirements 
and were therefore available for key construction. 
Several interest keys of varying numbers of items 
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TABLE 2 


MEANS AND STANDARD DEVIATIONS OF FouR GROUPS 
ON THE FEMALE COMPUTER PROGRAMMER KEY 








Group N | M Si 
Criterion | 200 | 20.26) 1152 
Validation 62mm as:20 | 9.38 
Women in General —11.52 | 13.734 

9.04 | 13.66 


Dissatisfied | 24 


a Estimated as 1.24 times the combined criterion-validation 
group standard deviation, based on the average ratio of women- 
in-general to criterion group standard deviations for existing 
SVIB keys. 


were constructed and evaluated. Differences in 
validity were slight, and a key composed of items 
with a 12% or larger difference between criterion and 
reference groups was selected as having the best 
combination of both validity and sufficient length 
to provide adequate reliability. 

The answer sheets were scored on 60 male SVIB 
keys, including the male programmer key, and 
normative data for female programmers on both the 
male and female programmer keys were computed. 


RESULTS 


Interests of female programmers. Means 
and standard deviations of female program- 
mers’ scores on 60 SVIB keys for men are 
listed in Table 1 and shown in profile form in 
Figure 1. For comparison, the profile for male 
programmers also is shown in Figure 1. The 
similiarity of the two profiles, despite the fact 
that all the scores were standardized on men, 
is apparent. The women have higher scores in 
the aesthetic fields, as would be expected, 
somewhat higher scores in scientific fields, 
and lower scores in the technical supervision 
and technical occupations. The marked simi- 
larity of the profiles, even in occupational 
areas that show differences in level of in- 
terest, emphasizes the similarity of interests 
of male and female programmers. 

Examination of the SVIB items that differ- 
entiate female programmers from women in 
general reveals two major clusters paralleling 
those for male programmers. These clusters 
indicate a strong interest in all forms of 
mathematics and a lack of interest in people 
—especially in activities involving responsi- 
bility for helping people. Female program- 
mers are also more interested than other 
women in physical science and technical ac- 
tivities and less so in natural and _ social 


science and literature. They prefer indepen- 
dent, nonroutine work. 

Female programmer key. The female pro- 
grammer key contains 156 weighted responses 
to 88 items. Means and standard deviations 
of criterion, validation, and reference groups 
are given in Table 2. The percentage of over- 
lapping of the criterion and reference groups 
is 21, and of the validation and reference 
groups it is 20. In these validity character- 
istics the key compares favorably with exist- 
ing SVIB keys for women. Two-week retest 
reliability of the key for 112 women is .90.? 

Of the 31 female programmers not included 
in the primary sample, 24 were excluded be- 
cause they indicated that they “probably” or 
“definitely” would prefer a different field of 
work. The mean and standard deviation of 
programmer key scores for this group of 
dissatisfied programmers are also shown in 
Table 2. The average score of the dissatisfied 
group is substantially and significantly lower 
than are those of the other programmer 
groups, providing additional evidence of the 
effectiveness of the key. 

Relationships to male programmer key. 
Of the 102 responses in the male programmer 
key and 156 responses in the female pro- 
grammer key, only 44 were common to the 
two keys. However, of the 58 responses in 
the male and not in the female key, only 
15 were available for inclusion in the female 
key. The large number of responses in the 








2 Reliability was computed by the Center for In- 
terest Measurement Research and provided through 
the courtesy of David P. Campbell. 


TABLE 3 


NORMATIVE DATA FOR FEMALE COMPUTER 
PROGRAMMERS ON MALE AND FEMALE 
ComMPUTER PROGRAMMER Keys 








Raw score Percentage 

distribution 
Teteee ee ———| on female key 

grade | Standar 
score Female Male 
key key Pro- 
grammers Wks 

A 45 to 14to 81 16to 56 70.2 3.4 
B+ | 40 to 44 9to 13 12.t02) 15 16.4 3.8 
B 35 to 39 Sito 78 8to 11 ee 8.2 
B— |30to34|— 2to 2 4to 7 3.4 10.1 
C+ | 25 to29} — 8to —3 Oto 3 A) 15.8 
Gc to 24 | —77 to —9| —42 to —1 0.8 58.7 





» Women-in-general estimated, 
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female and not in the male key is partially 
due to the less stringent criterion (12% 
rather than 15% difference between criterion 
and reference group) for including items in 
the female key. The correlation between scores 
of female programmers on the two keys is 
.60. The correlation among women in general 
would be substantially higher. Correcting for 
the larger variance to be expected for women 
in general increases the correlation to about 
.70. The relationship between the two keys 
is quite high, but not high enough so that 
one may be substituted for the other or the 
results on the two accepted as equivalent. If 
responses of women are obtained on the 
SVIB for men, their scores on the computer 
programmer key, expressed in terms of female 
norms, will be a meaningful measure of their 
interest in programming. Such scores, how- 
ever, cannot be interpreted in exactly the 
same way as scores on other SVIB keys for 
men (without female norms) or on the 
SVIB for women. 


Normative data. Scoring weights for the 
computer programming key for women have 
been provided to the major SVIB scoring 
services. Standard-score and _letter-grade 
norms for female programmers on both the 
men’s and the women’s computer programmer 
keys are given in Table 3. The norms are 
based on combined criterion and validation 
groups. Standard scores based on female 
norms for the men’s key are 4 to 5 points 
higher than standard scores based on men’s 
norms, indicating that when the men’s SVIB 
is used with women, the resulting computer 
programmer scores will be about one letter 
grade too low unless they are converted 
by Table 3. 
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SELECTION OF SALESMEN BY MEANS OF AN 
ASSESSMENT CENTER 


DOUGLAS W. BRAY anp RICHARD J. CAMPBELL 


American Telephone and Telegraph Company, New York City 


Newly hired candidates for sales positions were evaluated by means of an 


assessment center consisting of paper-and-pencil tests, 


an interview, and 


individual and group simulations. Assessment staff judgments were compared 
with job performance some months later as evaluated by a special observa- 
tional team. Assessment results were strongly related to this criterion. Super- 
visors’ and trainers’ ratings were not significantly related to the job-performance 
criterion nor to assessment results. The findings lend support to recent studies 
indicating the efficacy of the assessment-center method in personnel selection. 


The assessment center (Taft, 1959) re- 
ceived its first industrial application when 
it was introduced into the Bell System in 
1956 as a major research method of the 
Management Progress Study (Bray, 1964). 
Two years later, in 1958, an assessment center 
was opened in the Michigan Bell Telephone 
Company for the appraisal of candidates for 
promotion to management from vocational oc- 
cupations. The results of this first operational 
assessment center were deemed so useful by 
line managers that the assessment-center 
method spread widely through the Bell Sys- 
tem until over 50 assessment centers were 
established in the Bell System Companies 
processing approximately 8,000 men and 
women each year who are candidates for 
management positions. 

Follow-up investigations in the Manage- 
ment Progress Study (Bray & Grant, 1966) 
demonstrated considerable predictive power 
for the assessment-center method both for 
young noncollege management men who had 
emerged from the ranks at an early age and 
for college recruits assessed very shortly after 
employment. In addition, a study of 500 men 
recently appointed to management (Campbell 
& Bray, 1967), some of whom had not been 
assessed and some of whom had received 
various ratings at the assessment center, fur- 
ther demonstrated the usefulness of the 
method. 

The above assessment centers were designed 
to assess the ability to function successfully 
as a supervisory line manager. The present 
article describes the application of the assess- 
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ment-center method to a quite different prob- 
lem—the selection of prospective communica- 
tions consultants. The job of the communica- 
tions consultant is to work with business 
customers to determine their objectives, un- 
cover inconsistencies in methods of reaching 
these objectives, and suggest improved com- 
munications, where appropriate, for im- 
proved operations. 

A first step in devising an assessment 
center is a statement of the qualities relevant 
to success in the job for which assessment is 
planned. In order to arrive at this list of 
qualities, a psychologist experienced in the 
assessment-center approach due to his work 
on the Management Progress Study, David 
Berlew, spent several weeks interviewing 
sales personnel, observing sales training, and 
making numerous customer visits with ex- 
perienced communications consultants. When 
a preliminary list of qualities had been iso- 
lated, assessment techniques designed to re- 
veal these qualities were selected or developed. 
A group of experienced assessors, under the 
direction of the senior author, then applied 
these methods to a trial group of six successful 
salesmen. 

This pretest resulted in some redefinition of 
the assessment qualities and some revision 
of the assessment-center techniques. The list 
of qualities was reduced to 20 and included 
such characteristics as resistance to premature 
judgment, oral presentation, oral defense, be- 
havior flexibility, and persistence. The major 
assessment techniques were three simulations: 

Leaderless Group Discussion—This tech- 
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nique involves six assessees. They are told to 
assume that they are attending a meeting with 
five of their peers to discuss candidates for 
promotion from among their fictitious sub- 
ordinates. Each is given a description of a 
subordinate whom the assessee is to present 
as his candidate. Each assessee is given up to 
5 min. to present his man, followed by a 
1-hr. group discussion which must result in a 
rank ordering of the six candidates. 

Oral Fact-Finding Exercise—This is an 
individual technique in which the assessee is 
told that he is an arbitrator in a labor- 
management dispute. He is given a short out- 
line of the dispute and the opportunity to 
question a staff member to determine addi- 
tional facts and issues. He then presents his 
decision and its rationale orally to the staff 
member and is questioned intensively about it. 

Consulting Case—This is an individual 
technique in which the assessee is given 
written material—textual, graphic, and tab- 
ular—concerning the operation and problems 
of a business concern. His task is to study 
this material and prepare recommendations 
for the company. He presents his recom- 
mendations orally to two staff members who 
role play the two controlling partners of the 
enterprise and discusses his recommendations 
with them. He then prepares a final written 
recommendation. 

In addition to these simulations, the 
assessees completed a biographical data 
blank, underwent a lengthy interview, and 
took four tests, all cognitive in nature. These 
tests were the School and College Ability 
Test, a Critical Thinking Test, an Abstract 
Reasoning Test, and a Test of Knowledge of 
Contemporary Affairs. 

A trial of the assessment-center methods 
outlined above was conducted by setting up 
a Sales Assessment Center in Cleveland, Ohio 
to which some 14 participating Bell Telephone 
Companies sent candidates with little or no 
Bell System sales experience. This assessment 
center was staffed by a seven-man team, in- 
cluding a fourth-level director and six third- 
level sales managers from various telephone 
company sales departments. This staff re- 
ceived 3 weeks’ training by the authors and 
others before beginning the actual assessment 
of the trial Ss. 


Assessees were processed by the assessment 
center in groups of six, and it required 2 full 
days for each group of six to undergo the 
various assessment techniques. Only one group 
of six was processed per week, however—the 
remaining days of each week being used by 
the assessment staff in preparing reports of 
the behavior of each candidate in each ex- 
ercise and for holding staff sessions to hear 
these reports for each candidate, to rate him 
on the 20 sales-assessment qualities, and to 
make a final recommendation as to his suit- 
ability as a communications consultant. 

In all, some 142 men from the 14 par- 
ticipating telephone companies were assessed, 
but various factors reduced the sample of men 
for the present report to 78. This sizable 
reduction was due to the fact that not all of 
the men sent to the assessment center met the 
standards which had been set up for the 
study. Some had had previous Bell System 
sales experience, others were on a special 
training program for college recruits, and some 
were not assigned to communications con- 
sultant work after assessment. In addition, a 
few resigned from the company before on-the- 
job criteria were collected. 

Great care was taken that the results of 
the man’s performance at the assessment 
center did not affect his assignment or ap- 
praisal on the job. All of the men in the study 
were Bell System employees at the time of 
their assessment and there was no feedback to 
their trainers, their supervisors, or to the 
men themselves on their performance at the 
assessment center. It was anticipated that if 
the assessment center proved a_ valuable 
screening method, it would be used as a pre- 
employment screen, but since the individual 
results of the trial assessments were not to 
be fed back to the supervisors immediately 
concerned with the man’s training or assign- 
ments, all Ss were employed. In fact, the 
participating companies were urged to keep 
as many as possible on the payroll so that 
criterion data could be collected after they 
had had sales training and were on the job 
for several months. 

Before proceeding to a discussion of the 
criterion against which the assessment judg- 
ments were compared, it may be of interest 
to examine the final judgments of the candi- 
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TABLE 1 


ASSESSMENT-STAFF JUDGMENTS OF ACCEPTABILITY 
FOR SALES EMPLOYMENT 


Assessment judgment | No. men | Percentage 
of group 
More than acceptable 9 12% 
Acceptable 32 41% 
Less than acceptable 16 20% 
Unacceptable 21 271% 
Total 78 100% 


dates made by the assessment staff. After the 
reports on each candidate were read at the 
weekly staff meeting and the 20 sales qualities 
rated on a 5-point scale, the assessment staff 
made a final judgment as to each candidate’s 
acceptability as a potential communications 
consultant. The 78 assessees were put into 
four groups—one group was called “more 
than acceptable,” another “acceptable,” a 
third group was considered “less than ac- 
ceptable,”’ and the final and lowest group was 
considered ‘‘unacceptable.” Table 1 shows 
the distribution of assessment-staff judgments. 
It will be seen that only 9 of the 78 men 
performed outstandingly at the assessment 
center while 21 men did very poorly. The top 
two groups—the acceptable groups—totaled 
41 men, or 53% of the sample, while 47% 
were considered unacceptable in varying de- 
grees. 

This finding itself is of interest. It indicates 
that even though the 78 assessees had met all 
employment standards and had been screened 
by their local companies as qualified for the 
job of communications consultant, experienced 
sales managers judged that from what they 
could see at the assessment center, only about 
half possessed good potential for their pro- 
posed job. This result is in keeping with 
other assessment findings in the Bell System, 
both of the college recruits of the late 1950s 
and the male and female candidates for man- 
agement in the various telephone companies. 
Assessment staffs characteristically reject one- 
half or more, usually more, of the candidates. 

The distribution of assessment-staff judg- 
ments was not significantly different for the 
64 men who were eliminated from the study. 
Forty-seven percent of that group “passed” 


the center as compared to 53% of those re- 
tained in the sample. 

The primary criterion against which as- 
sessment-center judgments were compared 
was first-hand observation of actual behavior 
in sales contacts 6 mo., on the average, after 
assessment. By that time each of the candi- 
dates had had sales training and had been 
working in the field for several months. This 
criterion had been decided upon before the 
assessment study was even begun. It was 
considered more pertinent than “sales results” 
which are influenced by many factors other 
than the behavior of the salesman. Company 
sales executives had, in any case, indicated 
that because of differences in market char- 
acteristics in the many areas across the United 
States to which these candidates were as- 
signed, no statistic or group of statistics would 
accurately reflect individual performance. 

The performance observations were carried 
out by a field-review team working out of 
American Telephone and Telegraph head- 
quarters in New York. This group’s regular 
work involved such field reviews, and it was 
an entirely experienced team that went out 
on these special criterion reviews. A reviewer 
accompanied each S on as many visits as 
necessary to determine whether he did or did 
not meet established standards in conducting 
his sales activities. These standards included 
preparation, usage prospecting, recommenda- 
tions, closing, and implementation. As a re- 
sult of the observations, each man’s perform- 
ance was classified as meeting standards or 
failing to meet standards. The reviewers were, 
of course, entirely unaware of any judgments 


TABLE 2 


ASSESSMENT JUDGMENTS AND FrELp— 
PERFORMANCE RATINGS 








No. Percentage 
Assessment judgment No. meeting =| Ete 
men review review 


standards] standards 





More than acceptable 9 9 100% 
Acceptable 32 19 60% 
Less than acceptable 16 i 44%, 
Unacceptable 2H, 2 10% 

Total 78 37 47% 


Note.—x? = 24.19; » < .001. 
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or observations made of the candidate at the 
assessment center. 

Of the 78 men, 37 men or 47% met review 
standards. Table 2 shows the relationship 
between the assessment-center judgments and 
whether or not the salesmen met the stand- 
ards. It will be seen that all of the 9 men 
deemed more than acceptable at the assess- 
ment center met review standards, while only 
2 of the 21 men deemed completely un- 
acceptable at the assessment center passed 
the field review. 

A supplementary investigation’ looked 
into the ratings of the Ss given by their sales 
supervisors and by the trainers who had had 
each man in sales training. A preliminary 
letter was sent to each supervisor indicating 
that he would be asked to rate each salesman 
reporting to him. He was asked to prepare 
himself to do this and, when ready, to tele- 
phone either of a two-man team in the sales 
staff in A.T.&T. in New York. The New York 
staff member verified that he and the super- 
visor each had the same list of salesmen, 
and the supervisor was then asked to rank all 
of the men reporting to him in terms of their 
excellence as a communications consultant. 
After the ranking was recorded, the super- 
visor was then asked to rate each man on a 
graphic rating scale, the upper end of which 
was labeled “highest—I wish I had many 
more like him” and the lower end labeled 
“lowest—He’s in the wrong job.” The super- 
visor was then asked whether he knew if any 
of the men had been assessed and, if he did, 
he was asked whether he knew how the man 
had come out at the assessment center. In 
most cases, the supervisors did not know the 


1 This approach was devised by Ira Cisin of 


George Washington University. 


TABLE 3 


ASSESSMENT JUDGMENTS AND SUPERVISORS’ RATINGS 





Supervisors’ rating 
Assessment judgment } No. men 





Higher | Lower 
Acceptable 41 57% 43% 
Less than acceptable 37 AT% 53% 





Note.—Differences not statistically significant. 


TABLE 4 


SUPERVISORS’ RATINGS AND FiELD— 
PERFORMANCE RATINGS 


Did not 





Supervisors’ Met review meet re- 
, No. men 3 
rating standards | view stan- 
dards 
Higher 41 47% 53% 
Lower CY 447, 56% 





Note.—Differences not statistically significant. 


men had been assessed and in no case did the 
Supervisor have any accurate information 
about the outcome of the man’s assessment. 
The same procedure was carried on with 
trainers, who rated and ranked all men in 
each training group in which an assessee had 
been trained. 

In order to maximize differentiation for 
purposes of comparison, the average rating 
given by the supervisors and by the trainers 
was determined. The study group was then 
divided at about the midpoint of each set 
of ratings. In the following tables, those men 
who scored below the midpoint are described 
as having received higher ratings and those 
who scored below the midpoint are described 
as having received lower ratings. It should be 
noted, however, that the ratings were biased 
in an upward direction, so some of the men 
counted as receiving lower ratings were not, 
in an absolute sense, rated below average. 

Table 3 shows the relationship between the 
assessment-center judgments and the super- 
visors’ ratings. It can be seen that there was 
not a strong relationship—57% of those who 
received acceptable assessment-center judg- 
ments received higher ratings by the super- 
visors, as compared to 47% of those who 
had received unacceptable assessment-center 
ratings. This difference is not statistically 
significant. 

These results immediately raised the ques- 
tion of to what extent the supervisor’s ratings 
were related to the field-performance ratings. 
Table 4 shows the results. It will be seen that 
there was very little or no relationship be- 
tween supervisory ratings and ratings of per- 
formance on the sales job by the field-review 
team. Such a result has, of course, consider- 
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TABLE 5 


ASSESSMENT JUDGMENTS AND TRAINERS’ RATINGS 


Trainers’ rating 














Assessment judgment | No. men |———-————— 
Higher | Lower 
Acceptable 41 51% 49% 
Less than acceptable 37 55% 45% 
Note.—Differences not statistically significant. 
TABLE 6 
TRAINERS’ RATING AND Fre_p—PERFORMANCE 
RATING 
Did not 
Trainers’ 7 Met review meet re- 
: No. men ; 
rating standards | view stand- 
ards 
Higher 41 56% 44%, 
Lower 37 40% 60% 


Note.—Differences not statistically significant. 


able import for questions of appraisal. Of 
considerable methodological importance, how- 
ever, is the observation that the assessment 
center might have been considered not suffi- 
ciently accurate for use if supervisory judg- 
ment had been relied upon as the sole 
criterion. 

Table 5 shows the relationship between the 
assessment-center judgments and the trainers’ 
ratings. Here the relationship is very slight 
and negative, if indeed there is a relationship. 
There is somewhat more of a suggestion that 
trainers might be able to predict field per- 
formance, as Table 6 shows. Fifty-six per- 
cent of the men who received higher training 
ratings met review standards as compared to 
40% of those who had received lower ratings. 
This difference is, however, not statistically 
significant. In addition, 85% of the Ss had 
been given above-average training ratings— 
not a very fine screen. 

In spite of this lack of strong evidence for 
the usefulness of trainers’ ratings, there re- 
mained the possibility that such ratings could 
make a further selection of those who were 
deemed acceptable (or better) by the assess- 
ment center. Table 7 shows the relationship 
between trainers’ ratings and field performance 


for the 41 men who were judged acceptable 
(or better) by the assessment staff. There is 
only a very slight, and insignificant, relation- 
ship between trainers’ ratings and perform- 
ance, and almost two-thirds of those who get 
lower ratings show good performance. It does 
not appear that the trainers’ ratings, as they 
were made in this study, would be of practical 
use as a Selection device. There remains, of 
course, the possibility that trainers’ observa- 
tional and rating methods could be improved. 

It may be of interest to compare the 
degree of prediction obtained from the assess- 
ment-center staff judgment and that from the 
four paper-and-pencil tests. Table 8 shows 
the data in correlation form. It will be seen 
that the highest correlation of any of the tests 
with the criterion is .28. The multiple cor- 
relation using all four tests is .33 (not cor- 
rected for shrinkage). These, of course, are 
considerably below the .51 correlation of the 
assessment judgment with the field-perform- 
ance ratings. 

The biographical blank used at the assess- 
ment center was administered to give the 
staff members some advance information be- 


TABLE 7 


TRAINERS’ RATING AND Fre_p—-PERFORMANCE RATING 
(MEN JUDGED ACCEPTABLE OR BETTER AT 
ASSESSMENT CENTER ONLY) 








Did not 
ae ne : one 24 / ry ro 
rainers agian Met review meet re 
rating standards view stan- 

dards 
Higher 21 71% 29% 
Lower 20 65% 35% 





Note.—Differences not statistically significant. 


TABLE 8 


CORRELATIONS OF ASSESSMENT PREDICTION AND 
PAPER-AND-PENCIL TESTS WITH 
FIELD—PERFORMANCE RATINGS 











Assessment judgment all 
SCAT 25 
Critical Thinking Test .26 
Contemporary Affairs Test .28 
Abstract Reasoning Test .02 
Multiple R—Four Tests 33 
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fore conducting interviews with the candi- 
dates. The blank was not scored. Because of 
the small size of the sample, it is unwarranted 
at this time to analyze these biographical 
items or to attempt to combine by multiple 
correlational techniques all the various pieces 
of data gathered during the 2 days of assess- 
ment. Such analyses can be done later when 
more cases accumulate and are now almost 
completed for a management assessment 
center (Bray & Grant, 1966; Grant, Katkov- 
sky, & Bray, 1967). 


DISCUSSION 


The assessment-center method of evaluating 
potential for various occupations has had a 
spotty history (Taft, 1959). Conclusions 
about its effectiveness have been difficult to 
draw because the number of applications of 
the method has been small, techniques and 
procedures have varied markedly from one 
application to the next, and criterion problems 
are formidable. Studies of Bell System as- 
sessment centers for the prediction of success 
in management positions have, however, uni- 


formly shown that the method has substantial 
validity. This has been true whether the 
assessment staff consisted primarily of pro- 
fessional psychologists or of specially trained 
management personnel. The present study 
demonstrates that an assessment center staffed 
by sales managers instructed in assessment 
techniques can afford a valuable aid in 
the selection of prospective salesmen. 
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A biographical inventory was developed and cross-validated in a sample of 
400 high school boys in the New York metropolitan area, subdivided into 


creative and control groups in artistic and scientific fields. 
nominations supported by specific creative products. 


criterion was teachers’ 


The principal 


Validity coefficients in the cross-validation sample were .64 and .35 for artistic 
and scientific keys (each p< .001). Final keys, containing items differentiating 


in initial and cross-validation samples (compound p<.05), 


were used to 


describe similarities and differences between creative students in the 2 fields. 


Among the many approaches followed in 
current investigations of creativity, a major 
distinction is that between the use of test 
scores and of evaluated achievement as cri- 
teria of creativity. The former approach is 
illustrated by the research of Guilford (1959, 
1967), Getzels and Jackson (1962), and 
Torrance (1962); the latter by that of Roe 
(1951a, 1951b, 1953) and MacKinnon 
(1962). With tests as a criterion, not only 
is coverage likely to be limited to certain 
aspects of creative behavior, but motivational 
variables are also largely excluded. 

As a predictor and as an instrument for 
exploring the correlates of creativity, the 
biographical inventory offers particular prom- 
ise, having demonstrated its validity in pre- 
dicting many other complex criteria in in- 
dustrial, military, and educational contexts 
(Henry, 1966). When cross-validated against 
criteria of scientific creativity, biographical 
inventories have proved successful with several 
types of research workers. Relevant investiga- 
tions include those of Smith, Albright, Glen- 
non, and Owens (1961) with petroleum re- 
search scientists, Buel (1965) with research 
personnel in a pharmaceutical company, Mc- 
Dermid (1965) with technical engineering 
personnel, and Chambers (1964) with eminent 
psychologists and chemists. In a series of 
investigations of National Aeronautics and 


1The present study is part of a larger project 
supported initially by Grant No. MH 10233-01 from 
the National Institute of Mental Health and sub- 
sequently by Subcontract No. 2 of the Center for 
Urban Education’s Contract OEC-1-6-062868-2039 
with the United States Office of Education. 
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Space Administration (NASA) | scientists, 
Taylor, Ellison, and Tucker (1966) obtained 
validity coefficients in the .40s and .50s when 
biographical inventories were cross-validated 
against several criteria of creative achieve- 
ment. 

With only a few exceptions, investigations 
of creativity at the high school or college 
level have employed tests as criteria, pre- 
dictors, or both. Dauw (1966) reports suc- 
cessful differentiation between highly creative 
and less creative adolescents by means of a 
biographical inventory, but his subjects (Ss) 
were selected solely on the basis of creativity 
test scores. Parloff and Datta (1965) com- 
pared contrasted groups of participants in 
the Westinghouse Science Talent Search, 
selected on the basis of judges’ ratings of their 
research projects. However, these groups were 
compared chiefly in personality test scores, 
the only background items reported being 
father’s occupation, socioeconomic level, and 
intactness of family. Taylor, Cooley, and 
Nielson (1963) applied a modified version 
of the biographical inventory developed on 
NASA scientists to high school students par- 
ticipating in a National Science Foundation 
(NSF)-supported summer science program. 
The biographical inventory proved to be the 
best overall predictor of the creative research 
performance of these students, its validity 
reaching .47 in one of the groups. In a 
follow-up of National Merit Scholarship 
Finalists, Nichols and Holland (1963) cor- 
related biographical data with self-reported 
evidence of creative achievement during the 
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first year of college. The relationships found 
were rather low, a fact that may be attribut- 
able in part to inadequate criterion data and 
narrowly restricted ability range of the sample. 
The present study * was concerned with the 
development and cross-validation of biograph- 
ical inventory keys against achievement cri- 
teria of creativity among high school boys. 
An examination of these keys provides a de- 
scription of significant antecedents and cor- 
relates of creativity in this population. 


METHOD 
Subjects 


The Ss employed in the principal data analyses 
were 400 male students from six high schools in the 
New York metropolitan area.2 The group included 
188 seniors, 144 juniors, and 68 sophomores. Socio- 
economic level and parental education of the entire 
sample were superior for the geographical area. 
The schools were chosen because they offer courses 
or programs providing an opportunity for creative 
achievement. 


Creativity Criteria 


Four criterion groups, of 100 students each, were 
formed as follows: Creative-Artistic (CrA), Con- 
trol-Artistic (CoA), Creative-Scientific (CrS), and 
Control-Scientific (CoS). The creative Ss had to 
meet two criteria: (a) teacher nomination on the 
basis of one or more creative products to be de- 
scribed on a teachers’ nomination form; (0b) score 
above a minimum cutoff on Guilford Alternate 
Uses and Consequences tests. The “artistic” category 
covered graphic art and literary expression; the 
“scientific” included sciences and mathematics. 

The control Ss were nominated by the same 
teachers as having produced no evidence of creative 
achievement, and they scored below a maximum 
cutoff on the two Guilford screening tests. It should 
be noted that the Guilford tests were employed only 
as a check on irrelevant factors that might have 
influenced the nomination of creative or control Ss. 
The scores were used only to exclude cases—never 
to admit them. Moreover, the two cutoff scores 
were sufficiently extreme as to exclude only in- 
dividuals whose test performance was highly dis- 
cordant with their reported achievement. In terms 


2More detailed information on procedure and 
results can be found in Schaefer (1967). 

8 The authors acknowledge the invaluable coopera- 
tion of J. Wayne Wrightstone, assistant superintend- 
ent, Board of Education of the City of New York, 
and of the administrative and teaching personnel of 
the participating schools: Forest Hills, Jamaica, 
Midwood, and Stuyvesant (public high schools in 
New York City), Bronxville (a suburban public 
high school), and Regis (a private high school con- 
ducted by the Jesuit order in New York City). 


of available published norms, the mean scores of the 
creative students on the two Guilford tests were 
approximately equal to those of college students, 
while the mean scores of the control groups ap- 
proximated those of ninth-grade students. 

The control Ss were matched with the creative 
Ss in school attended, class, and grade-point average 
(GPA). Since the CrS students significantly excelled 
the CrA students in GPA, two separate control 
groups were assembled. Although no attempt was 
made to equate groups in socioeconomic level, no 
significant differences were found either between 
creative and control or between artistic and sci- 
entific groups in mean Shevky-Bell index—an index 
based on the characteristics of the census tracts 
in which the individual resides. 


Biographical Inventory 


A comprehensive 165-question biographical in- 
ventory was prepared for this study. The questions 
were formulated principally on the basis of previous 
research findings of other investigators and hy- 
potheses regarding the correlates of creativity. The 
165 questions were grouped into five sections labeled: 
physical characteristics, family history, educational 
history, leisure-time activities, and miscellaneous. 
While most of the questions dealt with objective 
facts regarding present or past activities and ex- 
periences, some called for expressions of preference 
and others concerned anticipated plans and goals. 

Although some multiple-choice and checklist items 
were included, many questions were open-ended. 
Even with the objective items, there was provision 
for adding other unlisted responses. While scoring 
and data analysis are more difficult under these 
conditions, these types of items yield a richer return 
of information and are especially appropriate in an 
exploratory study. All responses were coded prior to 
tabulation. For each question, there were several 
possible responses, and for some questions this num- 
ber was quite large. In addition, several questions 
yielded responses that could be classified from dif- 
ferent viewpoints to test different hypotheses. Con- 
sequently, the 165 questions yielded a total of 3,930 
“scorable items” or individual response alternatives 
used in the item analyses. 


Procedure 


The biographical inventory (together with three 
tests used in another part of the project) was 
administered to groups of 30-180 students during a 
2-hr. testing session held outside of school hours. 
The Ss were paid for participating in this testing 
session. Identification numbers were employed to 
provide anonymity, and students were assured of 
the confidentiality of the data. 

For purposes of data analysis, each of the four 
groups was subdivided into two groups of 50, used 
for development of scoring keys and cross-valida- 
tion, respectively. Each pair of groups was equated 
in number of students from each school, class dis- 
tribution, GPA, and mean score on the screening 
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TABLE 1 


BIOGRAPHICAL INVENTORY SCORES OF CRITERION 
Groups IN CROSS-VALIDATION SAMPLES 


























Creative-Artistic Creative-Scientific 
key* key> 
Score 
Creative- | Control- | Creative- | Control- 
Artistic | Artistic | Scientific | Scientific 
Over 110 6 0 0 0 
101-110 6 1 0 0 
91-100 11 2 0 0 
81-90 10 4 1 0 
71-80 7 8 7 2 
61-70 8 2 ih 3 
51-60 2 7 9 11 
41-50 0) 9 5 9 
31-40 0 9 6 ih 
21-30 0 3 4 10 
11-20 0 3 2 2 
1-10 0 2 1 0 
N 50 50 50 50 
M 88.76 52.38 53.59 41.62 
SD 19.58 24.32 17.69 13.91 
Range 52-140 3-110 4-84 17-72 
NCR Sl 3F 3.74* 
one .64* Eon 








® Possible range —96 to 335. 
b Possible range —82 to 239. 
*p <.001. 


tests. For each of the 3,930 scorable items, classified 
as present or absent, a phi coefficient was computed 
against the dichotomous criterion of creative versus 
control. These coefficients were computed separately 
in artistic and scientific criterion groups. All items 
with phi coefficients at the p<.20 or better sig- 
nificance level were considered for inclusion in the 
initial CrA or CrS scoring keys. Some of these items 
were excluded because they duplicated other items, 
were checked by three or fewer Ss, or were incon- 
sistent with other responses and hence were likely to 
have yielded isolated chance correlations. 

In the initial scoring keys, a weight of 1 was 
assigned to items discriminating between the p< .20 
and p< ..05 levels, and a weight of 2 to items dis- 
criminating at the p< .05 level or better. Items with 
higher frequencies in the creative group received posi- 
tive weights; those with higher frequencies in the 
control group received negative weights. The initial 
CrA and CrS scoring keys were used in scoring 
the biographical inventories of the corresponding 
creative and control Ss in the cross-validation 
samples. The scorers were unaware of the criterion 
status of the Ss. The scores thus obtained were cor- 
related with the dichotomous criterion to provide 
an estimate of the validity of the scoring keys. In 
order to utilize all the data in the selection of items 
for final keys, item analyses were carried out in- 
dependently in initial and cross-validation samples 
and those items were selected that differentiated 


between criterion and control groups with a com- 
pound probability of .05 or better (Baker, 1952; 
Katzell, 1951). 


RESULTS 


The cross-validation data for the initial 
keys are summarized in Table 1.* In both 
artistic and scientific groups, the mean scores 
of creative Ss excelled those of controls at the 
.001 level of significance, with normal curve 
ratios of 8.23 and 3.74, respectively. The 
point-biserial criterion correlations, whose 
significance is indicated by the above-normal 
curve ratios, were .64 and .35, respectively. 

Examination of the frequency distributions 
in Table 1 indicates that for both keys a 
cutoff score of 60 yields the maximum dif- 
ferentiation between creative and control 
groups. A simple index of differentiation for 
this purpose is the difference between cumula- 
tive percentage frequencies in creative and 
control groups. With a cutoff above 60, 96% 
of the CrA Ss in the cross-validation sample 
will be correctly classified, while 34% of the 
controls will be incorrectly classified as crea- 
tive. With the CrS key, 46% of the creative 
Ss will be correctly classified, while 10% of 
the controls will be incorrectly classified as 
creative. As proposed by Levy (1967), the 
“substantive significance” of a group differ- 
ence may also be evaluated in terms of the 
percentage of misclassification, defined as the 
percentage of individuals deviating from the 
mean of their own group more than halfway 
toward the mean of the other group. This 
percentage, which can be readily found from 
the point-biserial correlation, is 21 for the 
CrA key and 36 for the CrS key, as compared 
to a chance expectancy of 50. 

Once the validity of the initial keys (de- 
rived from one-half the available cases) was 
established, the final keys were constructed 
with items whose compound probability was 
based on data obtained from the total sample. 
The CrA key thus developed contains 140 
scorable items, while the CrS key contains 
65. Examination of the items in these final 


#In order to avoid negative scores in computa- 
tion, 70 was added to each score on the Artistic 
key and 40 to each score on the Scientific key. This 
adjustment, however, did not exclude negative scores 
from the total possible range, as shown in the 
Table 1 footnotes. 
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keys provides the most conclusive available 
description of the characteristics of the crea- 
tive students surveyed in this investigation; 
this description will be summarized in the 
next section. 


DISCUSSION 


Identifying Creativity in Artistic versus Sci- 
entific Fields 


In all comparisons—including mean differ- 
ences, criterion correlations, extent of overlap 
of score distributions, and number of signifi- 
cantly differentiating items in initial and final 
keys—the CrA students were more sharply 
differentiated from their controls than were 
the CrS students. It should be recalled in this 
connection that, in both groups, creative and 
control Ss were equated in GPA and that 
CrS students significantly excelled CrA stu- 
dents in this regard. From many sources of 
evidence it appears that the CrS high school 
student resembles the academically superior 
student more closely than does the CrA stu- 
dent. Both the nature of scientific creativity 
itself and current educational practices in 
science courses suggest that high academic 
achievement—together with its characteristic 
antecedents and correlates—plays a predomi- 
nant role in scientific creativity. On the other 
hand, the traditional activities of artists and 
writers, as well as the content of courses in 
these areas, differ quite conspicuously from 
typical academic activities. While creative 
students in both artistic and scientific fields 
can be significantly differentiated from ac- 
ademically equated controls by means of a 
biographical inventory, this study strongly 
suggests that the former can be more readily 
differentiated than the latter. 


Correlates of Creativity across Both Fields 


An examination of items common to the 
final CrA and CrS keys provides a picture 
of the correlates of creativity in high school 
boys, regardless of field. In terms of specific 
scorable items, only 14 differentiated in the 
same direction at the .05 level or better in 
both final keys. This number is somewhat 
of an artifact, however, since there were many 
other differently worded but basically equiva- 
lent items in the two keys. Based upon both 


identical and equivalent items, the following 
description focuses upon the best established 
characteristics which were corroborated by 
several items. It might be added that even 
these common characteristics tended to be 
revealed in more items in the CrA than in the 
CrS group. 

The familial background of the creative 
students was not only academically superior, 
but the parents tended also to provide role 
models of interest and creative expression in 
the student’s field. College education was more 
frequent among parents of creative students, 
especially among mothers. Magazines regu- 
larly available at home were more likely to 
be of the political commentary-foreign affairs 
or the cultural-intellectual types. Creative 
students were less likely than controls to 
come from homes in which neither student 
nor parents play a musical instrument. With 
regard to specific field of interest, both fathers 
and mothers in the CrA group read more 
books than those in the control group, and 
reading was more often listed as the father’s 
favorite leisure-time activity. The fathers had 
more often won honors or awards in literary 
or art fields. The mothers had more often 
specialized in art or literary subjects in col- 
lege, had more creative hobbies, more fre- 
quently visited art museums or galleries, and 
were more often listed as the family member 
who had taught the student most about art. 
CrA students were also less likely than con- 
trols to come from families having no hobbies. 
The CrS students more often than the con- 
trols reported that science magazines were 
read regularly in the home and that their 
fathers had scientific hobbies. It is also rele- 
vant to note that in the CrS group one or 
more siblings were likely to have received 
national recognition for academic achieve- 
ments. 

In a number of ways, the creative student 
himself manifests a strong intellectual and 
“cultural” orientation. In both artistic and 
scientific groups, creatives more often than 
controls reported that they were considering 
an Ivy League college. In both groups, a sig- 
nificantly greater number of creatives reported 
that they read over 50 books a year. Other 
relevant items, occurring in either artistic or 
scientific groups, pertain to types of news- 
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papers and sections of newspapers read 
regularly, membership in academic and honor 
societies, and interest in plays, music, and 
art museums. Both creative groups, on the 
other hand, reported significantly less interest 
in sports than did their respective control 
groups, as evidenced by attendance at spec- 
tator sports, reading about sports, or even 
daydreaming about them. 

A third set of items can be characterized 
as indications of pervasive and continuing 
enthusiasm for one’s chosen field and single- 
ness of purpose in its pursuit. Early mani- 
festations of creativity are evident in several 
items, such as childhood creation of new 
games, mechanical or electronic objects, 
poems, stories, or art products. Persistence 
of interest in the same field is indicated in 
elementary school awards and in favorite and 
best subjects in elementary school. Creative 
students more often than controls received 
awards in their special field outside of school 
and they were more likely to own such field- 
related articles as microscopes, chemistry 
sets, and artists’ supplies. They also reported 
having become so absorbed in a project in 
their field that they skipped a meal or stayed 
up very late. They more often chose hobbies 
in their special fields, which were usually 
directly related to their vocational goals. 
Extracurricular activities, membership in out- 
of-school organizations, magazines read, and 
summer jobs all illustrate the pervasiveness 
of their interest. The creative adolescent evi- 
dently finds his chosen work so satisfying and 
refreshing that he is willing to devote his 
leisure time to it. He chooses an area that 
is personally and deeply valuable to him 
and is hence able to persevere at it. 

The creative student’s perseverance and 
goal orientation are not to be construed as 
narrowing influences, however, since they are 
likely to be accompanied by a characteristic 
breadth of interest. Thus creative students 
in both fields were more likely than their 
controls to have pursued a variety of hobbies 
(frequently since childhood), to have collec- 
tions of different sorts, to play muscial in- 
struments, to attend plays, and to participate 
in extracurricular activities. In such extra- 
curricular activities, moreover, they were more 
likely to have held offices and to have been 


elected to the highest offices, such as president 
or editor. Another example is provided by the 
CrA students who, when asked to check their 
favorite high school subjects, checked more 
subjects than did the controls. 

Finally, certain items suggest that creative 
students exhibit a stronger drive toward 
novelty and diversity in their activities than 
do controls. To some extent, the previously 
discussed breadth of interest may be regarded 
as a manifestation of this drive. But other 
illustrations can be cited. For example, crea- 
tive students reported more frequent prefer- 
ence than controls for unusual art forms, such 
as Pop, Op, and primitive art; they had more 
unusual collections, such as road signs and 
spider webs; and they more often owned 
different kinds of records, such as classical 
and folk-song. Similarly, the tendency of crea- 
tive students in both fields to prefer friends 
both older and younger than themselves may 
indicate a desire for diversity in interpersonal 
relations. 


Differences between Creativity Correlates in 
Artistic and Scientific Groups 


A direct comparison of the biographical 
inventory responses of CrA and CrS students 
would be contaminated by differences in 
school and class distribution and GPA. In 
the present experimental design, each creative 
group was equated with its own control group 
in these variables. Hence the question now to 
be considered is whether the characteristics 
that differentiate the CrA students from their 
own controls differ in any systematic way 
from those that differentiate the CrS students 
from their controls. This question can be 
answered by comparing the two final biograph- 
ical inventory keys. 

The most conspicuous difference between 
the CrA and CrS keys occurs in a set of items 
indicative of the CrA student’s greater ex- 
posure to environmental diversity, through 
either personal or parental experiences. The 
parents of the CrA students were more likely 
to have been born in a different city, state, or 
country than the one in which the family was 
living. In the CrS group, the reverse relation 
was found, the parents being less likely than 
those in the control group to have been born 
in a foreign country and more likely to have 
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been born in the home city. CrA students 
were also more likely than controls to have 
moved from one state to another during their 
lifetime and to have visited more different 
parts of the country and more distant parts 
than the controls; they had also more often 
traveled out of the country—to Mexico, the 
Caribbean, and South America. 

Several items in the CrS key give evidence 
of some conventionality in both parents and 
students, evidence that is certainly absent 
from the CrA keys. The CrS students were 
more likely than their controls to have fathers 
who majored in business subjects in college, 
who play bridge as a hobby, and from whom 
the students had learned about sports. The 
CrS students themselves participate in in- 
dividual-type high school sports such as tennis 
more than do the controls, although no 
significant group difference was found in 
team sports. In contrast, CrA students spend 
less time participating in sports of any kind 
than do their controls. These and other items 
also suggest a stronger sex-role identification 
among the CrS than among the CrA students, 
the former tending to enjoy closer relation- 
ship with the father, the latter with the 
mother. 

With regard to childhood experiences, the 
CrS students reported less evidence of social 
participation in early childhood than did the 
controls. They more often stated, for example, 
that they had only one close friend and were 
never a leader of their childhood group ac- 
tivities under the age of 12. Although the 
picture of early childhood that emerges from 
the CrA key does not contrast with that of the 
CrS key, it nevertheless highlights a some- 
what different pattern of experiences and 
traits. Thus the CrA child was more likely 
than the controls to daydream and to have 
imaginary companions; his earliest childhood 
memory was more likely to date back to the 
age of 4 or earlier; and he had more unusual 
experiences, such as eidetic-type imagery. 
His relationships to his parents appear to 
have centered more often on shared interests 
in artistic or literary pursuits and less often 
on a warm personal relationship than was true 
of the control Ss. For example, the CrA 
student more often described his mother as 


enthusiastic and less often as understanding 
or sincere. He attended nursery school more 
often and did not describe family discipline 
as “always fair” as frequently as did the 
controls. If he had a personal problem, he 
was more likely to seek help from “several 
persons” than from his mother. 

A few noteworthy differences emerged with 
regard to school experiences. Preference for 
artistic and literary activities appeared in the 
CrA students’ reactions to their elementary- 
school curriculum, in questions pertaining to 
favorite subjects and subjects in which the 
students did best or poorest. In contrast, no 
items relating to the elementary-school cur- 
riculum differentiated CrS students from con- 
trols, these differences first appearing at the 
high school level. This finding may indicate 
that elementary-school instruction offers more 
creative opportunities in artistic and literary 
fields than in science. Both groups of crea- 
tive students gave some differentiating re- 
sponses to questions about outstanding teach- 
ers, but the focus of their replies differed. 
The CrA students reported having had two 
or more such teachers more often than did the 
controls. The CrS students differed from the 
controls in their more frequent listing of 
“wide knowledge” as the principal character- 
istic of such teachers and also in naming 
more than one trait to characterize them. 
It is also of some interest to note that CrS 
students tended to give more “uncertain” 
answers in the biographical inventory as a 
whole, a finding that may be related to the 
habits of caution fostered by scientific train- 
ing. 

In conclusion, creative students in artistic 
and scientific fields do share important com- 
mon features in experiential background and 
personal traits, but they also exhibit notable 
differences in both respects. Consequently, any 
investigation of creativity that fails to con- 
sider area of creative achievement may pro- 
duce a blurred or distorted picture of the 
creative person. Although the distinction be- 
tween scientific and artistic creativity is un- 
doubtedly a major one, further breakdown 
of artistic creativity into literary production 
and graphic art might be fruitfully explored 
in further research. 
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Taxicab drivers who reported an abnormal number of accidents in which they 
were struck from behind, when tested on a classical laboratory reaction-time 
apparatus, were found to have a reaction pattern made up of slow initiation 
time and compensatingly fast movement time. Drivers with this perceptual- 
motor pattern probably stopped their vehicles more abruptly in a way that 
cannot be duplicated by a following driver, or in the headway of the following 


vehicle. However, 


drivers with such an atypical perception reaction had 


fewer accidents in their headways and therefore had lower overall accident 
rates. Suggestions for further research and the possible implications of these 


findings were discussed. 


Motor-vehicle accidents are not a homo- 
geneous class of events with a common set 
of causes but heterogeneous events with 
multiple causes. In spite of this, much ac- 
cident research uses such criterion data as 
total number of accidents during the previous 
5 yr., or overall accident and violation rates. 
This practice may have contributed to the 
limited progress in accident research. Focus- 
ing on one normal aspect of driving such as 
stopping, and one kind of accident, the struck- 
from-behind accident, should result in greater 
research success. Although driving is a per- 
ceptual-motor skill, few relationships between 
perceptual-motor variables and accident cri- 
terion data have been found (e.g., Goldstein, 
1962; Lauer, 1959). The abandonment of 
perceptual-motor variables does not neces- 
sarily follow from these findings. Why not 
consider perceptual-motor variable patterns 
or combinations and relate these to the be- 
havioral patterns of drivers that might lead 
to accidents? Drake (1940) considered per- 
ceptual-motor tests in combination and 
found that individuals who performed faster 
on motor tests than perceptual tests had more 
industrial accidents than individuals whose 
perceptual speed was faster than their motor 
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speed. He hypothesized that this perceptual- 
motor discrepancy should identify high-ac- 
cident drivers. 

Although the existence of either a general 
perceptual speed factor or of a general motor 
speed factor has been disproved (e.g., Fleish- 
man, 1953, 1954), the possibility that a 
refined pattern of component perceptual- 
motor factors should be associated with 
specific driving acts, if not overall driving 
success, has to be considered as an alternative. 
Of course, such hypothesized predictor vari- 
ables must be related to more refined driving 
measures than overall accident rate. Accord- 
ingly, the primary aim of this study was to 
determine the relationship between one pat- 
tern of perceptual-motor behavior and one 
type of accident. The perceptual-motor be- 
havior considered was the ratio of simple 
reaction time (RT) to jump reaction time 
(JRT) and the type of accident was the rear- 
end collision or the struck-from-behind ac- 
cident. 

The  struck-from-behind accident was 
chosen as the criterion variable not only be- 
cause of the common-sense likelihood of a 
link with reaction pattern, but also because 
this kind of accident is numerous enough to 
yield sufficient cases for statistical analysis; 
for example, in one recent state study this 
class of accident accounted for 25% of all 
highway accidents, and what is of even greater 
significance, 30% of all injuries (State of 
New York, 1964). In freeway driving, where 
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there are no intersections and all traffic is 
moving in the same direction, more than 50% 
of the accidents fall into this category. 

In the present study, response time on 
reaction-time tests had been fractionated into 
initiation time and movement time. Initiation 
time is the time required to begin the move- 
ment in response to the stimulus. The time 
to complete the perception reaction as rep- 
resented in the jump reaction time (JRT) 
test is made up of initiation time plus move- 
ment time.? 

In the present study, it was hypothesized 
that drivers whose initiation time is an ab- 
normally long portion of their total percep- 
tion reaction (i.e., drivers who have a high 
ratio of RT to JRT) would have a higher 
percentage of accidents in which they were 
struck from behind than would drivers whose 
perceptual-motor RT shows a typical ratio. 
Why should there be any difference in the 
stopping characteristics of two drivers, both 
of whom take .60 sec. for their total percep- 
tion reaction, but who differ in the length of 
the initiation time and movement com- 
ponents? Why should there be a difference 
between Driver A who typically takes .15 sec. 
to initate the response and .45 sec. for move- 
ment, thus has a reaction-pattern score of 
J or .25, and Driver B who takes .30 sec. 
for movement, thus has a reaction-pattern 
score of at) 

.60 
partly a function of the force that is applied 
by the driver, and force equals Mass X Ac- 
celeration (F = MA). Therefore, it was hy- 
pothesized that the driver who makes up 
for long initiation time with fast movement, 
that is, has short movement time, has greater 
limb acceleration and therefore applies greater 
force and stops his vehicle in fewer feet after 
contact with his brake pedal is made and 
his brake lights come on. Actually with such 
compensating limb speed, not only is the ac- 


of .50? A vehicle’s stopping is 


8 The author is indebted to A. Zavala for this 
information on the RT and JRT tests. These com- 
ponents have been identified in factorial studies 
(e.g., Fleishman, 1954) as the reaction-time factor 
(RT) and the speed of limb movement factor 
(SLM). Reference to these components as initiation 
time and movement time is consistent with both 
factorial findings and logical analysis. 


celeration increased, but the mass is increased 
as additional g forces enter into braking to 
make the stop even more abrupt and un- 
repeatable by a following driver. As total 
reaction time is transformed into reaction dis- 
tance, drivers with this reaction pattern typi- 
cally stop in less road distance or within the 
expected headway of the following driver. By 
systematically leaving less distance between 
themselves and following drivers, such drivers 
increase their probability of being struck 
from behind, especially when such hazards 
as tailgating and slippery roads exist. In 
effect, long initiation time followed by com- 
pensating short movement time probably 
desynchronizes the traffic flow or constitutes 
a desynchronizing reaction pattern. 

The  struck-from-behind accident is a 
prima-facie extreme instance of such a re- 
action pattern. While the principal hypothesis 
was that drivers with a desynchronizing re- 
action pattern will have a higher incidence of 
struck-from-behind accidents than drivers 
with normal reaction patterns, if the atypical 
drivers stop short, the corollary hypothesis 
that they should collide with vehicles in their 
headway Jess frequently would also follow 
and was also tested. Although struck-from- 
behind accidents constitute a large proportion 
of accidents, the majority of accidents still 
occur in one’s headway. Therefore, it would 
follow that drivers with a desynchronizing 
reaction pattern would have a lower overall 
accident rate. 

The question might well be raised as to 
why the probability of a struck-from-behind 
accident is not primarily a function of a 
simple perceptual-motor factor movement 
time (as measured by JRT— RT), rather 
than as a function of a less parsimonious 
and more tricky ratio, the desynchronizing 
reaction pattern (as measured by RT/JRT). 
First, it is not more parsimonious to use 
movement time, as movement time is also a 
derived score, obtained by subtracting RT 
from JRT. Second, to use movement time 
as a predictor of the accident criterion would 
be to ignore previous findings of no relation- 
ships between movement time and accidents 
(Goldstein, 1962). Third, as noted above, 
Drake (1940) found a relationship between 
perceptual-motor ratio and accident proneness 
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but none between individual perceptual-motor 
tests and accidents. Finally, a relationship 
between the desynchronizing reaction pattern 
and automobile accidents, especially struck- 
from-behind accidents, follows from a rational 
consideration of automobile braking. The 
braking perception reaction approaches the 
laboratory JRT only in a forced or emergency 
stop. The driver whose initiation time is 
usually longer will have to utilize his shortest 
movement time more often than the driver 
with faster initiation time. While it is granted 
that movement time determines the stopping 
distance, other things being equal, it seems 
reasonable to assume that short movement 
time will increase the probabilities of struck- 
from-behind accidents only when it follows a 
long initiation time that necessitates a 
“crash” stop. Accordingly, it was also hy- 
pothesized ad hoc that movement time alone 
would not correlate with accident incidence. 
Since measures of RT and JRT were to be 
obtained in computing the desynchronizing- 
reaction-pattern and the movement-time fac- 
tors, possible relationships between these two 
variables and struck-from-behind accidents 
could also be examined. Following Goldstein 
(1962) it was also hypothesized ad hoc that 
no relationship between the RT and JRT 
tests and accident incidence exists. 


METHOD 


The data upon which this study was based were 
obtained from 127 Washington, D. C. taxicab 
drivers. The study began with 160 Ss. However, 
only the 127 experienced cab drivers—those with 
2 or more yr. full-time hacking or equivalent (part- 
time hacking was considered as one-half time)— 
were retained. The Ss averaged 48 yr. of age. Each 
S was asked for the number of accidents he had 
had in the past 5 yr. He was then asked to de- 
scribe each accident. Drivers were first classified 
according to their overall accident rates into three 
croups: (a) O accidents or accident free, (b) 1 ac- 
cident, and (c) 2-9 or multiple accidents. Then 
those drivers who reported one or more accidents 
were classified according to kind of accident— 
struck-from-behind and _ other-than-struck-from-be- 
hind (Other), or accidents in their tailway and 
accidents in their headway. 

The drivers were administered two kinds of RT 
tests. One was simple reaction time (RT) where 
S’s finger rested on a microswitch which was de- 
pressed in response to a light signal. The second 
test was jump reaction time (JRT) where S 
had to move his hand 12 in. from a rest position to 


DRIVER REACTION PATTERN 51 


the button in making the response. In both tests, 
there was a set of 20 trials separated by a fore- 
period ranging from .5 to 1.5 sec.—varied in random 
order. The ratio of RT to JRT was computed and 
the drivers were classified as having either a normal 
or an atypical reaction pattern. A ratio that was .5 
standard deviations above the overall mean was 
considered to be great enough to constitute an 
atypical or desynchronizing reaction pattern (DRP). 
Contingency tables were prepared in which the 
accident rates of the Normal and DRP groups 
were compared. Measures of initiation time—(JRT 
minus RT)—and of jump reaction time (JRT) were 
also computed and correlated ad hoc with accident 
incidence. 


RESULTS 


To study the relationship of the reaction 
pattern to kinds of accidents, the 23 Ss re- 
porting no accidents were removed since this 
group contributed no variance regarding their 
kind of accident; or, to be fatalistic, they had 
not manifested their kinds of accidents yet. 

From Table 1 it appears that drivers with 
the atypical reaction pattern, if they had an 
accident, were more apt to be struck from 
behind than were drivers who have a normal 
reaction pattern. The drivers with a de- 
synchronizing reaction pattern were heavily 
represented in the multiple struck-from-behind 
accident category—36% of the 28 drivers 
found to have this pattern had been struck 
from behind more than once, or were chronic 
struck-from-behind accident “victims” com- 


TABLE 1 


COMPARISON OF TAXICAB DRIVERS WITH A DESYN- 
CHRONIZING REACTION PATTERN (DRP) AND 
Norma REACTION PATTERNS WHO HAD ONE 
oR More ACCIDENTS WITH RESPECT TO 
STRUCK-FROM-BEHIND (SFB) 











ACCIDENTS 
Reaction 
pattern Total 
Ao ee | OE 
Normal | DRP 
No. SFB accidents 
0 (only Other) accidents 37 9 46 
1 accident ol 9 40 
2-8 multiple 8 10 18 
Total 76 28 104 








Note.—x? = 9.19; p < .02. 
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TABLE 2 


COMPARISON OF TAXICAB DRIVERS WITH A DESyYN- 
CHRONIZING REACTION PATTERN (DRP) AND 
NorMAL REACTION PATTERNS WITH RESPECT 
TO MULTIPLE STRUCK-FROM-BEHIND 
(SFB) AND MULTIPLE OTHER 

















ACCIDENTS 
Reaction 
pattern Total 
no. 
Normal | DRP 
Multiple (2-8) SFB accidents 8 10 18 
Multiple (2-8) Other 
accidents 32 6 38 
Total multiple-accident 
drivers 40 16 56 





Note.—x? = 9.86; p < .001. 


pared with 10% of the 76 drivers with a 
normal reaction pattern. 

While a single accident may be a chance 
event, multiple accidents of the same type 
present unusually strong support for the 
operation of a systematic variable and causal 
link. So, drivers with two or more accidents 
were compared further. The double-edged 
effect of the atypical reaction pattern is shown 
in Table 2. Only 6 out of 16 (or 38%) of the 
multiple-accident DRP group had multiple 
Other accidents, whereas 32 (80%) of the 40 
drivers with a normal reaction pattern had 
their accidents “up front.” The corollary hy- 
pothesis that drivers with the desynchronizing 


TABLE 3 


COMPARISON OF TAXICAB DRIVERS WITH A DEsyn- 
CHRONIZING REACTION PATTERN (DRP) AND 
NorMAL REACTION PATTERNS WITH 
RESPECT TO OVERALL 
ACCIDENTS 








Reaction pattern 





No. accidents Total no. 
Normal DRP 
0 nit 12 23 
1 32 9 41 
2-9 44 19 63 
Total 87 40 127 





Note.—x? = 6.35; » < .05. 


reaction pattern have fewer Other accidents 
was thus supported. 

Having considered the relationships of the 
reaction pattern to accidents in one’s tailway 
and also in one’s headway, the safe driver— 
one who had neither class of accidents or was 
accident-free—was examined next. In Table 3 
the accident-free drivers were compared with 
drivers who had either headway or tailway 
accidents. Thirty percent of the 40 drivers 
with a desynchronizing reaction pattern were 
found to be accident free compared to only 
13% out of the 87 drivers with normal re- 
action patterns. When the number of ac- 
cidents was examined, the Normal group re- 
ported 2.01 accidents in the preceding 5 yr., 
whereas the DRP group reported 1.65 ac- 
cidents over the same period. On two-way 
roads with cross traffic the majority of ac- 
cidents occur in one’s headway. The Other 
accident rate reflects this fact. However, in 
one-way driving without intersections the 
struck-from-behind accident is often the pre- 
vailing kind of accident. When most driving 
occurs under such circumstances, then the 
driver with the desynchronizing reaction 
pattern may well have the highest overall 
accident rate and no longer be the safer 
driver. 

Only the desynchronizing-reaction-pattern 
score accounted for accident variance when 
chi-square contingency tables were used. 
When product-moment correlation was util- 
ized to compare the four possible variables, 
no relationship was found between the single 


TABLE 4 


CORRELATION OF REACTION-TIME VARIABLES AND 
STRUCK-FROM-BEHIND (SFB) 











ACCIDENTS 
r 
Initiation time (IT) and no. SFB accidents ot 
Jump reaction time (JRT) and no. SFB 
accidents .05 
Movement time (JRT — IT) and no. SFB 
accidents on 


Desynchronizing reaction pattern (==) and 


no. SFB accidents one 
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perceptual-motor scores of RT and JRT and 
accident incidence. Contrary to hypothesis, 
movement time was significantly correlated 
with struck-from-behind accidents. However, 
the finding that the correlation of movement 
time was lower than the correlation of de- 
synchronizing reaction pattern with struck- 
from-behind accidents could be taken as sup- 
port for the general thesis that while the 
speed of movement time determines the stop, 
the stop becomes more hazardous when move- 
ment time is combined with a slow initiation 
time as represented by the desynchronizing 
reaction pattern. 


DISCUSSION 


The finding of an empirical relationship, 
if confirmed by further studies, has practical 
significance. However, considerable research 
is called for before it would be clear how 
this finding would pay off. The crux of such 
research would be to test the assumption 
underlying the desynchronizing reaction pat- 
tern, that is, that the DRP driver’s percep- 
tion reaction in stopping results in system- 
atically less space between him and the fol- 
lowing car. The arrangement used in re- 
search on car following (Gazis, Herman, & 
Rothery, 1961) could measure the hy- 
pothesized differential space relationship re- 
sulting from the desynchronizing reaction 
pattern and other variables considered in this 
present study. If the hypothesized hazard is 
confirmed and specified quantitatively, fur- 
ther research to determine whether the de- 
synchronizing reaction pattern is relatively 
fixed or immutable would be called for. 
Implications along the following lines might 
follow if the desynchronizing reaction pattern 
were found to be relatively stable: 

(a) Selection and driver testing. Taxicab 
drivers, chauffeurs, patrolmen, and com- 
mercial drivers could have a test of their re- 
action pattern included in their selection tests 
and the results put on record. 

(6) Training. Follow-up counseling and 
instruction procedures could be tried and, if 
found effective, incorporated into driving 
training in general. Specific counseling that 
one is prone to a specific kind of accident, 
in this case to being struck from behind, 


might prove more effective than general warn- 
ings to be careful. 


(c) Legal. This measure could constitute 
evidence for assigning a greater portion of the 
costs and culpability to a leading driver. 
This might not only be more equitable than 
the present tendency to charge the following 
driver, but also provide an incentive to the 
antecedent driver to compensate for his de- 
synchronizing reaction pattern. Such a driver 
might also have a warning fixed to the rear 
end of his car after he had a struck-from- 
behind accident to caution following drivers. 

(d) Human engineering. If the perceptual- 
motor regularity studied here is established 
as a variable in accidents, changes in the 
braking system might be called for. These 
might even take such forms as dampening 
the application of braking force for some 
drivers, more effective signaling of stopping 
to the following driver, and perhaps adjusting 
the braking system to the driver as one ad- 
justs the seats. 

If this pattern is mutable, then changing 
the desynchronizing reaction pattern would 
be called for. The most obvious way would 
be to reduce the initiation-time factor and 
thus bring the perception-reaction pattern 
into line with typical drivers. This would 
assume that the pattern has a large attitudinal 
rather than psychophysiological basis. One 
possible and feasible way of altering the re- 
action pattern is to slow down the movement- 
time component so that it is no longer com- 
pensatingly fast. 

A slowing of movement to decrease a 
hazard and probability of accident seems in- 
credible. Would not a reduction of accidents 
in one’s tailway be at the expense of more 
accidents in one’s headway? A post hoc ex- 
amination of the 40 DRP drivers brought to 
light 6 men who had both very long RT 
and long JRT; that is, their movement time, 
while short in relation to their initiation time, 
was longer than the sample mean or not fast 
in absolute terms. It was found that this 
group had a mean struck-from-behind ac- 
cident rate of .33 (total sample mean = .74), 
with only one driver reporting any struck- 
from-behind accidents; he reported two. This 
group reported a total of five accidents, or a 
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mean rate of .83 (total sample mean = 1.83). 
Three of these six drivers were accident free. 
While no conclusions are warranted on the 
basis of this small group alone, the hypoth- 
esis, that a slowness in initiating a response 
without compensating limb speed is associated 
with the below-average accident rate, calls 
for further investigation. It may be that 
these drivers learned to live with their slow 
reaction pattern. If subsequent studies were 
to confirm that such a reaction pattern does 
not lead to more headway accidents, and 
results in a lower rate of tailway accidents, in 
a roundabout way an aspect of safe driving 
would be identified. 
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Questionnaire data were obtained from 685 faculty members in 12 liberal 
arts colleges. Measures included amounts of administrative influence, bases of 
influence for the dean and faculty, and levels of faculty satisfaction. Faculty 
satisfaction was greatest in those colleges having the highest total influence 
across all levels of the academic hierarchy. Faculty also indicated greatest 
satisfaction with deans who have relatively high influence over college affairs, 
and whose influence is based upon expertise and respect rather than upon 
legitimate authority and coercion. The results are discussed in the light of 
similar findings in business organizations. 


The research reported in this paper has two 
basic purposes. The first is to provide some 
tentative, but hopefully useful, guidelines for 
college administrators. A second and broader 
purpose is to explore the parallels between 
administrative processes in higher education 
and those in other kinds of organizations, 
especially industrial ones. Studies of this sort 
should guide our efforts to apply the results 
of organizational research to the academic 
setting. And perhaps more important, they 
should tell us something about the possibilities 
for developing general theories of administra- 
tion, applicable to business, governmental, 
political, voluntary, and academic organiza- 
tions. The present paper, which focuses on 
the role of the academic dean, is the first of 
several reports based on this research in 
liberal arts colleges. 

In many respects, the academic dean is the 
faculty’s most important voice in the adminis- 
tration of the liberal arts college. He repre- 
sents the faculty viewpoint to the president 


1The research reported here was supported by 
grants from the Carnegie Corporation of New York 
to the Survey Research Center and the Center for 
Study of Higher Education, The University of 
Michigan. 

Data for the present analysis are taken from a 
larger survey carried out by A. L. Fisch and the 
author. Special thanks are due to the many faculty 
members who devoted their time and effort to 
provide the information for this study. 

Thanks are also due to the following colleagues 
who read an earlier draft and offered suggestions: 
Robert Cope, Glenn Jones, Arnold Tannenbaum, 
George Wieland, and Alvin Zander. 


and trustees, and he also represents higher 
administration to the faculty. His role is thus 


“ similar to many middle-management positions 
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in business organizations. Given this set of 
dual loyalties and commitments, what are 
some of the qualities of an effective dean? 
If for the moment we are guided by research 
findings in nonacademic organizations, the 
following hypotheses may be considered: 


1. The effective dean is influential. A study 
in industry by Pelz (1951, 1952) indicated 
that a supervisor’s effectiveness in dealing with 
his subordinates depends in part upon his 
influence with his own superiors; the in- 
fluential supervisor can “deliver the goods” 
for his men. A recent study of account execu- 
tives engaged in the sale of intangibles 
(Bachman, Smith, & Slesinger, 1966) pro- 
duced similar results; satisfaction and per- 
formance tended to be higher when the office 
manager was perceived as highly influential 
over “how the office is run.” To the extent 
that these findings are applicable to an aca- 
demic situation, they imply that the “strong” 
dean will be effective. 

It does not follow that if the dean is 
highly influential then someone else in the 
college administration will be proportionately 
less influential. Recent studies suggest that 
the total amount of influence (or control) in 
an organization is best treated as a variable, 
not a fixed value; indeed, a relatively high 
level of control across all organizational eche- 
lons is associated with high performance and 
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satisfaction (Tannenbaum, 1962). Tannen- 
baum (1961) and Likert (1961) have argued 
that this pattern of “high total control” is 
successful because it involves members at all 
levels of the organization, leading to more ef- 
fective decisions and also to higher motivation. 
This suggests a supplementary hypothesis: the 
effective college administration is one with 
relatively high influence at all levels. 

2. The effective dean’s influence is based 
upon a relatively high degree of competence 
and personal attractiveness, and a relatively 
low dependence upon his “legitimate au- 
thority” and coercion. Five different ‘bases of 
power,” originally distinguished by French 
and Raven (1960), were examined by Bach- 
man et al. (1966). Satisfaction and to some 
extent performance were positively related to 
“expert” power (based on competence and 
experience) and “referent” power (based on 
personal admiration); negative correlations 
occurred with “legitintate” power (based on 
hierarchical “rights”) and “reward” and 
“coercive” power (based on the potential use 
of positive and negative sanctions). 

The above hypotheses are stated in very 
broad terms. For example, they deal with the 
dean’s “effectiveness” as if it were a unitary 
dimension. In fact, of course, there may be 
many criteria for evaluating performance in 
the role of dean. The present research is 
necessarily limited to studying the dean’s 
effectiveness in terms of his impact on the 
faculty; specifically, the criteria will be 
several measures of faculty satisfaction, in- 
cluding their view of how well the dean ful- 
fills his responsibilities. 


MeEtHOD 


Research Sites 


The respondents selected for study consisted of 
all full-time faculty members at each of 12 liberal 
arts colleges belonging to a regional association of 
colleges. This choice provided a number of institu- 
tions which are somewhat homogeneous with respect 
to size, geographic location, and educational reputa- 
tion. Since our interest is concentrated upon the 
effects and correlates of certain administrative fac- 
tors, it seemed desirable to select colleges which 
would be fairly similar in other characteristics. A 
second and more practical reason for the choice was 
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the willingness of member colleges to participate 
in the study.” 

At the time of the survey (April-May 1964), 
faculty size at the 12 colleges ranged from 67 to 
173, with a median of 92 and a mean of 101. 
A total of 1,210 questionnaires were sent to eligible 
faculty members; after several months, during which 
time two follow-up letters and a duplicate question- 
naire were sent to each nonrespondent, a total of 
685 were returned. While a higher response rate 
would have been preferred, of course, the final re- 
sponse (approximately 60% in each of the 12 col- 
leges) was consistent with the author’s expectations 
for a long (19-page) mail questionnaire administered 
near the end of the academic year.® 


Measures 


All data used in the study consist of faculty 
responses to questionnaire items. Only a small pro- 
portion of the total questionnaire material has 
been included in the present analysis. 

Influence. The following question was used to 
assess influence over college matters: “In general, 
how much influence does each of the following 
groups or persons actually have in determining the 
policies and actions of your colleges?” Response 
categories ranged on a 5-point scale from (1) 
“Little or none,” to (5) “A great deal.” The respond- 
ents were asked to rate the amount of influence of 
each of the following groups or persons: the board 
of trustees, the president, the academic dean, the 
department chairmen (all of them), the faculty as 
a group, and the students as a group. The present 
study uses two measures based on this item: a 
measure of “total influence” (as indicated by the 
mean of all six influence ratings), and the in- 
fluence of the academic deans (based on the rating 
of the dean alone). 

Bases of influence. The following questionnaire 
item was used to assess five bases of the dean’s 
influence over the respondent (referent, expert, 
reward, coercive, and legitimate influence, in that 
order): 


Listed below are five reasons generally given 
by people when they are asked why they do the 
things their superiors suggest or want them to do. 


2The problem of gaining access to a research 
site and obtaining the cooperation of prospective 
respondents is crucial in research of this sort. Special 
thanks are due to Algo D. Henderson and Eldon L. 
Johnson, of the Center for Study of Higher Educa- 
tion, for their help in assuring the full cooperation 
of each college in the association. 

3Tt must be stated clearly that the purpose of the 
research based upon this sample of college faculty 
is not to describe “the average faculty member.” 
Rather, the primary intent is to discover relation- 
ships among variables; such relationships are likely 
to appear even if the set of respondents departs 
somewhat from an accurate probability sample 
drawn from the universe of faculty members. 
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Please read all five carefully. Then number them 
according to their importance to you as reasons 
for doing the things your academic dean suggests 
or wants you to do. Give rank “1” to the most 
important factor, “2” to the next, etc. 

A. I respect him personally, and want to act 
in a way that merits his respect and admiration. 

B. I respect his competence and judgment about 
things with which he is more experienced than I. 

C. He can give special help and benefits to those 
who cooperate with him. 

D. He can apply pressure or penalize those who 
do not cooperate. 

E. He has a legitimate right, considering his 
position, to expect that his suggestions will be 
carried out. 


An additional item was included on an exploratory 
basis to measure faculty perceptions of their in- 
fluence over the dean. The respondents were given a 
list of alternatives similar to those listed above, 
but in this case they were asked to “. . . number 
them according to what you think is their importance 
to your academic dean as reasons for doing the 
things you suggest or request of him.” 

Because of the ranking procedure involved, the 
five bases of influence are not independent; in a 
sense, any single basis of influence can be given 
prominence only at the expense of the other bases. 
On the other hand, the ranking procedure has the 
advantage of forcing the respondent to discriminate 
among all the bases, rather than giving prominence 
to only one or two. Moreover, it avoids a common 
contaminating tendency to rate the amount of in- 
fluence rather than the nature or bases of influence. 

Faculty satisfaction. Three measures of satisfaction 
were used; two deal with overall “job satisfaction,” 
and the third specifies satisfaction with the dean. 

In order to obtain some indication of commitment 
or attachment to present college positions, each 
respondent was asked to indicate (a) his first choice 
of another college or university and (6b) the mini- 
mum salary condition that would induce him to 
leave his present position to accept a similar one at 
the other college or university. Response alternatives 
covered a 7-point scale ranging from (0) “I would 
accept even if it involved about a 20% salary 
reduction,” to (6) “I would not accept such a posi- 
tion, no matter how great a salary increase might 
be involved.” 

A second measure of satisfaction simply pre- 
sented the respondent with the following state- 
ment, and asked him to indicate his agreement or 
disagreement on a 5-point scale: “While no job 
can be expected to be perfect, there are really very 
few things that I would change about mine if I 
had the power to do so.” 

The measure of satisfaction with the dean also 
presented a statement, and asked the respondent to 
indicate agreement or disagreement on a 5-point 
scale: “All things considered, I am personally quite 
satisfied with the way the academic dean of my 
college fulfills his responsibilities.” 











TABLE 1 
COLLEGE ADMINISTRATIVE CHARACTERISTICS AND 
FACULTY CRITERIA: MEANS AND 
STANDARD DEVIATIONS 
M SD among SD 
rating individuals coltege Ws 
(N = 685) | (NV = 685) (N = 12) 
Influence 
By dean 3.03 1.00 O54 
Total influence 3.49 0.47 0.14 
Bases of dean’s influence 
Legitimate 2.40 1,14 0.20 
Expert 1.94 ples 0.38 
Referent 2.48 1.09 0.19 
Reward 3.67 0.96 0.16 
Coercive 4.40 1.07 0.26 
Bases of faculty influence 
Legitimate eae 0.91 0.14 
Expert 1.93 0.92 0.11 
Referent 2.85 0.91 0.17 
Reward 3.66 0.86 0.16 
Coercive 4.78 0.63 0.13 
Faculty satisfaction 
Minimum conditions 
for leaving S205 1.66 0.41 
With job as a whole 3.29 ead 0.19 
With dean of college 3.56 ets 0.45 














Clearly, these several measures do not relate to a 
simple unitary dimension of satisfaction. It is quite 
possible, for example, to find a faculty member who 
would rate his present position favorably, and yet 
be eager to go to a large and prestigious university— 
even at a slight reduction in salary. And that same 
respondent could be either satisfied or dissatisfied 
with the way the dean fulfills his responsibilities. 
Nevertheless, moderate positive correlations appeared 
among the three measures. The product-moment 
correlation between the two measures of overall 
satisfaction is .35; satisfaction with the dean cor- 
relates .20 with attachment to present position, and 
.34 with the second measure of overall satisfaction 
(in each case p < 01, two-tailed test). 


RESULTS 


The major findings of the study are sum- 
marized in Tables 1, 2, and 3. Table 1 pre- 
sents mean responses and standard deviations 
for all variables. Tables 2 and 3 present cor- 
relations between college administrative char- 
acteristics and faculty criterion ratings at 
two levels of analysis (described below). 
Before presenting the substantive findings, it 
will be helpful to outline the procedures and 
rationale for data analysis. 


Statistical Analysis and Rationale 


The analytic approach used in this study 
closely parallels that used by Bachman et al. 
(1966). This approach assumes that the best 
available measure of an “organizational char- 
acteristic” is often some composite of percep- 
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TABLE 2 


COLLEGE ADMINISTRATIVE CHARACTERISTICS CORRE- 
LATED WITH MEAN FACULTY 
CRITERION RATINGS 














College mean faculty criterion 
ratings 
College administrative 
characteristic Minimum | Satisfac- | Satisfac- 
condi- tion with | tion with 
tions for jobasa dean of 
leaving whole college 
Influence 
By dean 16 —.07 .69* 
Total influence -42 19 74% 
Bases of dean's influence 
Legitimate —.24 —.27 —.52 
Expert 38 56 Tote 
Referent 46 57 .67* 
Reward —.31 —.61* —.80** 
Coercive —.44 —.57 —.70* 
Bases of faculty influence 
Legitimate —.16 .42 47 
Expert Ad .61* .62* 
Referent Be ou 39 
Reward —.40 —.75** —.75** 
Coercive —.05 —.24 —.71** 














Note.—N = 12 colleges. } 
* » < .05, two-tailed, product-moment correlation. 
#* » < .01, two-tailed, product-moment correlation. 


tions by a number of respondents in the 
organization. Thus in the present study the 
measure of a college administrative char- 
acteristic always consists of the mean of all 
faculty perceptions. Each characteristic is 
correlated with faculty satisfaction measures 
at two distinct levels of analysis: college mean 
criterion scores and individual criterion scores. 

College mean criterion scores. The first 
level of analysis is the college as a whole; 
Table 2 presents correlations computed be- 
tween administrative characteristics and mean 
faculty satisfaction scores (V = 12 colleges). 
An individual faculty member’s satisfactions 
with his position and dean are presumably 
influenced by (@) certain college-level ef- 
fects—administrative characteristics which 
are common to most of his campus colleagues, 
and also (0) a host of idiosyncratic factors 
which differ from person to person. The use 
of mean satisfaction scores largely cancels the 
effects of personal factors (6), and thus high- 
lights the impact of college-level effects (a). 

Individual criterion scores. The second level 
of analysis is the individual faculty member; 
Table 3 presents correlations computed be- 
tween administrative characteristics (still 
based on mean perceptions) and individual 
faculty satisfaction ratings (V = 685 faculty 
members). This form of analysis deals with 
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the question: Given the many causes of any 
individual’s satisfactions, what is the relative 
importance of certain college administrative 
characteristics? It is clear at a glance that 
the correlations in Table 3 are, on the whole, 
much smaller than those in Table 2. In 
Table 2 college-level effects have a ‘mo- 
nopoly” in their impact upon satisfaction, 
since individual differences are held more-or- 
less constant through the use of college mean 
satisfaction data. In Table 3 this impact of 
college-wide factors is “shared” with the many 
causes that may be different for each in- 
dividual. 

Isolation of college-level effects. An addi- 
tional question which can be treated at the 


TABLE 3 


COLLEGE ADMINISTRATIVE CHARACTERISTICS CORRE- 
LATED WITH INDIVIDUAL FACULTY 
CRITERION RATINGS 

















Individual faculty criterion ratings 
College administrative eae F , 
hate remuhic Minimum | Satisfac- Satisfac- 
condi- tion with | tion with 
tions for jobasa dean of 
leaving whole college 
Influence 
By dean -05 .00 .26** 
—.02 —.08 .06 
Total influence wis .03 iar 
.07 —.03 .20** 
Bases of dean's influence 
Legitimate —.06 —.07 —.21** 
—.05 —.06 —.18** 
Expert .09% .10* .29** 
.05 .05 .18** 
Referent .11* ie .26** 
.09* .08 .22** 
Reward —.07 —.11* —.34** 
—.07 —.09* —.31** 
Coercive —.11* —.11* —.27** 
—.08 —.06 —.20** 
Bases of faculty influence 
Legitimate —.04 .09* ade 
—.04 .09* .20** 
Expert .09* -09* 24% 
.08 .08 .22** 
Referent .05 .02 .14** 
.04 .00 .10* 
Reward —.08 —.13** —.31** 
—.07 —.10* — .28** 
Coercive .00 —.01 — .27** 
01 —.03 —.22** 














Note.—N = 685 faculty members. For each cell in the 
table two correlations are given: The upper value is the zero- 
order correlation, 745; the lower (boldfaced) value is the partial 
correlation, 745.2; where, 


A =college-wide mean of ratings of administrative char- 
acteristics 

individual faculty criterion ratings (satisfaction, etc.) 
individual faculty ratings of administrative character- 
istics. 


As a matter of convenience, all significance levels were cal- 
culated on the basis of 500 df. Actual response varied some- 
what from item to item, but no combination of items (ab) 
provided less than 500 df. 

*p < .05, two-tailed, product-moment correlation. 

*k » < .01, two-tailed, product-moment correlation. 
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individual level of analysis has to do with the 
way in which administrative characteristics 
have an influence on individual satisfactions: 
Is it simply the perception of these character- 
istics which affects satisfaction? In order to 
deal with this issue, each relationship be- 
tween an administrative characteristic and in- 
dividual faculty satisfaction is presented in 
two forms in Table 3: the zero-order correla- 
tion and a partial correlation. The partial cor- 
relation rules out the effects of each individ- 
ual’s own perception of the college administra- 
tive characteristic, thus removing that portion 
of the relationship which might be attributed 
to “phenomenological” or “halo” effects. The 
portion that remains thus represents a ‘“phe- 
nomenology-free” measure of the college-level 
effect. For a discussion of the background 
and rationale for this form of analysis, see 
Tannenbaum and Bachman (1964); its im- 
plications for the present study are treated 
later in this paper. 


Influence 


The mean level of influence attributed 
to the dean is midway between “moderate” 
and “considerable.” As Table 1 indicates, the 
standard deviation among college mean 
ratings is relatively large; thus it appears that 
the 12 deans vary considerably in their in- 
fluence. The mean total influence rating is 
also midway between “moderate” and “con- 
siderable”; however, the variation from col- 
lege to college is much smaller (perhaps be- 
cause influence in this case is averaged across 
six levels from board of trustees to students). 

Colleges in which the total amount of in- 
fluence is high, and those in which the dean’s 
influence is high, show greater mean satisfac- 
tion with the dean. These administrative 
characteristics are also related to individual 
faculty members’ satisfaction with the dean; 
however, the correlations with dean’s influence 
disappear almost completely when the effects 
of individual perceptions are removed through 
partial correlations (Table 3). The more gen- 
eral measures of faculty satisfaction (mini- 
mum conditions for leaving, and satisfaction 
with the job as a whole) show little relation- 


ship to our influence measures at either level 
of analysis.‘ 


Bases of Influence 


When asked to rank reasons for doing what 
the dean suggests or wishes, faculty members 
gave the highest ranking to the measure of 
expert influence (“I respect his competence 
and judgment about things with which he is 
more experienced than I’); next most im- 
portant were legitimate and referent influence; 
lowest rated were reward and coercive in- 
fluence, that is, the use of sanctions (Table 1). 
Correlations with the criterion variables were 
positive for expert and referent influence, 
negative for legitimate, reward, and coercive 
influence.® 

Turning next to faculty perceptions of 
why the dean follows faculty suggestions or 
requests, it is clear that the results are largely 
similar to those for the bases of the dean’s 
influence. The important exception is legiti- 
mate influence. Faculty members indicated 
that the dean’s most important reason for 
acting as they wish is that “The dean feels 
that his job properly includes an obligation 
to consider and act upon my _ suggestions 
as a faculty member.” Moreover, this legiti- 
mate influence of the faculty over the dean is 
positively related to satisfaction with the 
dean. The ordering of the four other bases 
of dean’s influence, and the actual mean rank- 
ings are quite similar (see Table 1). The 
correlations with satisfaction measures are 


4Tt is of interest to note that the respondents 
were also questioned about the ideal amount of 
influence their dean should have. The level they 
recommended was consistently higher than their 
rating of the dean’s actual influence. The increase 
occurred in 11 of the 12 colleges; in the twelfth 
case, the estimates of actual and ideal influence were 
practically identical, and very high. 

5 Some caution must be exercised in interpreting 
correlations with the present measure of bases of 
influence. The ranking method used in obtaining 
the data makes it impossible for all five bases of 
influence to be correlated in the same direction with 
any single criterion variable. Thus, it may be that 
positive correlations with expert and referent influ- 
ence “caused” negative correlations with other bases 
of influence, or vice versa. In spite of this limitation, 
the overall pattern of correlations with bases of in- 
fluence is of considerable interest. 
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also parallel, but they tend to be somewhat 
lower in the case of faculty influence. 


DISCUSSION 


In general, the results of this study of 
administration in liberal arts colleges are quite 
consistent with our hypotheses based upon 
other organizational studies. It is particularly 
useful to draw a comparison between the 
present research and the earlier study of 
account executives and office managers (Bach- 
man et al., 1966). In both studies satisfaction 
was positively related to the total amount of 
influence at all levels. Also in both studies it 
appeared that satisfaction with an organiza- 
tional “‘superior” (office manager or dean) is 
higher when the superior enjoys a relatively 
high degree of influence, and when this in- 
fluence is based upon expertise and personal 
attractiveness, rather than upon legitimate 
authority or the use of sanctions. One interest- 
ing difference appeared in the rank ordering 
of bases of the dean’s influence. In the study 
of account executives and office managers, 
legitimate influence was consistently rated 
most important (although it was negatively 
related to satisfaction with the office man- 
ager). In the present study, on the other 
hand, expertise was considered the most 
prominent basis of the dean’s influence, with 
legitimate authority given a second level of 
priority. As might be expected, college faculty 
members are clearly less likely than account 
executives to see themselves as subordinates 
in a hierarchy. 

Although most faculty members do not 
consider a legitimate obligation as their most 
salient reason for doing what the dean wishes, 
it appears that they do consider this the most 
important reason why the dean complies with 
their desires. Moreover, the greater the rela- 
tive importance of this basis of faculty in- 
fluence over the dean, the greater is their 
satisfaction with him. The explanation for 
this reversed relationship when ‘the shoe is 
on the other foot” is not immediately clear. 
It should be noted that the two questions are 
(necessarily) different in wording, and this 
may contribute to the effect. But it may also 
be that faculty members really are happy to 
perceive their rights as being respected by 
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the dean, and at the same time are less 
satisfied to view their own actions as deferring 
to the dean’s legitimate rights. 

In the present study, as in many others, it 
has been necessary to measure administrative 
behavior solely in terms of perceptions. Often 
such a procedure is criticized because what 
appears to be an “objective” relationship 
(e.g., deans who rely more upon expert and 
referent influence are likely to have more 
satisfied faculty members) may in fact be 
nothing more than a “phenomenological” 
effect (e.g., if a faculty member likes a dean, 
he will then perceive the dean as using expert 
and referent influence—quite apart from the 
dean’s actual behavior). This problem may 
be clarified by a hypothetical illustration: 
Professor A at Alpha College and Professor B 
at Beta College have identical perceptions of 
their deans’ bases of influence. But the mean 
perceptions of all faculty members at Alpha 
are different from the corresponding means at 
Beta; the dean at Alpha is viewed on the 
average as making relatively greater use of 
expert and referent influence. Now if the 
phenomenological explanation is correct, Pro- 
fessors A and B should not differ in satisfac- 
tion with their respective deans. On the other 
hand, if the use of mean perceptions provides 
at least some rough indication of objective 
conditions, and if these conditions influence 
satisfaction as we have hypothesized, then we 
should find that Professor A is more satisfied 
than Professor B. The partial correlations 
presented in Table 3 provide just this sort of 
test, and the results clearly support the ob- 
jective interpretation. When individual per- 
ceptions are in effect held constant (thereby 
removing the possibility of a phenomenological 
effect), the strong relationships between bases 
of influence and satisfaction with the dean 
are scarcely diminished at all. Of course, this 
type of analysis provides no guarantee that 
our operations have measured exactly what 
was intended; but it does indicate that some- 
thing objective has been measured, and that 
the present findings cannot be dismissed as 
“mere perceptions.” 

An adequate empirical foundation for 
policy recommendations in academic adminis- 
tration awaits much further work. Neverthe- 
less, it may be of interest to summarize a 
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few tentative implications of the present study 
in terms of what constitutes an effective dean 
(i.e., a dean whom the faculty perceives as 
doing a good job). First, so far as faculty are 
concerned, the effective dean appears to have 
a good deal of influence “in determining the 
policies and practices of the college.” Second, 
there is some support for the view that the 
faculty see the dean as a colleague or “first 
among equals” rather than as a hierarchical 
superior. The effective dean is influential 
through personal qualities such as expertise 
and respect, rather than basing his influence 
upon legitimate authority and the use of 
punishments or rewards. The collegial rela- 
tionship should not, however, be construed as 
minimizing the legitimate authority of the 
faculty. On the contrary, it would seem that 
the effective dean is one who obviously ap- 
preciates his obligation to respond to faculty 
suggestions. 

Qne additional implication of the study 
must be emphasized. It was noted at the 
outset of this discussion that the present find- 
ings are largely similar to those obtained in 
other organizational studies. This may serve 
to encourage further efforts applying the 
findings of organizational research to aca- 
demia. And it may indicate a common ground 


of organizational phenomena subject to gen- 
eral theories of administration. 
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VALIDITY OF THE GUILTY-KNOWLEDGE TECHNIQUE: 


THE EFFECTS OF MOTIVATION ? 
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48 Ss divided into 12 groups took part in an experimental investigation of 
detection of a simulated crime. For each of 12 crimes, 3 Ss were highly 
motivated to commit the crime; 1 succeeded, 1 attempted but failed, and 1 
did not make an attempt. The 4th S interrogated for each crime was an 
innocent control. Motivation for deception (hope of reward) was high ($25- 
$50) for half of the crimes and low (<$1) for the other half. Detection of 
deception using a polygraph recording of GSR and the guilty-knowledge tech- 
nique resulted in correct classification of 92% of “guilty” Ss and 100% of 
“innocent” Ss. No significant differences were found due to level of motivation 
for deception. Results were discussed in terms of their relationship to similar 
studies and actual criminal investigations. 


The first recommendation arising out of 
Orlansky’s (1962) review on _lie-detection 
capability was for further investigation of the 
validity of lie-detection techniques. This view 
has been reiterated recently in investigations 
by the Committee on Government Operations 
(1965, 1966) of the United States House of 
Representatives concerning the use of poly- 
graphs as “lie detectors” by the Federal 
Government. A common problem in studying 
the validity of lie detection, although not 
commonly recognized by many lie-detector 
operators, is that the polygraph merely in- 
dicates that a physiological response has oc- 
curred. The operator must infer, the lie on 
the basis of the observed physiological re- 
sponse. In spite of some extravagant claims 
(cf. Lee, 1953), the inference that autonomic 
responses can indicate lying must be viewed 
with extreme skepticism since the physiolog- 
ical responses measured by polygraphs can be 
produced by a wide variety of stimuli quite 
incidental to any process of lying. Lykken 
(1959) has suggested that the detection of 
guilt by using physiological measurements in- 
volves a more reasonable inference than the 
detection of lying. If a guilty individual is 
presented with some relevant details about his 





1 The author gratefully acknowledges the assistance 
of Geri Gold and the technical advice of Arthur R. 
Roberts of the Calgary City Police Department in 
carrying out this study which was supported in 
part by Canadian NRC Grant No. APA 213. 
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crime interspersed among similar but ir- 
relevant items, he would be expected to re- 
spond differently than an innocent individual 
who had no “guilty knowledge” about which 
items were relevant. The assumption here is 
not that the subject (S) will respond in a 
particular, recognizable way when he is lying, 
but merely that he will respond differently to 
relevant details of his crime than to irrelevant 
details. 

Lykken found that he could correctly 
identify, using a simple objective scoring 
procedure, all innocent Ss and 93.9% of his 
guilty Ss against a chance expectancy of 50%. 
He noted, however, that his Ss were not par- 
ticularly emotionally involved in their crime, 
nor were they strongly motivated to try and 
defeat the test. In a later study he found 
this guilty-knowledge technique highly re- 
sistant to faking (Lykken, 1960). The present 
study was designed to evaluate the validity 
of the guilty-knowledge technique under more 
ego-involving circumstances and _ varying 
amounts of motivation. 


MeETHOopD 


The Ss used in this experiment were 48 volunteer 
college students assigned at random to 12 groups. 
Three Ss chosen at random from each group took 
part in the crime as “hunters”; the fourth S did not 
and he had no knowledge of the nature of the 
experiment. The “hunter” game was used for enacting 
each crime since it seemed to be a highly ego- 
involving game for university students. The three 
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hunters from each group were seen individually by 
the experimenter (£) and shown a photograph of a 
student who was to be their intended victim (the 
12 victims were all collaborating psychology stu- 
dents). Basic information concerning the name of 
the victim, his faculty, and his lecture schedule 
was supplied. Each hunter was told that the 
victim would be carrying a sealed envelope which 
the hunter was to take and examine if he could 
successfully hunt down the victim and enact his 
murder. The hunter was informed that the envelopes 
contained vouchers for prizes ranging in value up 
to $50 and that he would be permitted to cash the 
voucher if he was not detected as the criminal after 
a polygraph examination. The hunter would not 
know the value of the voucher unless he was suc- 
cessful in enacting the crime. Although the hunters 
were not aware to it, half of the 12 victims had 
envelopes with vouchers ranging in value from $25 
to $50, while the other half ranged in value from 
10¢ to $1. Each hunter was informed that there 
were other hunters after the same victim and that 
the first successful hunter would be the one to get 
the envelope. Each hunter was free to choose his 
own method, time, and place for enacting the crime; 
the only restrictions were that he must work alone 
and tell no one of his plans or success. As an addi- 
tional incentive to the hunters, prizes were awarded 
upon completion of the experiment for the most 
ingeniously devised crimes, whether the hunter was 
successful in getting the envelope or not. 

The hunters were told that in order to add some 
uncertainty to the success of the crime the victim 
had been given a random timetable with specified 
time intervals on it when any crime attempted would 
not be successful. If an attempt was made during 
one of these “safe periods” the crime was said 
to have failed, the envelope was not surrendered, 
and the hunter was barred from trying again for 
24 hr. The hunter would not know until he made an 
attempt whether it would succeed or not. In fact, no 
such timetable existed and the first hunter to make 
an attempt was always told that his victim was in 
a “safe period” and the attempt had failed. The 
second hunter to make an attempt was always judged 
successful and took the envelope. 

Each victim had a code number which was posted 
on a central campus bulletin. This number was to be 
checked before making any attempt and to be re- 
moved as soon as any hunter was successful in ob- 
taining the envelope by enacting the murder of his 
victim. Thus, if the number was gone, the other 
hunters would know the game was over and would 
not make any further attempts on the victim. 

In this way, for every victim there were four 
suspects: one who had committed the crime, one 
who had tried and failed, one who was motivated 
to attempt the crime but had no opportunity, and a 
fourth “suspect” who in fact knew nothing about 
the crime or the victim and was in no way in- 
volved or motivated. 

As soon as the second hunter had succeeded in 
obtaining the envelope the victim supplied E with 


answers to some questions relevant to the crime 
(e.g.. where it occurred, murder weapon used, etc.). 
Six multiple-choice questions about the crime and 
the content of the envelope were made up with 
five alternatives for each question (cf. Lykken, 1959). 
The first alternate on every question was always a 
foil and not scored. This was done in order to 
reduce the possibility of incorrectly scoring orienting 
responses. The four suspects were then brought in 
for interrogation by another E who had no knowl- 
edge about who the criminal was, or which answers 
to the multiple-choice questions were correct. The 
S was seated in the interrogation room, GSR 
electrodes were attached to his dominant hand, and 
a pair of headphones adjusted to his ears. The E 
was located with a Grass Model 7 polygraph in an 
adjoining room and spoke to S$ via a microphone. 
Each S was told he was to be questioned in relation 
to a crime. He was to listen to each question con- 
sisting of several parts but to make no answer. 
The standardized list of six questions was then 
read to S and his GSR responses to each of the 
five alternates were recorded. Heart rate and respira- 
tion rate were monitored but not scored. 

Scoring of the GSR response after the recording 
was completed was simple and objective (Lykken, 
1959). Scoring was done without prior knowledge 
of the correct answers in order to prevent any 
criterion contamination. An S’s GSR’s to the several 
alternatives in a given question were ranked in order 
of amplitude. If his largest response was found to 
be a relevant alternative he was given a score of 
2 on that question. If his second largest response 
was to the relevant alternative he was given a score 
of 1. Thus a perfect innocent score was 0 and a 
perfect guilty score was 12. 


RESULTS 


Following Lykken’s (1959) criteria, all 
scores of 6 or less were classified “innocent” 
and all scores over 6 “guilty.” All six of the 
guilty Ss who were highly motivated to decep- 
tion because of the high value of their vouch- 
ers were correctly classified. Five of the six 
guilty Ss who were under low motivation to 
deception (negligible value of vouchers) were 
correctly identified. The difference in classi- 
fication between these two groups was not sig- 
nificant (Fisher exact probability p = .50 
[Siegel, 1956]). Combining the two groups, 
11 of the 12 guilty Ss were correctly classified 
as guilty (92%). All 36 of the innocent Ss 
(100% ) were correctly classified as not guilty. 
There was a total of 48 interrogations and 47 
of these were correctly classified—giving an 
overall 98% correct classification against a 
chance level of 25%. 
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DISCUSSION 


There were several reasons for the fairly 
involved procedure used in this study. Pri- 
marily it was designed to simulate, a little 
closer, actual situations involving real crimes, 
but at the same time maintain rigorous ex- 
perimental control. By allowing S to choose 
his own time, place, and method, plus the 
incentive for ingenious crimes, S became much 
more actively and personally involved than if 
he merely walked through the motions of a 
staged crime. Some of the uncertainties in- 
volved in real crimes were introduced. Not 
every crime attempted is automatically suc- 
cessful and there is usually no way of knowing 
in advance. A criminal usually does not know 
exactly what his gains will be (if any) until 
he successfully completes the crime. Further- 
more, in order to keep the rewards of his 
crime he must be able to escape detection. 
The motivation for escaping detection may be 
a complex one involving some combination 
of fear of punishment and hope of reward. 
Most experimental studies have increased this 
motivation by threat of punishment (usually 
electric shock). In the present study, the 
hope of reward was experimentally varied to 
see if this had any effect on ability to escape 
detection in a polygraph examination. It has 
sometimes been suggested that successful 
detection rates in actual crimes would be sig- 
nificantly higher than in experimental studies 
since the tremendous motivation involved in 
the former would increase the amount of 
autonomic responsiveness to questions relevant 
to the crime. While the differences in this 
study are in this direction, they do not sup- 
port this hypothesis. They do not clearly 
refute it either. The small samples of guilty 
Ss and the limited rewards combined with 
no real threats of punishment in the present 
study require caution in interpreting these 
results. 

One observation did occur which merits 
further study. A common attitude in the lie- 
detection field (cf. Lee, 1953) is that the 
GSR is not useful because it is “‘too sensitive.” 
Lykken (1960) discounted this objection to 
the GSR on the basis of his experimental 
studies with essentially fairly low levels of 
motivation to deception. In the present study 


it was observed, particularly in guilty Ss who 
possessed $50 vouchers, that there seemed 
to be a great many spontaneous fluctuations 
in base-line GSR. Scoring GSR responses to 
relevant questions must be done very carefully 
in such cases to prevent the confusion of 
spontaneous base-line fluctuations from being 
counted as GSRs. While this was not too 
difficult in the present experiment, it does 
indicate caution in uncritical adoption of 
Lykken’s scoring criteria for criminal in- 
vestigations. Malmo (1957) has suggested 
that base-line fluctuations can be a useful 
index of drive level and, consequently, a very 
high drive level produced in a suspect for 
any reason (e.g., merely being suspected of 
a crime) would reduce the reliability of scor- 
ing GSR responses. In this sense the “extra- 
ordinary sensitivity of the GSR” may not be 
a “clear virtue” as Lykken (1960) suggests 
that it is. 

Some experimental studies stack the odds 
for correct detection very much in favor of 
the polygraph examiner by merely requiring 
him to separate one guilty suspect from 
several innocent controls. In criminal investi- 
gations the odds are not always so nice. 
Often there are several suspects who had good 
motives for committing the crime, some may 
have been planning it and some may have 
even attempted it unsuccessfully. But to the 
extent that these experimental results can be 
generalized, the guilty-knowledge technique 
seems to work as well in either situation. The 
92% correct classification of guilty Ss and 
100% correct classification of innocent Ss is 
essentially the same using the present pro- 
cedure as the 93.9% correct classification of 
guilty Ss and 100% correct classification of 
innocent Ss using Lykken’s (1959) procedure. 


REFERENCES 


COMMITTEE ON GOVERNMENT OPERATIONS. (10th re- 
port) Use of polygraphs as “lie detectors” by the 
federal government. 89th Congress, 1st Session, 
House Report No. 198. Washington, D. C., 1965. 

COMMITTEE ON GOVERNMENT OPERATIONS. (39th re- 
port) Use of polygraphs as “lie detectors” by the 
federal government (Part 2). 89th Congress, 2nd 
Session, House Report No. 2081. Washington, D.C.: 
1966. 

Lrg, C. D. The instrumental detection of deception. 
Springfield, Ill.: Charles C Thomas, 1953. 


THE GuILTY-KNOWLEDGE TECHNIQUE 65 


Lyxxen, D. T. The GSR in the detection of guilt. Ortansky, J. An assessment of lie detection capa- 


Journal of Applied Psychology, 1959, 43, 385-388. bility. Institute of Defence Analyses, Research and 
LyKxen, D. T. The validity of the guilty knowl- Engineering Support Division. Technical Report 

edge technique: The effects of faking. Journal of No. 62-16. Washington, D. C., 1962. 

Applied Psychology, 1960, 44, 258-262. SIEGEL, S. Nonparametric statistics for the be- 


Rear OnE R: @ BME Ans ioxrmn ude hehavioralimeroucal: havioral sciences. New York: McGraw-Hill, 1960. 


Psychological Review, 1957, 64, 276-287. (Received March 10, 1967) 


List of Manuscripts Accepted for the 
Journal of Applied Psychology 


Goals and Intentions as Mediators of the Effects of Monetary Incentives on Behavior: Edwin A. Locke,* Judith F. 
Bryan, and Lorne M. Kendall: Department of Psychology, University of Maryland, College Park, Maryland 
20740. 

Relation between Ethnic Origin and GSR Reactivity in Psychophysiological Detection: Sol Kugelmass* and 
Israel Lieblich: Department of Psychology, The Hebrew University of Jerusalem, Jerusalem, Israel. 

Influence of Training, Method, and Relationship on the Halo Effect: Eva Metzger Brown*: Apartment 134, 115-25 
Metropolitan Avenue, Kew Gardens, New York 11418. 

Driver Gap Acceptance at Intersections: William L. Gibbs*: United States Department of Transportation, Federal 
Highway Administration, Bureau of Public Roads, Washington, D. C. 20591. 

Relationship between Perceptual Style and Driver Reaction to an Emergency Situation: Gerald V. Barrett* and 
Carl L. Thornton: Director Human Factors Laboratory, Life Sciences Research Department, Goodyear 
Aerospace Corporation, 1210 Massillon Road, Akron, Ohio 44315. 

Environmental Frustration and Creative Problem Solving: Bernard L. Hinton*: Graduate School of Business, 
Indiana University, Bloomington, Indiana 47401. 

Effects of the Type of Stimulus Employed and the Level of Subject Awareness on the Detection of Deception: 
Richard L. Thackray and Martin T. Orne*: Unit for Experimental Psychiatry, University of Pennsylvania, 
111 North 49th Street, Philadelphia, Pennsylvania 19139. 

Scales for Rating the Taste of Water: William H. Bruvold*: University of California, School of Public Health, 
Earl Warren Hall, Berkeley, California 94720. 

Occupational Validity of the GATB: Stephen E. Benis*: United States Department of Labor, Bureau of Employ- 
ment Security, Washington, D. C. 20210. 

How to Succeed in Business According to Business Students and Managers: Bernard M. Bass*: 401 Bruce Hall, 
Graduate School of Business, University of Pittsburgh, Pittsburgh, Pennsylvania 15213. 


* Asterisk indicates author for whom the address is supplied, 


Journal of Applied a eeeey 
1968, Vol. 52, No. 1, 66-70 


COMPARISON OF BEHAVIORAL STYLES BETWEEN 


ENTERING AND GRADUATING STUDENTS IN 
OFFICER CANDIDATE SCHOOL 


PETER B. PETERSEN ann GORDON L. LIPPITT 


George Washington University 


2 


groups of students in an officer candidate school (N = 880) are compared to 
determine their differences in behavioral styles. The instrument used is the 
Job Analysis and Interest Measurement. There is a difference at the .01 
level of significance in the mean scores between entering (N = 561) and gradu- 
ating (N= 319) students in 4 prechosen behavioral dimensions. Entering stu- 
dents compared with graduating students place a higher value on approval 
from others. In the 3 other scales considered, the mean graduating students’ 
scores are higher than the entering students’ scores. This indicates that gradu- 
ating students as compared with entering students tend to be more self- 
assertive, are more likely to be persuasive leaders, and are more prone to like 


supervisory activities. 


A worthwhile evaluation of training may 
be accomplished by evaluating groups of 
trainees before and after they are trained. 
Schein (1965) favors an evaluation of train- 
ing which includes an observation of trainees 
after they have returned to their jobs. A con- 
trol group may be helpful in determining if 
the changed behavior is actually caused by 
training. Prior studies tend to indicate that 
the change process may be conducted in a 
variety of ways. Differences in the change 
process for the most part seem to center on 
the degree of influence exercised by the person 
being changed. Schein (1961) visualizes that 
“coercive persuasion” is employed in some 
management development programs. He con- 
siders that many managers being developed 
need to be “unfrozen” because they are 
psychologically unprepared to change certain 
attitudes in favor of untried new ones. Bennis 
(1964) suggests that an indoctrination form 
of change is being used by many schools. 
While the authors are interested in the change 
process, primary attention will focus on find- 
ing the specific differences between entering 
and graduating students, and then investigat- 
ing why these particular dimensions changed. 
The test instrument used in this study to 
compare these two groups of students is 
the Job Analysis and Interest Measurement 
(JAIM).1 It has been used for studying 


1The authors are grateful to Regis H. Walther, 
author of the test instrument, for his personal as- 
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more than 30 occupational groups including 
engineers, lawyers, foreign service officers, 
ambassadors, judges, social workers, police- 
men, physicists, and secretaries. Walther 
(1964) found that this instrument can dis- 
tinguish among occupational groups and be- 
tween superior and weak performances within 
an occupational group. It is a 125-item self- 
report questionnaire used to measure the 
personal qualities of the worker (other than 
aptitudes, training, or knowledge), which have 
an influence on job success or failure. 

This study is a comparison of the average 
behavioral styles between a group of entering 
and graduating students in one specific army 
officer candidate school (OCS). Behavioral 
styles refer to the consistent ways in which 
a person organizes his physical, emotional, 
and energy resources. The specific army OCS 
used in this study, while anonymous, is 
representative of nine others. The duration 
of the training is 23 weeks. The mission of 
OCS is to develop combat leaders. This train- 
ing, according to a Department of the Army 
pamphlet (United States Army, 1966), is 
designed to place the student under physical, 
mental, and emotional stress to simulate the 
stress and fatigue of combat. Within this 
particular branch being studied, 77% of its 
newly commissioned officers will be provided 
by this training source, 22% will be provided 





sistance in accomplishing much of the work de- 
scribed in this paper. 
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by Reserve Officer Training Corps training 
conducted in colleges, and 1% of their officers 
will be provided by the United States Military 
Academy at West Point. The high percentage 
of newly commissioned officers provided by 
this training source indicates a need for this, 
and similar, studies. 


Problem 


The authors were impressed by the fact 
that some students have a propensity to suc- 
cessfully complete OCS, while others are 
prepared to resign after they are in the course 
for only a short time, some for only a few 
hours. An initial thought was that students 
with a favorable degree of certain behavioral 
styles might be more likely to successfully 
complete OCS. A start in making such an 
idea fruitful might be a comparison of the 
behavioral styles between entering and grad- 
uating students in OCS. This study will at- 
tempt to make such a comparison, pertaining 
to four specific behavioral style scales. 

Groups of students (rather than individ- 
uals) will be analyzed. A group of entering 
students will be compared with a different 
group of graduating students. Behavioral 
styles will be analyzed in terms of the signifi- 
cant difference between the mean behavioral 
styles of the two groups. The hypotheses will 
reflect directly the purpose of this research, 
which is to compare four specific behavioral 
style scales between entering and graduat- 
ing students in OCS. 

There are a number of factors which, in 
the opinion of the authors, influence the 
behavioral styles of officer candidate students 
so that they may become qualified to grad- 
uate. Of the many factors, four are considered 
to be especially pertinent to this study. Their 
pertinence is judged on the basis of their 
role in determining the culture that char- 
acterizes OCS, the social situation of the 
student within OCS, and the relationship be- 
tween the student’s social and academic 
experiences. These factors are: (1) the goal 
and standards of OCS, (2) the course of in- 
struction, (3) the tactical officer, and (4) 
the system of voluntary resignation. 

It is understood that many other factors 
may influence their behavioral styles. Some 
of these other factors are: (1) the prior 


TABLE 1 


SUMMARY OF Pitot Stupy RESULTS 





Difference between 
the means of 


eae entering students 





Scales Peayligs (N = 56) and 
graduating 
students 
W = 33) 
Perseverance EOD .67 
Persuasive leadership S.0len 1.58 
Self-assertiveness 3.9155 2.19 
Supervisory activities S107" 132 
Approval from others | —6.02** —2.70 





a A positive ¢ value indicates graduating students are higher 
in this particular scale, while a negative ¢ value indicates that 
entering students are higher as pertains to this particular scale. 

* p <.05 with ¢ = 1.99 for df = 87. 

¥#* > < .01 with ¢ = 2.64 for df = 87. 


experience and training of the student, (2) 
the student’s relationship with internal fac- 
tors within the OCS, (3) the student’s rela- 
tionship with various factors external to the 
OCS, (4) the role of the student within the 
formal and informal organization, (5) the 
student’s age and physical fitness, and (6) 
the changing amounts of power, authority, 
and influence exercised by the student. 


Pilot Study 


In the development of the hypotheses of 
this study an initial thought was that students 
high in the JAIM Perseverance scale would 
have a high propensity to graduate and that 
those low in the JAIM scale of Perseverance 
would be likely to fail or resign from the 
course. A pilot study was conducted in June 
1966 in which the average behavioral styles 
of 56 entering students were compared with 
the average behavioral styles of 33 graduat- 
ing students. A summary of the results of this 
pilot study is shown in Table 1. 

The hypotheses of this study are stated 
below: 


1. Hypothesis I: Entering students, com- 
pared with graduating students, tend to place 
a higher value on approval from others; 
therefore, entering students will score sig- 
nificantly higher on the JAIM scale Approval 
From Others. 

2. Hypothesis II: Compared with entering 
students, graduating students in OCS tend 
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to be more self-assertive; therefore, graduat- 
ing students will score significantly higher on 
the JAIM scale Self-Assertiveness. 

3. Hypothesis III: Compared with entering 
students, graduating students in OCS are 
more likely to be persuasive leaders; there- 
fore, graduating students will score signifi- 
cantly higher on the JAIM scale Persuasive 
Leadership. 

4. Hypothesis IV: Compared with entering 
students, graduating students in OCS are 
more prone to like supervisory activities; 
therefore, graduating students will score sig- 
nificantly higher on the JAIM scale Super- 
visory Activities. 


METHOD 


This study attempts to compare, systematically, 
several variables in the behavioral styles between 
entering and graduating students in OCS. This 
difference is revealed by comparing the mean 
behavioral styles of a sample from each of these 
two populations. The hypotheses of this study are 
concerned with the differences in four specific be- 
havioral style scales represented in the test instru- 
ment used in this study. The instrument, called the 
Job Analysis and Interest Measurement (JAIM), 
provides measures for 34 scales relating to attitudes, 
interpersonal behavior, formal organizational be- 
havior, decision-making behavior, work preferences, 
values, and academic achievement. The difference 
in the specific scale means, if statistically sig- 
nificant (determined by using Fisher’s t), will form 
the basis for further evaluation, explanation, and 
possible implications and speculation. 

Entering students sampled were administered the 
JAIM during their first week of training in OCS, 
while graduating students were sampled in their 
twentieth to twenty-second week. Questionnaires 
were administered during the period from June 21, 
1966 through October 21, 1966. The conditions of 
the testing facilities were reasonably equal in all 
cases. The data preparation included transferring 
the student answers from each answer sheet to 
three data-processing cards. Starting October 7, 
1966, an administrative-data questionnaire was also 
completed by each person administered the JAIM. It 
is represented by a fourth data-processing card 
for each student. The analysis of the data was 
accomplished by an IBM 1620 computer.? 

The method of evaluating the JAIM results in 
this study is to determine if there is a significant 3 
difference between the mean scores on a specific 
JAIM scale between entering students (N = 561) 


2 Computer operations were conducted by David 
Reimer, staff associate in the Social Research Group 
of The George Washington University. 

3 The term “significant” is reserved for the mean- 
ing statistically significant. 


and graduating students (NV = 319). The significance 
in the difference of means is tested by conducting a 
t test of the two means. In this study a positive 
t value indicates that graduating students are higher 
in a particular scale, while a negative t value in- 
dicates that entering students are higher as per- 
tains to this particular scale. In the evaluation of 
the two experimental groups, significance at the 
.01 confidence level is p< .01 with t=2.58 for df 
= 878. 

The students in the population sampled were 
homogeneous as to their age, sex (male), state of 
health, and general educational level. Some Negro 
students were present, but the proportion of Negroes 
in this OCS was less than the proportion of Negroes 
in the Army enlisted ranks. The results of an ad- 
ministrative-data questionnaire completed by enter- 


TABLE 2 


Group AGGREGATES OF INDIVIDUAL CHARACTERISTICS 
FOR ENTERING (N = 151) AND GRADUATING 
(N = 152) StupENTs 








Entering | Graduating 








students | students 
Method of entering service 
Volunteer 61% 72% 
Draft 39% 26% 
Other (i.e., National Guard 
Unit called to active duty) 0% 2% 





Students presently in class who have been turned back 
from another class 





Turned back for academic 


reasons 1% 7% 
Turned back for leadership 

reasons 1% 13%» 
Turned back for other reasons 

(i.e., medical) 0% 4% 


Educational level 








High school or equivalent 18% 20% 
Some college but did not 

graduate 54% 47% 
Graduated from a junior 

college 5% 5% 
Graduated from college 16% 20% 
Some postgraduate work 6% 1% 
Master’s degree or higher 1% 1% 


® This may indicate that, for this study, difficulties in meeting 
leadership-proficiency standards have a higher propensity of 
being corrected than difficulties in meeting academic-pro- 
ficiency standards. This is probably indicated by the higher 
percentage of students graduating who have previously been 
turned back for leadership reasons than those previously 
turned back for academic reasons. It may also indicate that 
judgments on leadership proficiency are more subjective and, 
therefore, less reliable than judgments on academic proficiency. 
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ing and graduating students are shown in Table 2. 
This questionnaire was designed to help describe 
the sample. It reflects the aggregate of individual 
characteristics for a group containing 151 entering 
students and a group containing 152 graduating 
students. 

Since 13 entering classes and 10 graduating 
classes were sampled, the writers felt it advisable 
to determine the variance among different classes. 
The statistical test selected for this comparison of 
classes was the analysis of variance, which provides 
a method of testing for significant differences be- 
tween means among and between groups. Each of 
the four JAIM scales was evaluated in terms of 
the stability of the entering classes’ set of 13 
means, and the stability of the graduating classes’ 
set of 10 means. 


RESULTS 


The results of the analysis-of-variance tests 
for entering and graduating classes are given 
in Table 3. Among the entering classes, one 
significant difference (p < .05 with F = 1.77 
for df = 12) was found on the Approval From 
Others scale. The fact that one out of eight 
comparisons shows a significant difference 
among groups indicates that different classes 
vary more than would be expected due to 
chance. This should be considered when 
interpreting the data. 

As shown in Table 4, the significance in 
the difference of means is tested by conduct- 
ing a ¢ test of the two means. In this analysis 
a positive ¢ value indicates that graduating 


TABLE 3 


ANALYSIS OF VARIANCE AMONG ENTERING AND 
GRADUATING CLASSES 











JAIM scale F p 

Approval from others 

Entering classes (V = 13) 1.967 05 

Graduating classes (V = 10)> 1.880 — 
Persuasive leadership 

Entering classes* 945 — 

Graduating classes 1.552 _- 
Self-assertiveness 

Entering classes* 798 — 

Graduating classes? 1.065 a 
Supervisory activities 

Entering classes* 908 —_ 

Graduating classes 1.026 — 








® Entering classes (NV = 13): 
Pe 1.77 for df = 12; p <.01 with F = 2.19 
for df = 

b Caine classes (V = 
p <),05 with F = 1.91 for oF p <.01 with F = 2.48 
for df = 9. 


TABLE 4 


SUMMARY OF SCORES FOR THE Two 
EXPERIMENTAL GROUPS 





; aa r » | Hypothesis 
JAIM scale {value® | p fens 
Entering students 
higher (V = 561): 
Approval from others | —7.40 | .01 | Yes 
Participative leader- —5.43 | .01 | Not 
ship hypothe- 
sized 
Graduating students 
higher (V = 319): 
Self-assertiveness 5.61 {Oil || Wes 
Persuasive leadership 415 | .01 | Yes 
Supervisory activities 2S ee Olen Mes 














a A positive t value indicates graduating students are higher 
in this particular scale, while a negative ¢ value indicates that 
entering students are higher as pertains to this particular scale. 

by < .01 with ¢ = 2.58 for df = 878. p < .05 with t = 1.96 
for df = 878. 

e¢ At the .01 level of significance. 


students are higher in a particular scale 
and a negative ¢ value indicates that entering 
students are higher as pertains to a particular 
scale. In the analysis of the two experimental 
groups, a significance of at least the .01 
confidence level occurs when ¢ is greater 
than or equal to 2.58 for 878 degrees of 
freedom. 

Entering students compared with graduat- 
ing students tend to place a higher value on 
approval from others, as determined by the 
Job Analysis Interest Measurement (JAIM). 
In the other three scales, the graduating 
students were higher, which indicated the 
following: 


1. Compared with entering students, grad- 
uating students in OCS tend to be more self- 
assertive, as determined by the JAIM. 

2. Compared with entering students, grad- 
uating students in OCS are more likely to be 
persuasive leaders, as determined by the 
JAIM. 

3. Compared with entering students, grad- 
uating students in OCS are more prone to like 
supervisory activities, as determined by the 
JAIM. 


There is a significant difference in the 
mean JAIM scores between groups of entering 
and graduating students in the four prechosen 
comparisons established by the four hy- 


70 Peter B. PETERSEN AND GorpoNn L. Lippitt 


potheses. Within the structure of the re- 
straints established by earlier assumptions, 
the four hypotheses of this study are affirmed 
and accepted. 

The Approval From Others scale measures 
the degree to which the individual values 
himself by obtaining the approval of others. 
The results of this study indicate that the 
Approval From Others scale, located within 
the “Value” category of scales, has the high- 
est f value among any of the 34 JAIM scales 
for this particular research study (¢ value 
of —7.40). Graduating students appear to be 
mission oriented: some express their philos- 
ophy as “not overly concerned with what my 
buddies think; it’s the mission that counts.” 
In contrast, entering students tend to “worry 
too much about what their buddies think.” It 
seems that mission orientation is paramount 
during training, while the approval from 
others tends to be secondary. 

Studies in social psychology during World 
War II suggest that the personality of the 
officer candidate is changed to adapt to 
new standards. Stouffer, Suchman, DeVinney, 
Star, and Williams (1949) describe this 
change as follows: 


The hopeful candidate is now subjected to a nearly 
catastrophic experience, which breaks down to a 
large extent his previous personality organization. 
His previous valuations fail him, and in order to 
find a basis for self-respect, he must adopt new 
standards or escape from the field. His high mo- 
tivation to become an officer usually rules out the 
latter alternative. At the same time, new, ap- 
propriate attitudes are built up and established. 
The catastrophic experience provides a kind of 
purgatory, a definite demarcation from the candi- 
date’s enlisted incarnation that puts a barrier be- 
tween the new officer and his enlisted memories. 
It has some of the characteristics of a conversion 
experience, or the ordeal of a medieval knight. The 
effect of this ordeal on the officer candidate is not 
only to attack his previous personality, but to 
exert a positive influence in the desired direction. 


An interpretation that may be drawn is that 
students are being molded into the behavioral 
styles required to become successful junior 
officers. The graduating student is action 
oriented, with a greater concern for the mis- 
sion than for the welfare of his subordinates. 
However, this does not imply that he is not 
concerned with their welfare. It does imply 


that concern for subordinates is second only 
to the accomplishment of the mission. 


Suggestions for Further Research 


An evident weakness in this research is 
that the authors cannot state what portion 
of the difference in behavioral styles is caused 
by: 


1. Students actually changing their be- 
havioral styles during the time considered; 

2. Students with certain behavioral styles 
resigning from the course and subsequently 
not being sampled as graduating students; 

3. Students with certain behavioral styles 
failing from the course and subsequently not 
being sampled as graduating students. 


Additional research should devise a system 
so that individual comparisons may be made 
instead of comparing the mean of each group. 

A study conducted a year or two from now 
may provide answers as to the stability of 
the behavioral styles “learned” or “changed” 
while at OCS. To what extent, if any, the 
individual reverts toward his original be- 
havioral style offers attentive consideration. 
A most interesting determination in the opin- 
ion of the authors is: To what extent are the 
behavioral styles of students  actuallv 
changed? 
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SOCIAL IMMATURITY, INTELLECTUAL ABILITY, AND 


ADJUSTIVE BEHAVIOR IN COLLEGE’ 


DAVID KIPNIS 


Temple University 


It was hypothesized that students classified as socially immature, in comparison 
to less socially immature students, would be more visible to their instructors 
because of a higher verbal output in class, would have a higher absentee rate, 
and would underachieve in academic performance. The Ss were male Temple 
University students enrolled in 23 sections of an introductory psychology 
course. Classification of social immaturity was by paper-and-pencil test. In 
support of the 1st hypothesis it was found that socially immature students 
were significantly more visible to their instructors by virtue of their classroom 
behavior. That is, as freshman they were evaluated as more disruptive of 
classroom proceedings (p<..05), and as upperclassmen they were evaluated 
as contributing more to classroom discussion (p < .05). In support of the 
2nd hypothesis it was found that significantly more socially immature students 
were absent from class (p < .01). Finally, among students in the top tertile of 
Scholastic Aptitude Test Scores, socially immature students underachieved in 


an introductory mathematics course (p < .01), but not in an introductory 


psychology course. 


In the last several years there has been 
increased interest in describing adult per- 
sonality structure in terms of stages that have 
their origins in the person’s psychosocial de- 
velopment. For example, Loevinger (1966) 
has described at least seven stages or mile- 
stones in the development of the ego; Erikson 
(1959) has formulated this developmental 
sequence in terms of stages of ego identity; 
Sullivan, Grant, and Grant (1957) in terms 
of maturity levels; and more recently Maslow 
(1964) has described individuals in terms of 
levels of synergy. Despite the diversity of 
terms and stages proposed, there appears to 
be reasonable agreement about the behaviors 
of persons classified as at relatively early 
stages of personality development, that is, 
socially immature individuals. Among the 
behaviors prominently associated with social 
immaturity are impulsivity, a lack of ac- 
ceptance of conventional norms, and an ex- 
ploitive mode of interpersonal relations. 

There is also evidence that persons classi- 
fied as socially immature tend to fail in 
socially defined achievement roles such as are 
associated with high school (Kipnis, 1965a; 
Roessel, 1954), college (Gough, 1965; Heil- 
brun, 1965), or job (Kipnis, 1965b). The 


1 This research was aided by a grant from the 
trustees of Temple University. 
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evidence also suggests that the relation be- 
tween social immaturity and underachieve- 
ment is strongest among more intelligent 
persons. For example, Roessel (1954) found 
that among more intelligent students those 
with high Pd scores on the MMPI dropped 
out of high school more frequently than those 
with low Pd scores, but this relation was not 
present among less intelligent students. Gough 
(1965) has consistently found among intel- 
lectually gifted persons that the Socialization 
scale of the California Psychological In- 
ventory is related to school achievement 
(poorly socialized students underachieved), 
and Heilbrun (1965) has reported a measure 
of conformity to social values as predictive 
of college dropouts among the more intel- 
ligent, but not the less intelligent. 

While the findings are consistent on the 
relation between achievement and social im- 
maturity, there is less information available 
on the nature of the day-to-day activities of 
socially immature individuals that eventually 
result in their underachievement. A_ start 
toward providing this information was made 
by the present writer in a taxonomic study 
of the behaviors and attitudes that dis- 
tinguish persons classified as socially im- 
mature (Kipnis, 1966). Among the findings 
was that socially immature persons did not 


~I 
bo 


react as frequently as less socially immature 
persons to embarrassing, shameful, or anxiety- 
provoking situations. Presumably, this lowered 
level of reactivity would result in the socially 
immature person not experiencing anxiety 
over failure and consequently not being mo- 
tivated to reduce anxiety by striving for 
achievement. It was also found that socially 
immature individuals were less accepting 
of conventional values relating to obeying 
rules and regulations.” This lack of concern 
over regulations might further provoke failure 
through a disregard of institutional rules. The 
foregoing findings however were based upon 
the intercorrelation of several questionnaires 
measuring strength of belief in conventional 
social norms, reactivity to anxiety, and level 
of social immaturity. There remains the ques- 
tion of whether these differences in attitudes 
and beliefs are also manifested behaviorally. 
The present study reports evidence on this 
point. 

Three studies were done. In the first the 
prediction was tested that socially immature 
students would participate more in classroom 
discussions and be more visible to their in- 
structors by virtue of their classroom be- 
havior. This prediction was based upon a 
prior finding that socially immature students 
reported that speaking in public was not 
likely to make them anxious. Accordingly it 
was expected that they would talk more in 
class. The second prediction was that socially 
immature students would be absent more 
frequently from class. This prediction was 
based upon the assumption that the behavior 
of socially immature students would be less 
constrained by general university regulations 
forbidding the cutting of classes. The final 
prediction was that among more intelligent 
students, the socially immature student in 
comparison to the less socially immature stu- 
dent should earn lower grade-point averages 
(GPAs). This prediction was based upon 
the previously cited literature on the relation 
between academic achievement and _ social 
immaturity among the more intellectually able 
population. 


2Socially immature respondents oriented their 
values around the acquisition of material things. 
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PROCEDURE 


Measure of Social Immaturity 


An index of social immaturity was obtained by 
adding the unweighted scores from two purported 
measures of social development—the Socialization 
scale from the CPI and the Insolence scale (de- 
velopment of this latter scale is given below). The 
use of this combined score was based upon the goal 
of improving the measure of immaturity. A sub- 
sequent section of this paper will report on the 
relative effectiveness of the combined scores versus 
the scores taken singly. 

In a sample of 573 male Temple University 
students, the Insolence scale correlated .45 with the 
Socialization scale (scoring key reversed so that 
high scores indicated poorly socialized), suggesting 
that both scores were measuring aspects of the 
same personality organization. Both scales con- 
tributed relatively equal weight to the combined 
social immaturity index, as indicated by the fact 
that the means and standard deviations of both 
scales were almost exactly equal (Socialization 
scale: M = 16.70, SD =5.71; Insolence scale: M= 
Ui, SO = 558). 

In the present research, persons with social im- 
maturity index scores in the top third of the 
Temple University distribution (39 or more) were 
designated as high in social immaturity (high), 
persons with middle third scores as middle in social 
immaturity (middle), and persons with bottom 
third scores of 29 or less were designated as low 
in social immaturity (low). It should be noted that 
students classified as low in social immaturity could 
probably be further subdivided into several higher 
stages of development if additional cognitive and 
attitudinal information were available. In general 
the scales used appear most sensitive in distinguish- 
ing persons fixated at earlier stages of development 
from persons at all hypothesized latter stages. 

Insolence scale. This is a 41-item paper-and-pencil 
biographical-type inventory. If all items were an- 
swered in the keyed direction, it would convey the 
impression of a physically active, aggressive, pleasure- 
seeking personality whose childhood was character- 
ized by early independence from family and con- 
flicts with grade-school authorities. The scale is 
based upon 27 of the 58 items in Torrance and 
Ziller’s (1957) Risk Scale to which have been added 
14 additional items describing aspects of childhood 
behavior. All items were selected on the basis of 
their relation to school and job performance in 
the United States Navy. 

The usefulness of the Insolence scale as a measure 
of social immaturity has been explored in a series 
of studies. In several laboratory studies, persons 
classified as high in social immaturity on the basis 
of Insolence scale scores were found to be non- 
compliant and resistant to an experimenter’s in- 
fluence (Kipnis & Wagner, 1965), and to be non- 
productive when working for a leader who was low 
in status and power (Kipnis & Wagner, 1967), 
and to be easily aroused to aggressive behavior 
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(Kipnis, 1966). Among more intelligent patients 
admitted to a psychiatric ward in a Navy hospital, 
the scale had a biserial correlation of approximately 
.60 with psychiatrists’ diagnoses of character and 
personality disorder (Kipnis, 1965b). 

In the population at large, the Insolence scale 
has been found to correlate .41 with the Extrover- 
sion scale of the Maudsley Personality Inventory, 
—.41 with the Achiever Personality scale of Fricke’s 
Opinion, Attitude and Interest Survey, .52 with the 
Sensation Seeking Scale of Zuckerman et al. (1964), 
32 with a measure of lack of emotional responsive- 
ness to shame-provoking situations developed by 
Lykken (1957), and .29 and .25 with the Pd and 
Ma scales of the MMPI, respectively. The scale 
does not correlate with age or general intelligence 
as estimated by both the Navy’s General Classifica- 
tion Test and the Scholastic Aptitude Test. 

Socialization scale. This is a 51-item true-false 
attitude scale from the CPI. Gough (1965) has 
described the scale as identifying individuals along 
a continuum of asocial to social behavior and as 
able to forecast the likelihood that any person will 
violate the rules and norms of his culture. Re- 
sults of a wide variety of studies, summarized in 
Gough (1965), indicate that persons described as 
asocial by the scale do indeed exhibit such be- 
haviors as delinquency, underachievement in school, 
violations of paroles, and show a lack of responsive- 
ness to social reinforcement. 


Subjects 


The Ss consisted of male Temple University 
students enrolled in 23 sections of an introductory 
psychology course, taught by 13 different instructors. 
Class size averaged 42 students with approximately 
half of each class being male students. The specific 
number of Ss used varied from study to study, and 
the exact numbers used will be given when the 
results from each study are presented. Approxi- 
mately 49% of the Ss were freshmen and 51% 
were upperclassmen (mainly sophomores). The Ss 
were tested with the Socialization scale and the 
Insolence scale during the first week of the term. 


Intellectual A bility 


Intellectual ability was estimated on the basis of 
the combined verbal and numerical scores from 
the Scholastic Aptitude Test (SAT). To detect any 
possible moderating effects of intelligence, Ss were 
divided into thirds on the basis of their combined 
SAT scores, or at the median of the combined SAT 
distribution. The number of Ss available for each 
study dictated the decision as to whether Ss were 
divided into two or three equal groups. 


3 Three items were eliminated as inappropriate 
for the present population. Permission to use the 
scale for research purposes was granted by Consulting 
Psychologists Press, Inc., publishers of the CPI. 


Classroom Visibility 


Nine instructors teaching 13 sections of intro- 
ductory psychology were asked to rate all male 
students in terms of their classroom participation.* 
There were between 20 and 28 male students in 
each section. The evaluations were made in the 
ninth and tenth weeks of the school term, 

The evaluation forms consisted of three scales. 
Each scale contained the alphabetical listing of all 
male students in a section, and the instructor was 
asked to circle the names of the students on each 
list that were like the description given at the top 
of the scale. The scales read as follows: 

Classroom visibility. “Some students attract the 
attention of their instructors because of their class- 
room behavior. These students may attract notice 
because they consistently ask questions or make 
comments about the course material. (Consider in 
this scale the frequency of classroom participation 
rather than the quality of the participation.) Other 
students attract attention because they talk to 
classmates or they do not appear to be attending 
to the lecture, or they exhibit other forms of 
inappropriate classroom behavior. Place a check 
alongside the names of students whose classroom 
behavior caused you to notice them.” 

Facilitative behavior. “Some students contribute 
to the teaching process by asking reasonable ques- 
tions or by making constructive comments. Place a 
check next to the names of the students who most 
frequently contribute to discussion in your class.” 

Disruptive behavior. “Some students disrupt 
classroom proceedings by asking inappropriate or 
hostile questions, by obvious inattention, or by 
other kinds of disruptive acts. Place a check 
alongside the names of students who best fit the 
above description.” 


Analysis 


The ratings on each scale were analyzed by means 
of a 3 X 2 X 2 factorial design, with three levels of 
social immaturity, two levels of intellectual ability 
(median split on SAT scores), and two levels of 
students’ class (freshman versus all upper classes) 
as the independent variables. The classification of 
students in terms of class level was added to test 
the generally held belief that freshmen participate 
less in classroom discussions than do upperclass- 
men. It may be noted that the instructors had no 
systematic knowledge of students’ SAT scores, class 
level, or level of immaturity scores. 

For each S, a score was obtained by assigning a 
weight of 1 if the instructor checked his name on 
a given scale, and a score of 0 if his name was not 
checked. These scores were used as the dependent 
measure in the analysis of variance. A total of 285 


4Four instructors did not participate: two be- 
cause they knew the immaturity scores of their 
students, one because he did not notice any students 
by name, and one instructor because he did not 
complete the ratings. 
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TABLE i 


SUMMARY OF THE ANALYSIS OF VARIANCE: 
TEACHERS’ RATINGS OF CONSTRUCTIVE 
CLASSROOM BEHAVIOR 











Source df MS F 
Immaturity (I) 2 wise 1.47 
Class level (CL) 1 1.0890 9.24** 
Intellectual ability (A) 1 .6792 5.76* 
ISeCls 2 4743 4.02* 
IXA 2 2151 1,82 
CLXA 1 5116 4.34* 
ES< ChSaA: 2 1857 1:57 
Error 273 1179 

* > <.05 
ar Dae OL 


Ss were evaluated. Overall, 23% of the students 
were nominated as visible to the instructor, 14% 
were nominated as contributing constructively to 
classroom discussions, and 9% were nominated as 
disruptive of classroom proceedings. Two percent 
of the Ss were evaluated as both contributing 
to classroom discussion and as being disruptive 
of classroom procedures. The number of Ss in each 
of the nine cells of the design ranged from 17 to 
36; therefore the analysis of variance was based 
upon an unweighted means solution. 


RESULTS 


The analysis of the instructors’ first 
ratings on classroom visibility yielded no 
significant relations between visibility and 
social immaturity or class level. However, 
there was a significant main effect for in- 
tellectual ability—F (1/273) = 5.74, p < .05. 
More intelligent students were rated by their 
instructors as being more noticeable by virtue 
of their classroom behavior than were the 
less intelligent students. An average of 30% 
of Ss with above-median SAT scores and 
17% of Ss with below-median SAT scores 
were rated by their instructors as visible. 

The second rating scale requested instruc- 
tors to nominate students whose classroom 
behavior was constructive and contributed to 
the teaching process. The results of the 
analysis of variance of these ratings of con- 
structive behavior are summarized in Table 1. 
Table 2 shows the percentage of students 
nominated by their instructors. It can be seen 
in Table 1 that both class level and intel- 
lectual ability were related to ratings of 
constructive behavior, at the .01 and .05 
levels, respectively. However, there was also 


a significant interaction between these two 
variables. Inspection of Table 2 reveals the 
basis for this interaction. In general, upper- 
classmen who were above the median in 
intellectual ability were nominated more 
frequently as contributing to classroom dis- 
cussions than were below-the-median upper- 
classmen or freshmen at all levels of intel- 
ligence. The last column in Table 2 sum- 
marizes these data. 

Table 1 also shows a significant interaction 
between social immaturity and class level 
(p < .05). Primarily, this significant effect 
was based upon the fact that among upper- 
classmen, high immature Ss were evaluated 
as contributing significantly more to class- 
room discussion than were middles or lows. 
Inspection of Table 2 shows that this effect 
was limited to more intelligent upperclass- 
men. Within this more intelligent upperclass 
grouping, 53% of the highs as compared to 
19% of the middles and 22% of the lows 
were evaluated by their instructors as con- 
tributing to the classroom discussion. This 
latter finding is consistent with Hypothesis 1, 
which predicted that high immature students 
would be more visible to their instructors. 
However, it appears that, in terms of con- 
structive behavior, the hypothesis holds only 
among more intelligent upperclassmen. 

The last rating scale asked instructors to 
nominate those students in their class whose 
behavior was disruptive of classroom proceed- 
ings. Table 3 summarizes the analysis of vari- 
ance of these ratings, and Table 4 shows the 
percentage of students, classified in terms of 
the independent variables, who were nomi- 
nated by instructors as being disruptive. 


TABLE 2 


PERCENTAGE OF SUBJECTS EVALUATED AS CON- 
TRIBUTING TO CLASS DISCUSSION 








Level of social 
immaturity 


Class Intellectual 








level ability Average 
High | Middle] Low 
Freshmen | Above median 8% 6% 16% 10% 
Below median 3% 15% 8% 9% 
Upper Above median | 53% 19% 22% 30% 
classes Below median | 16% 5% 17% 13% 





Note.—N = 285. 
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TABLE 3 


ANALYSIS OF VARIANCE: INSTRUCTORS’ RATINGS 
OF STUDENTs’ DISRUPTIVE BEHAVIOR 














Source df MS ie 
Immaturity (1) 2 2479 3.17* 
Class level (CL) 1 .0023 <1 
Intellectual ability (A) i 1088 1639 
TCL 2 2649 3.39* 
1x A 2 0725 cal 
CHA 1 .0906 et 
ie CLC A 2 0804 1.03 
Error 273 0781 

p< .05. 


Social immaturity was significantly related 
to ratings of disruptive behavior (p < .05). 
However it can be seen in Table 3 that this 
relationship was modified by class level, as 
indicated by the significant interaction be- 
tween the two variables. Among freshmen, 
high immature Ss were nominated more fre- 
quently than middle immature or low im- 
mature Ss as being disruptive. Among upper- 
classmen on the other hand there was no con- 
sistent relation between levels of immaturity 
and disruptive behavior. The findings are 
again consistent with the general prediction 
that high immature Ss would be more visible 
to their instructors than middle or low im- 
mature Ss. However, in terms of disruptive 
behavior, the findings indicate that this pre- 
diction only holds among freshmen. 


Absenteeism 


During the twelfth week of the 16-wk. 
semester, attendance was taken in all classes. 
Any person absent on the day attendance was 
taken was assigned a score of 1 and those 
present a score of 0. In all, 12 instructors 
took attendance among 459 students; 21% 
of the students were absent. 

Seven instructors habitually took attend- 
ance during the semester, and these instruc- 
tors had 212 students in 11 class sections. 
The remaining five instructors had never 
taken attendance in their classes prior to 
this time. These instructors had 247 students 
in 10 class sections. It should be noted that 
three of the five instructors who did not 
take attendance usually took attendance dur- 
ing other semesters, but refrained this semes- 


On 


ter at the request of the present writer. The 
request not to take attendance arose from 
the author’s interest in testing the effects 
of environmental restraints (the taking of 
attendance in this case) upon the conforming 
behavior of high immature Ss. A prior study 
had suggested that high immature Ss tended 
to conform to social norms when the environ- 
ment contained restraints against noncon- 
formity (Kipnis & Wagner, 1967). 
Attendance was analyzed by means of a 
3 X 2 X 2 X 2 analysis of variance with three 
levels of social immaturity, two levels of 
restraint (instructors did or did not take 
attendance), two levels of class (freshman 
versus upper classes), and two levels of in- 
tellectual ability (above and below the median 
SAT scores) constituting the independent 
variables. There was an average of 18 Ss 
in each of the 24 cells resulting from the 
four-way classification. The cell with the 
smallest entry had 13 Ss and the cell with 
the largest entry had 30 Ss. The dependent 
measure was a score of 1 or O, depending 
upon the absence or presence of the student. 


RESULTS 


Table 5 presents the summary of the 
analysis of variance of the attendance data. 
Three of the independent variables showed 
significant association with attendance; they 
were social immaturity (p < .01), class level 
(p < .05), and restraint (p < .001). 

Table 6 gives the average percentage of 
absent Ss, classified in terms of social im- 
maturity, class level, and restraint. For ease 
of presentation, classification of Ss in terms 


TABLE 4 


PERCENTAGE OF STUDENTS NOMINATED AS 
DiIsRUPTIVE BY THEIR INSTRUCTORS 








Level of social immaturity 





Class Intellectual 
level ability 
High | Middle | Low 
Freshman | Above median} 21% 6% 0% 
Below median | 17% A% 5% 
Upper Above median] 6% 21% 6% 
classes | Below median 8% 5% 0% 





Note.—N = 285. 
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TABLE 5 


SUMMARY OF THE ANALYSIS OF VARIANCE: 
STUDENT ABSENCES 























Source df MS F 
Immaturity (I) 2 859 53455 
Class level (CL) 1 901 5.60* 
Ability (A) 1 033 <1 
Restraint (R) 1 2.166 ioesaee 
iat 2 098 cal 
LA 2 048 <1 
IeeR 2 024 <1 
CES 1 093 <a 
Cle: 1 082 <1 
AXR 1 197 eee, 
aCe ae 2 097 <1 
iP eGr ak 2 .146 <1 
liSGAS aR: 2 004 <1 
CHE OS IN SCAR 1 065 <a 
pCi) ane ak 2 .005 Sault 
fe 
Error 435 161 

*p <.05. 
Kp < 01. 
eee p< (001, 


of intellectual ability has been omitted, since 
this variable was not related to absenteeism, 
either by itself or in interaction with the 
other three independent variables. 

Whether or not instructors took attendance 
yielded the strongest relation with absentee 
rate. When the instructors exercised their 
restraining powers by taking attendance, an 
average of 13% of the students were absent 
as compared to 27% when these powers 
were not used. Further, freshmen had a higher 
absentee rate (25%) than upperclassmen 
(17%). Finally, as predicted, an average of 
29% of the high immature Ss, 18% of the 
middle immature Ss, and 14% of the low 
immature Ss was absent. 

It appears then that, without knowledge 
of actual illness which presumably accounted 
for at least some of the absentees, such 
factors as the student’s personality, his years 
of experience as a college student, and the 
use of restraining power by the instructor 
were significantly related to the presence of 
a student in class. In the extreme cells we 
have 42% of the high immature freshmen 
who were in classes where attendance was 


not taken absent, as compared to 2% of the 
low immature upperclassmen who were in 
classes where attendance was taken. 


Academic Achievement 


Final grades in introductory psychology 
and in an introductory mathematics course 
were used as the primary criteria of academic 
achievement. The analysis was limited to 
216 freshmen completing their second term 
at Temple University. This restriction was 
used in order to control the year in which 
the Ss had taken mathematics. The necessity 
for this control was indicated by a preliminary 
analysis which found that upperclassmen had 
significantly higher grades in a given course 
than did freshmen in the same course, even 
after controlling for intellectual ability. 

Grades were analyzed by means of a 3 X 3 
factorial design, with three levels of social 
immaturity and three levels of intellectual 
ability, as measured by SAT scores, con- 
stituting the independent variables. Final 
grades in both courses, expressed on a 
O (=F) to 4 (=A) scale, were entered as 
the dependent measures. 

Table 7 presents the results of the analysis 
of variance of mathematics and psychology 
grades. There was no significant relation 
between level of immaturity and the final 
grade in psychology. There was, however, a 
significant relation between final mathematics 
grades and level of immaturity (p < .05). 

Table 8 shows the average mathematics 
grades for Ss classified in terms of immaturity 
level and intellectual ability. High immature 
Ss earned lower mathematics grades than 
middle or low immature Ss. While the inter- 


TABLE 6 


PERCENTAGE OF STUDENTS ABSENT 








Level of immaturit 
Class ¥ 


Restraint 
level 
High | Middle} Low 
Freshman | Attendance taken 24% | 14% | 11% 
Attendance not taken| 42% | 37% | 21% 
Upper Attendance taken 229%} OVomee 
classes | Attendance not taken] 27% | 16% | 20% 





Note.—N = 458, 
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TABLE 7 


SUMMARY OF ANALYSIS OF VARIANCE: MATHEMATICS AND INTRODUCTORY PSYCHOLOGY 
GRADES IN RELATION TO INTELLECTUAL ABILITY AND SOCIAL IMMATURITY 
(UppER FRESHMEN ONLY) 




















Mathematics Psychology 
Source 
df MS F df MS F 

Immaturity (I) 2 Salil Sil 2 1.35 1532 
Intellectual ability (A) 2 1.46 1.20 2 AS 11.89** 
eA 4 DNS) erie 4 .08 <1 
Error 207 LA 207 1.02 

7p <.05 

** > < 01 


action between immaturity level and ability 
was not statistically significant, the trend 
appears to be that, as intellectual ability 
increased, the relation between immaturity 
and mathematics grades also increased. Since 
these findings were consistent with prior re- 
search on the moderating effects of ability, 
it was decided to compute separate F tests 
between mathematics grades and level of im- 
maturity within each ability level. These 
separate tests found that the relation be- 
tween level of immaturity and mathematics 
was only significant among Ss who were in 
the top third of intellectual ability (p < .01). 
The differences in grades between high, 
middle, and low immature Ss were not sig- 
nificant among those students in the middle 
third or bottom third of intellectual ability. 

In addition to grades in mathematics, a 
tally was made of the number of freshmen 
either on probation or who had dropped out 
of Temple University at the end of their 
freshman year. Those freshmen on probation 


TABLE 8 


AVERAGE MATHEMATICS GRADES OF SUBJECTS 
CLASSIFIED BY LEVEL OF IMMATURITY AND 
INTELLECTUAL ABILITY 








Level of immaturity 





Intellectual 
ability 
High Middle Low 
High third 1.46 1.80 2.55 
Middle third 1.93 2205) Deol 
Low third aii 1.93 lee 





Note.—N = 216, 


(cumulative GPA of less than 1.60) or who 
had dropped from Temple University were 
given a score of 1 and those freshmen not on 
probation were given a score of 0. These 
data were subjected to a two-way analysis 
of variance and the results of this analysis 
are summarized in Table 9. An additional 
eight freshmen who were excluded from the 
previous grade analysis because of incom- 
plete mathematics or psychology grades were 
included in the present analysis. 

The analysis of variance indicated that 
level of immaturity and intellectual ability 
were both related to unsatisfactory academic 
performance at the .05 and .01 levels, re- 
spectively. Table 10 shows the proportion 
of Ss classified by immaturity and ability 
level who were on probation or dropped at 
the end of their freshman year. It can be 
observed that at all levels of ability a greater 
proportion of high immature Ss than of 
middle immature or low immature Ss was on 
probation or had dropped out of Temple Uni- 
versity. Once again, to test for possible 


TABLE 9 


SUMMARY OF ANALYSIS OF VARIANCE: SUBJECTS 
ON PROBATION OR DROPPED FROM TEMPLE 
UNIVERSITY (UPPER FRESHMEN ONLY) 














Source df MS F 
Immaturity (1) 2 759 4.03* 
Intellectual ability (A) 2 1.586 8.43** 
Wye aN 4 198 1.05 
Error 215 .188 

*p < .05 


78 Daviw KIPNIs 


TABLE 10 


PERCENTAGE OF STUDENTS ON PROBATION 
OR DROPPED 


Level of immaturity 








Intellectual | 
ability 
High Middle Low 
High third ae 71% 4% 
Middle third 36% 4% 
Low third 7 38% 37% 





moderating effects of intelligence, separate F 
tests relating immaturity level to academic 
performance were computed within each abil- 
ity level. The results indicated that only 
among Ss in the middle third of intellectual 
ability was the relation between immaturity 
and grades significant (p < .05). 

In summary, the findings on academic per- 
formance suggest that high immature Ss tend 
to do as well as middle immature or low 
immature Ss in an introductory psychology 
course, but do considerably poorer than 
middle immature or low immature Ss in 
mathematics courses. This latter finding holds 
mainly among Ss who were in the top third 
of intellectual ability. In addition, there was 
a significant trend for more high immature 
Ss than middle immature or low immature 
Ss to be on probation or to have dropped 
from Temple University by the end of their 
freshman year. This trend was most marked 
among Ss in the middle third of intellectual 
ability. 


Comparison of the Validity of the Socializa- 
tion Scale and the Insolence Scale 


As a_ general statement the combined 
Socialization and Insolence scales yielded 
slightly stronger relations with the dependent 
variables studied here than did either scale 
by itself. Considering each dependent measure 
in turn, neither scale was related to the 
instructors’ first ratings of general visibility. 
The Socialization scale was significantly 
related to evaluations of facilitative class- 
room behavior (p < .05), while the Insolence 
scale was not. The Insolence scale was signifi- 
cantly related to evaluations of disruptive be- 
havior (p < .05), while the Socialization scale 
was not significantly related to this evalua- 


tion. In terms of relationships with absen- 
teeism, the Insolence scale was significantly 
related to absenteeism (p< .05) and the 
Socialization scale just missed statistical sig- 
nificance (p < .08). Both scales were equally 
related to mathematics grades. 


DISCUSSION 


It was originally proposed that high im- 
mature students would be more visible in 
class than middle immature or low immature 
students because they would experience less 
anxiety over speaking in public and hence 
should speak more. It was found that as 
freshmen, high immature students were more 
visible to their instructors because of their 
disruptive behavior. As upperclassmen, high 
immature students were more visible to their 
instructors because of their facilitative be- 
haviors. While these findings are consistent 
with the hypothesis of a higher verbal output 
for high immature students, the changes in 
evaluation of this output by instructors from 
freshmen to upper-class years also suggests 
that some form of social learning may be 
occurring during this period; that is, high 
immature students may be learning to modify 
their verbal output so that what they say 
is more acceptable to their instructors. An- 
other obvious possibility, of course, is that 
those high immature students who were 
rated as disruptive in their freshman year 
were dropped from Temple University and 
different high immature students were evalu- 
ated as facilitative in subsequent years. Un- 
fortunately, there are no data on the class- 
room behavior of the same students over a 
period of several years. Such information 
would be necessary to clarify the basis for 
the shift in evaluations (i.e., a longitudinal 
study). 

The second expectation of the present 
study was that high immature students would 
be less conforming to general university 
regulations against “cutting” classes than 
would middle immature or low immature 
students. This prediction was based upon 
previous findings that high immature stu- 
dents had lower internalized restraints against 
violating rules and regulations than did 
middle immature or low immature students. 
Lacking such internalized restraints, it was 
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expected that high immature students would 
be more easily instigated by friends or their 
own impulses to “cut” classes. The findings 
supported the hypothesis in that significantly 
more high immature than middle or low im- 
mature students were absent from class. It 
was also found that when instructors en- 
forced university regulations against absentee- 
ism by taking attendance, there was a sig- 
nificant reduction in the number of all stu- 
dents who were absent. This latter finding 
can be interpreted as indicating that if the 
environment is perceived to contain punitive 
restraints against nonconformity, acts of non- 
conformity will decrease. This interpretation 
is consistent with more general studies of 
coercive power (French, Morrison, & Levin- 
ger, 1960; French & Raven, 1959; Zipf, 
1960) in which it has been found that reliance 
by authority figures upon threats of punish- 
ment induced higher rates of conformity than 
when threats were not relied upon. In essence, 
the taking of attendance by instructors rep- 
resents an exercise of threatening power, 
since university regulations proscribe punish- 
ments for excessive numbers of unexcused 
absences. And to the extent attendance is 
taken, the probabilities are higher that stu- 
dents who are continually absent will be 
reported to the administration. 

As a final point it should be noted that 
the findings on absenteeism among high im- 
mature students are not consistent with a 
hypothesis of Heilbrun’s (1965) that regi- 
mentation imposed upon students by uni- 
versities contributed to the academic failures 
of bright, poorly socialized students. Heil- 
brun, of course, was referring to a general 
system of social and academic regimenta- 
tion imposed upon students. Nevertheless, to 
the extent that taking attendance imposes 
restraints upon students, from Heilbrun’s 
hypothesis one would expect increased resist- 
ance and increased nonconformity among 
bright, poorly socialized students in classes 
where attendance was taken. Actually the 
present study found a rise in conforming 
behavior of presumably an adjustive kind 
among poorly socialized students when re- 
straints were imposed. While there is no long- 
term evidence on whether this kind of re- 
straint eventually frustrated bright, poorly 


socialized students and caused them to drop 
out of school, an analysis of a subsample 
of second-term, bright (upper-third SAT 
scores) freshmen’s final psychology grades 
yielded no differences in grades of high im- 
mature students in classes that did or did 
not take attendance. 

Turning next to academic performance, the 
most interesting finding was that among 
students with high third scores in intellectual 
ability, high immature students did as well 
as middle immature or low immature students 
in a course in introductory psychology, but 
not as well in a course in introductory mathe- 
matics. One likely explanation for these dif- 
ferences in achievement appears related to 
the task demands imposed upon students by 
the mathematics and psychology courses. The 
mathematics course requires constant class 
attendance and home study, including the 
solving of assigned problems, in order to 
follow the developing logic of the course. It 
is generally agreed that once a student falls 
behind in his mathematics assignments, it 
becomes difficult for him to catch up with 
the class. The psychology course, on the 
other hand, can be mastered by a reading of 
the textbook at reasonable intervals. With 
few exceptions, the quizzes and final test in 
psychology are taken directly from the text- 
book. As a result, class notes and attendance 
do not appear crucial for obtaining a satis- 
factory psychology grade. Furthermore, the 
interdependence between various sections of 
the psychology course is only slight, so that 
reading the section of the textbook on per- 
sonality theory, for instance, does not pre- 
suppose much knowledge of learning theory 
which had been taught earlier in the term. 

The task requirements of mathematics then 
appear to require more sustained effort by 
the student and fewer absences from class 
than do the task requirements of psychology. 
Such requirements seem less consonant with 
the behaviors typically shown by high im- 
mature students. While no data were available 
on this point, the author would speculate 
that high immature students’ underachieve- 
ment in mathematics resulted from their 
spending less time in study outside of the 
classroom and being absent from class more 
often than middle immature or low immature 
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students. More generally, these findings 
point to the possibility that high immature 
individuals will underachieve in courses in 
which (a) the units of information are highly 
interdependent so that progress in the latter 
stages depends upon having learned the earlier 
stages of information, and (0) the course 
requires sustained effort by the student both 
in class and in doing homework. 
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The 1st part of this report delineates the importance of organizational 
considerations to the personnel psychologist as he concentrates on recruiting, 
selection, training, job design, and so forth. The 2nd part deals with the 


converse: 


the importance of personnel considerations to the organizational 


psychologist as he deals with problems of morale, supervision, teamwork, 
organizational design, and conflict resolution. 


Among the Inca farmers, work was re- 
garded as an end in itself, a ritualistic form 
of religious worship. But work that was not 
ceremonial lacked sense and meaning. Work 
and worship were inseparable (Von Hagen, 
1957). Nevertheless, more often in non- 
industrialized societies, work was done merely 
to subsist. Better yet, it was delegated to 
women or slaves. The Tibetan concentrated 
on his prayer wheel; the Talmudist on study- 
ing the Law; the Balinese on art; the Athen- 
ian on citizenship; and the Iroquois on mili- 
tary honors. Success, well-being, security, and 
prestige depended not on one’s job but on 
contemplative zeal, scholarship, artistic efforts, 
oratory, or bravery in battle. 

On the other hand, in modern industrial- 
ized society, as among the Incas, work is 
becoming the center of life. Twenty years may 
be spent preparing to enter the world of 
work. As much as half of each working day 
will be spent at work or commuting to and 
from it. Success in life, sense of well-being, 
sense of accomplishment, security, and pres- 
tige all will be tied to what work is done. 
Likewise, work will determine one’s standard 
of living and economic well-being. Therefore it 
should come as no surprise to find that in our 
““work-intoxicated” society there are few con- 
siderations about employment recruiting, test- 
ing, hiring, training, job design, and other 
personnel problems that can be divorced from 
their setting within a larger context of social 
forces, organization, and society. 

More generally, as Porter (1966) noted in 
his recent annual review chapter on person- 
nel management, few issues in personnel psy- 
chology can be completely examined without 
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attention to what usually are thought to be 
problems of organizational psychology. It may 
be profitable at this time to look at some of 
the more lively issues that lie at the interface 
between the two fields. Let us examine some 
organizational questions that appear when 
dealing with what usually are the concern of 
technicians in personnel psychology. To do 
this, I will focus briefly on recruiting, the 
application blank, testing, interviewing, train- 
ing, and job design. Following this I will 
reverse the examination and look briefly at 
some significant personnel questions, ques- 
tions of individual differences, which usually 
are the concern of organizational technicians. 


PERSONNEL IssuES INVOLVING 
ORGANIZATIONAL QUESTIONS 


Recruiting 


In one large sales organization, approxi- 
mately 3% of those applicants for jobs as 
salesmen were hired if they had been at- 
tracted to apply for the jobs through news- 
paper ads. On the other hand, 14% of those 
who had made application because of friends’ 
suggestions or who had been recommended to 
the sales agency by friends were hired. Un- 
fortunately, analyses of the merit ratings a 
year afterwards of those hired found that 
those who had been recruited through news- 
paper ads were in the fiftieth percentile in 
merit while those who had been introduced to 
the organization through friends were likely 
to be found below the twentieth percentile in 
merit. What had been revealed was quite sim- 
ple to a technician. Different selection ratios 
were operating. 
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Those recruited by newspaper advertise- 
ments were screened more carefully; pro- 
portionately fewer were hired. Those recom- 
mended by friends were screened less thor- 
oughly; proportionately more were hired. The 
same phenomenon was seen quite differently 
by the social psychologist. For him, inter- 
viewers were showing too much acceptance of 
applicants simply because they had been sent 
for consideration for employment by friends. 

There has been surprisingly little research 
on the social psychology of applicant sources. 
Yet, personnel technicians are aware of the 
overriding importance to the utility of a se- 
lection program of the selection ratio as well 
as the importance of the quality and quantity 
of recruits who apply for testing. There is 
little systematic evidence of the many biases 
involved in decisions about where to recruit 
and whom to recruit for testing. For example, 
in academia, we seldom try for a low selection 
ratio. Rather, we solicit applications for aca- 
demic jobs from only the most prestigeful 
sources, neglecting the utilities of uncovering 
a larger number of recruits. The problem is 
complicated further when the recruits gravi- 
tate toward or away from certain occupations 
and organizations because of extraneous fac- 
tors. Consider how difficult it is to hire Ne- 
gro engineers when the problem is compounded 
by the difficulties in encouraging Negroes to 
prepare for careers in engineering, since tradi- 
tionally it has been so difficult for them to 
enter such a career. 


Application Blank 


In a sales selection program, serendipity re- 
vealed two dynamic items on an application 
blank (dynamic application questions concern 
the interplay between applicant and organi- 
zation). When combined, these two items 
yielded a phenomenal validity of .70 against 
subsequent merit ratings. Hindsight made it 
relatively easy to understand why these two 
items were such powerful predictors of subse- 
quent job performance. These items were: “Is 
it all right to contact your boss?” and “How 
soon can you start work?” Applicants who 
subsequently were meritorious on the job 
gave an unqualified ‘‘yes” in answer to the 
first question. It was perfectly all right to 
contact their boss. Applicants who did not 


turn out well on the job either said “no” or 
“ves, but. . . .” Those who turned into poor 
performers were ready to start work immedi- 
ately, while those who insisted on at least 
2-weeks notice were likely to be better sales- 
men, subsequently. Probably, what was being 
assessed by these dynamic items was the sense 
of responsibility and sense of security of 
applicants. 

The increasing attention we are giving to 
biographical information blanks points to a 
likelihood of focusing more fully on the social 
and organizational history of the appliant, for 
so much of what is contained in biographical 
information is of this form. Consider some of 
the factors which emerged in Cassen’s (1966) 
factor analyses of biographical information 
blank items in three cultures: upward mobil- 
ity, family attitudes, achievement by conform- 
ing, interpersonal relations, etc. 


Testing 


There has been a merger of interest of so- 
cial and personnel psychologists concerning 
attitudes toward work. There seems to be no 
simple response to the question ‘“‘Why men 
work?” Much research suggests quite strongly 
that what may be rewarding for the task- 
oriented worker may be punishing to the inter- 
action-oriented worker (Bass, 1967b). Non- 
cognitive dimensions of particular importance 
for selection and placement testing lie close 
to work itself or to organizational considera- 
tions. Heretofore, to a considerable degree, 
personality and motivation assessment have 
developed within counseling and_ clinical 
psychology and have been lifted bodily for 
application to industry. There has been more 
concern about neuroticism than there has been 
for need for achievement, yet there is likely 
to be more payoff from concentrating on the 
latter. Even in the case of hospitalized mental 
patients, it seems more prognostic in voca- 
tional rehabilitation efforts for them to know 
about how task oriented they are than how 
neurotic they are (Distefano & Pryer, 1964). 

There has been little further exploration so 
far of some intriguing evidence that ability 
and aptitude tests tend to be more predictive 
of early success in organizations while interest 
and personality factors tend only to become 
useful for prediction of criteria where the 
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criteria are measured much later in the ap- 
plicant’s subsequent career. This proposition 
as well as others which are being generated 
about the dynamic nature of criteria cannot 
help but result in a more sophisticated ap- 
proach to forecasting using psychometric 
measurements. 

Early in the history of industrial psychol- 
ogy, the miniature test was devised for selec- 
tion purposes. Replicas of the real-life job 
were constructed and performance evaluated 
in these simulated situations. Now we see 
much more use of such simulations of organi- 
zations for assessment purposes. Such simu- 
lations can be accomplished through the “‘in- 
basket” technique, leaderless discussions, 
business games, organizational design prob- 
lems, and the like. To date, there has been 
relatively little exploration of the possibili- 
ties of creating for assessment purposes rela- 
tively simple games, each representing impor- 
tant organizational dilemmas with which to 
confront applicants who are seeking careers 
in those organizations. 

Culture fair testing is still another critical 
issue lying between personnel and social psy- 
chology. Is culture fair testing even theoreti- 
cally possible if past behavior is the best pre- 
dictor of future behavior? Is it possible to 
rule out cultural influences in testing without 
eliminating the validity of the tests? To what 
extent does previous organizational history 
play a role in test responses? How different is 
the applicant of today who grows up in a 
school system where the administration of 
objective tests is an almost weekly routine 
from the applicant of the last generation for 
whom objective testing was more of a novelty? 

As long as there is a basic mistrust between 
the applicant and the organization, we will 
live in a situation where the sensible appli- 
cant attempts to distort and fake his results 
on noncognitive tests. In what ways could 
sufficient trust be promoted between appli- 
cant and organization to increase the appli- 
cant’s desire to give honest responses rather 
than socially desirable ones, or responses 
which he judges to be ideal or like those of 
his boss or like those that would land him the 
job? 

Still another issue in testing which may 
involve public and organizational policy ques- 


tions deals with the selection ratio and how it 
is set. 

In one northeastern state, the Human Rela- 
tions Commission suggests that cut-off scores 
should be no higher than necessary to screen 
out unqualified applicants. The company which 
exploits the selection ratio by recruiting more 
applicants and setting higher cut-off scores 
does so at the expense of discouraging appli- 
cants who score reasonably well on the tests 
in question. Which is more important: more 
efficient use of screening devices by the com- 
pany or avoiding the discouragement of ave- 
rage applicants? 


Interviewing 


Studies of the bases upon which interview- 
ers make their decisions are beginning to 
appear more frequently. They may point ulti- 
mately to an interview which is a situational 
test, a replica of an organizational problem 
where performance can be evaluated com- 
pletely objectively, primarily by attention to 
the process by which the interviewee copes 
with the interviewer. This most promising ap- 
proach to interviewing treats it primarily as 
“the social psychology of two-person inter- 
action.” Thus Yonge (1956) permitted the 
interviewer considerable latitude as far as the 
content of the interview was concerned, but 
had him rate the interview process itself, 
specifically to assess the social skill and moti- 
vation demonstrated during this process by 
the interviewee. 

The interview presents a particular chal- 
lenge to the personnel and social psycholo- 
gist, for no matter what can or cannot be 
demonstrated about its utility as an assess- 
ment device, it is almost universal in use for 
selection purposes. Above and beyond con- 
sideration of the interview as needed for trans- 
mittal of information about the job and 
organization to the interviewee and for help- 
ing the interviewee develop realistic expecta- 
tions about the job for which he is applying, 
the typical employer still feels the need for 
face-to-face contact with the applicant be- 
fore making the employment decision. Yet, it 
may be that even where the interviewer has 
been found to add validity to the assessment 
predictions above and beyond what could be 
done by statistical integration alone, such 
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results may be handled more adequately in 
the future by better statistics. For instance, 
more attention to modifier variables may be 
required. Actually, modifiers may be merely 
symptoms of curvilinearity in the predictors. 
Under such circumstances, a Bayesian solution 
will always yield more accurate decisions, and 
in the last analysis will match the regression 
decisions when the data is linear (Clampett, 
1966). At best, Sawyer’s (1966) comprehen- 
sive review of research on methods for com- 
bining assessments strongly indicates that 
predictive efficiency is increased when inter- 
viewer judgments are mechanically combined 
or synthesized with test data to yield final 
predictions in contrast to following the usual 
practice of having interviewers make clinical 
judgments about the job applicant from his 
test scores and interview performance. 

The problem and its solutions therefore 
may be stated as follows: no matter what, 
interviews will be held with applicants. To 
insure that the interviews do as little damage 
as possible to the accuracy of the entire selec- 
tion process, it may be best to increase the 
number of recruits who are tested, thereby 
reducing the selection ratio and at the same 
time increasing the likelihood that all those 
who passed the first test screen, and as a 
consequence receive interviews, are all fairly 
good prospects for the job in question. At the 
same time, the interviewer’s judgments should 
be treated like additional measurements to be 
combined statistically with test data available 
on the applicant to provide a final prediction 
for him. This required change in practice is 
an organizational question since the inter- 
viewer’s role is reduced in importance. Re- 
sistance is likely particularly if the inter- 
viewer has considerable status or experience 
but in its resolution may lie the major por- 
tion of the utility of a selection program. 


Training 


Commenting on the fads and fashions in 
training is like commenting about the weather 
and what can be done about it. Nevertheless, 
organizational matters are vital in the deci- 
sions of training programs to be introduced, 
continued, or abandoned. Let me enumerate 
some well-known training problems at the in- 
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terface of personnel and organizational psy- 
chology. 

1. Trainers have discovered to their cha- 
grin that it is one thing to maintain an ef- 
fective program which meets its training ob- 
jectives, but it is another to create a training 
program which both meets its objectives and 
obtains the approval of the trainees. Unfor- 
tunately, evaluation and continuation of the 
program are more likely to rest upon the latter 
rather than the former. 

2. Familiar also is the organizational prob- 
lem associated with the returning trainee. 
Fleishman (1953) provided sufficient evidence 
that whether or not the effects of the train- 
ing program made their appearance on the 
job 6 months after training depended primar- 
ily on the attitudes of the boss and the 
climate of the organization to which the 
trainee returned. 

3. The professional and the manager need 
to see themselves engaged in “life-long 
learning” to avoid obsolescence. The organi- 
zation can help considerably to maintain ex- 
pectations that its members must continue to 
keep up with new developments. 

4. To a considerable extent much more 
attention may need to be paid to a basic 
conflict between management trainer and 
management trainee. Managers see themselves 
performing a complex art whereas one may 
be trying to teach them simple science. A 
recent unpublished experiment by Alex Ba- 
velas is relevant here and illustrates the tre- 
mendous difficulties involved in the training 
of people under these circumstances. One sub- 
ject learns to discriminate the slide photos of 
healthy and sick cellular tissue through ap- 
propriate reinforcement of his responses. A 
second subject receives the same reinforce- 
ment schedule regardless of what responses 
he makes. The first subject forms a few sim- 
ple hypotheses about what differentiates pho- 
tos of a healthy and a sick cell; the second 
subject forms a complex set of hypotheses 
since his reinforcements have not been asso- 
ciated in any simple way with his different 
responses. Unfortunately, when the two sub- 
jects discuss the matter, the one with the 
complex art of judgment is more confident, 
more resistant to change, and less readily in- 
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fluenced than the other subject with the sim- 
ple hypotheses. 


Job Design 


As the organization introduces automation, 
there is a flattening of the distribution of 
skill demands on newly created jobs and the 
remaining old ones. More routine jobs as 
well as more skilled jobs emerge while those 
at intermediate levels of skill demand are 
abolished. One can only vaguely foresee the 
creation of less-fluid worker castes, the un- 
skilled, and the elite, unless some effort is 
made to intervene with designs for more func- 
tional intermediate-level jobs. For instance, 
electronic data processing (EDP) has created 
the elite programming job. It has also created 
the unskilled job of pressing bent IBM cards 
which are returned by mail. 

At the level of the manager, there seem to 
be two diametrically opposed predictions 
about how EDP is changing the organization 
of the future and the jobs within it. On the 
one hand Leavitt and Whisler (1958) see 
greater centralization decreasing responsibili- 
ties accorded middle managers; others see 
the reverse. Selection and training of future 
middle managers need to take account of 
EDP, but how to do so in the face of the 
uncertain effects of EDP is a dilemma we 
face at this time. Individual differences play 
an important role here. For instance, younger 
men at the same level in the organization see 
that their own jobs are changing as a conse- 
quence of EDP much more so than do older 
men at the same level of management 
(Vaughan & Porat, 1967). 


Compensation 


For too long the questions concerning 
equitable compensation have been left in the 
hands of organizational rationalists. We are 
just beginning to see the full extent of indi- 
vidual differences in attitudes toward com- 
pensation. For instance, in a simulation we 
use for training managers, the managers must 
award salary increases to 10 engineers each 
of whom differs in merit and in other at- 
tributes. Each has a job somewhat different 
than those of the others who are to be as- 
signed salary increases. Wide differences ap- 
pear in the average increases recommended. 
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North Europeans seem to feel that 4-6% is 
equitable, while South Europeans may push 
for as high as 36%. Obviously, differences in 
the rate of inflation in the economies of the 
different countries in question are of conse- 
quence. But just as important are the atti- 
tudes toward technicians and engineers that 
vary greatly from one country to the next. 
In this same exercise, American, British, and 
Irish managers generally seem unmoved by 
the possibility of losing good men to competi- 
tion and do not feel that counteroffers should 
be the basis for salary increases. In fact, 
they sometimes tend to punish men who 
receive offers from other firms. Contrarily, 
Flemings, Norwegians, Italians, Indians, and 
Latin-Americans tend to be more prone to 
award relatively large increases to men with 
counteroffers. Even among managers from the 
same firm, we can see some opting for ex- 
tremely complex differentials while others in- 
sist that regardless of merit or job or senior- 
ity, all engineers ought to receive the exact 
salary increase (Bass, 1967a). 


Criteria of Individual Performance 


Concern for an employee’s performance in 
a firm must be viewed in terms of his firm’s 
objectives. For example, Smith sells more 
merchandise faster than Jones, but Smith’s 
customers complain more about their pur- 
chases. Smith brings in more new, nonrepeat 
business, but Jones has more steady, satisfied 
customers. Who is the better salesman? The 
answer depends on the firm’s goals on what 
it values most. Smith is the better salesman 
if the firm is concerned most about its cur- 
rent share of the market. Jones is the better 
salesman if the firm is concerned about its 
long-term standing in the market. 

We must understand something about how 
to assess organizational worth if we are to 
appreciate how individual personnel differ in 
their contributions to it. Individual differ- 
ences in the proficiency of executives, for 
instance, can only be made meaningful if we 
understand the purposes of their jobs. This 
in turn entails determining the purposes of 
the department’s divisions and ultimately the 
purpose of their enterprise (Fiske, 1951). 
Assessing the adequacy of the performance of 
a manager can be tricky business indeed. As 
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Shartle (1956) pointed out, one may discover 
10 years too late that an executive who con- 
tributed the most to the firm had been dis- 
charged. 


ORGANIZATIONAL IssUES INVOLVING 
PERSONNEL QUESTIONS 


We have seen that the problems of recruit- 
ing, application forms, testing, interviewing, 
training, job design, and compensation, all 
ordinarily the primary interest of the per- 
sonnel psychologist, are likely to contain so- 
cial issues of interest to organizational psy- 
chologists, as well. Now let us reverse roles 
and look at some problems which usually are 
the concern of organizational technicians 
which nevertheless contain significant person- 
nel questions as well: supervision, communi- 
cations, conflict resolution, team composition, 
and organization design. 


Supervision 


For 25 years, there has been major interest 
by social psychologists in the utility of demo- 
cratic or permissive rather than directive 
supervision. But evidence is continuing to 
accumulate that what type of supervision 
works best is often a matter of individual and 
cultural differences. In a number of studies, 
more directive approaches are favored by 
subordinates. Indeed, much direction is more 
often expected in many locales. For instance, 
in a recent pilot study completed in Spain, 
those who acted as subordinates were much 
more favorably disposed toward supervisors 
who attempted to persuade them rather than 
who attempted to share the decisions with 
them, for, they said, a supervisor who does 
not try to influence his subordinates fails to 
accord the subordinates the dignity to which 
they are accustomed by showing that he really 
does not have sufficient interest to bringing 
them over to his position. Numerous studies 
in the United States suggest that those who 
are highly authoritarian prefer in turn to sub- 
mit to authority rather than to operate in an 
environment with opportunities for sharing 
decisions with their superiors. In turn, they 
expect to make decisions for their own sub- 
ordinates (Bass, 1965). 


Communications 


In a similar vein, two-way communication 
is thought more effective as well as more sat- 
isfying than one-way communication. Yet, a 
group of Japanese frustrated an American 
management trainer who was trying to dem- 
onstrate the differences and relatively greater 
values of two-way communication. Among 
the Japanese, with whom he was working, the 
ease of communicating a pattern of rectangles 
one-way was somewhat greater for the task 
which was imposed, and as a consequence 
there was considerable confusion on the part 
of the students who were ostensibly being 
taught and shown the value of two-way over 
one-way communication. We have had simi- 
lar experiences with engineering students who 
if they are communicating the pattern of 
rectangles to others with a great deal of 
background in mechanical drawing and blue- 
print reading, may communicate faster and 
more effectively one-way than two-way. Given 
individuals with common codes, one-way 
communication can be more effective and 
more satisfying than two-way communication. 


Conflict Resolution 


Some zero-sum games where one party wins 
only under the condition that the other party 
loses are situations where conflict resolution 
is impossible for almost anybody. In some 
mixed-motive games, resolution will or will 
not occur, depending on who is playing the 
game. If one is able to compete with a gen- 
erous opponent, it is possible to fleece him. If 
one himself is nurturant, speedy and highly 
satisfying resolutions can be _ achieved 
(Loomis, 1959). There would seem to be con- 
siderable payoff in the study of the interac- 
tion of person and structural conditions in 
various types of negotiating situations. Thus, 
Bass (1966) found that task-oriented nego- 
tiators were most likely to achieve settle- 
ments of high quality in contrast to settle- 
ments reached by less task-oriented negoti- 
ators. 


Team Composition 


We are beginning to understand the per- 
sonal ingredients in the assembly of members 
of a group required to achieve particular 
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outcomes. For example, evidence accumu- 
lated so far suggests if we put together a 
number of highly task-oriented members, we 
are likely to generate a great deal of socio- 
emotional conflict, although at the same time 
the chances are reasonably good that plenty 
of work will get done. At the same time if we 
mix together a group of primarily interaction- 
oriented personnel, the likelihood is that the 
group will be most satisfying to the members, 
there will be much play, but relatively little 
work will be accomplished (Bass, 1967b). 


Organizational Design 


To conclude, let us look at what rational 
organizational designers would suggest as in- 
violate principles of organizational design and 
at the same time note the very opposite points 
of view voiced by behavioralists. 

The rationalists would say that an organi- 
zation should be designed so that someone 
is responsible for supervising all essential ac- 
tivities. The behavioralists would be primarily 
concerned with creating structures where 
leadership was shared. 

The rationalists would argue that responsi- 
bility for specific acts should not be dupli- 
cated or overlapping. The behavioralists 
would feel that overlapping had much merit, 
providing opportunities for cross-training, 
backup, and increasing the reliability of the 
system of interacting workers. 

The rationalists would argue for job simpli- 
fication; the behavioralists would push for 
the reverse, job enlargement. 

The rationalists would insist that responsi- 
bilities should be written, clear, and under- 
stood by job occupants. The behavioralists 
would argue that each person brings to a job 
somewhat different potentials and should be 
given freedom and flexibility to develop his 
own particular way of doing the job to make 
the most of the situation in which he finds 
himself. 

The rationalists would argue that author- 
ity to make decisions should be commensurate 
with responsibility for those decisions. The 
behavioralists would state that authority can- 
not be assigned but rather goes with ability 
and esteem as an individual among associates. 
Furthermore, they would note that the ra- 
tionalists’ principle of authority primarily 


makes it easy for some people to shirk re- 
sponsibility and also increases staff-line con- 
flict. 

The rationalists would want authority to be 
delegated so that decisions take place as 
close as possible to the point of action. The 
behavioralists would insist that the rational- 
ists do not mean what they say, for to ac- 
complish this goal they would accept the be- 
havioralists’ position that individuals should 
be responsible for decisions which affect 
themselves, whereas, in fact, the rationalists 
design organizations so that such decisions are 
lodged with the superiors of the individuals 
who must execute those decisions. 

The rationalists have fixed notions about 
span of control; the behavioralists feel that 
the span of control creates as many problems 
as it solves. For example, the smaller the span 
of control in an organization the taller will be 
its structure resulting in greater separation 
of the top of the organization from the bot- 
tom. More possibilities will arise for filtering 
communications that must be transmitted 
through the organization. There will be greater 
differences between the goals of those at the 
top from those at the bottom of the hierarchy. 
Rather than concern themselves about span 
of control, the behavioralists would be much 
more interested in the opportunities for feed- 
back between subordinate and superior; span 
of control could be much greater where feed- 
back was accurate and easy. 

The rationalists insist on a chain of com- 
mand; the behavioralists say the chain of 
command is often a fiction interfering with 
the flow of communication that is needed by 
the organization, a flow which may be hori- 
zontal, diagonal, and in forms quite different 
than assumed by the rationalists. For exam- 
ple, the relation between the foreman and an 
assembly-line worker has been likened to that 
between a travel consultant and his client. 
The “real boss” in the situation is the ma- 
chine (Bass, 1965). 

Optimum organizational designs are likely 
to be found somewhere between rationalist 
emphasis on predictability and accountability 
through impersonal structure and behavioral- 
ist concern for interpersonal trust and inter- 
personal confidence as the bases of organiza- 
tional stability and growth. Again, one can 
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only agree that organizational design must 
attend to what has been learned about intra- 
personal as well as interpersonal dynamics. 
Where the optimum lies between the ration- 
alist’s and the behavioralist’s positions de- 
pends on the capabilities, training, and in- 
volvement of the personnel in the system. The 
fully programmed rationalist’s organization is 
closer to the optimum when personnel capa- 
bility, training, and involvement are low (and 
the rationalist’s organization is likely to keep 
personnel involvement low). When personnel 
potential is high, the behavioralist’s model 
becomes more feasible. 

The personnel specialist no longer can hide 
behind his validity coefficients, test blanks, 
and training manuals. In turn, the organiza- 
tion scientist cannot remain assured of the 
generality of results from his surveys, field 
studies, and rational analyses. Models for 
organizational and personnel research need 
to take into account sources of variance due 
to jobs and _ organizational environment, 
sources due to individual characteristics, and, 
most important of all, sources due to the pe- 
culiar interactions of individuals and environ- 
ments. 
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TEAM PERFORMANCE AS A FUNCTION OF TEAM 
ARRANGEMENT AND WORK LOAD * 


WILLIAM A. JOHNSTON ann GEORGE E. BRIGGS 


Ohio State University 


Pairs of Ss served as radar controllers in a simulated approach-control task, 
and they were required to alternate in directing aircraft approaches. Pre- 
scribed approach rate was 1 approach every min. (high work load) or every 
2 min. (low work load). S could compensate for his partner’s early or late 
approaches in compensatory teams but not in noncompensatory teams. Each 
team completed 4 experimental sessions. Standardized algebraic average of 
approach times was closer to criterion in compensatory teams than in non- 
compensatory teams, particularly under low load. Under high load fewer 
flight errors occurred in compensatory teams than in noncompensatory teams. 
Team communication inhibited team performance only in the noncompensa- 
tory high-load condition. It was concluded that some team functions hinder, 


others enhance, team output. 


The available evidence favors the conclu- 
sion that team output is an inverse function 
of the extent to which teammates are re- 
quired to coordinate, communicate, or other- 
wise interact with one another (Briggs & 
Naylor, 1965; Johnston, 1966; Kidd, 1961; 
Lanzetta & Roby, 1956, 1957; Naylor & 
Briggs, 1965; Williges, Johnston, & Briggs, 
1966). An important implication of this con- 
clusion is that such team functions should 
be minimal and individual functions maximal 
in the design of multiman systems. However, 
the previous investigations have neglected at 
least two potentially useful team functions: 
the fail-stop and compensatory functions. The 
former is one in which a team member pre- 
vents his partner from committing an error, 
and the latter is one in which a member cor- 
rects or counteracts his partner’s error after 
it has been committed. 


1 This research was carried out at the Human 
Performance Center and was supported in part 
by the United States Navy under Contract No. 
N6i339-1327, sponsored by the United States Naval 
Training Device Center, Orlando, Florida. Repro- 
duction of this publication in whole or in part is 
permitted for any purpose of the United States 
Government. The authors are grateful to Robert 
C. Williges for his help in analyzing the data. 
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These team functions would seem to be 
particularly valuable in systems in which 
errors can eventuate in disaster. For example, 
one can well imagine the fatal consequences 
which would ensue if fail-stop and com- 
pensatory team functions were not incorpor- 
ated into surgical and trapeze teams! Of 
course, the utility of these functions should 
vary with system load. For example, the 
frequency of individual errors and therefore 
the advantage of the fail-stop function should 
be directly related to the load on the system. 
On the other hand, the opportunity for com- 
pensatory activity should be an inverse func- 
tion of system load; the higher the load, 
the more pressed a team member is to com- 
plete his own task, and the less time he has 
to compensate for a teammate’s errors. The 
present study explored the thesis that com- 
pensatory and fail-stop team functions can 
facilitate team output, at least under certain 
conditions of system load. 

In addition, the present study sought to 
further assess the inhibitory team function of 
intermember communication (Williges et al., 
1966). The effect of this team function should 
also vary with system load: the greater the 
load, the less a team can afford the luxury of 
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team communication, and so the greater 
should be the disruptive effect of such com- 
munication. 


MeETHOD 


Task and apparatus. A simulated approach-con- 
trol task was used in which subjects (Ss) served 
as radar controllers (RCs) who were responsible 
for directing aircraft through an approach gate. 
The approaches were to be effected at a prescribed 
rate, and two Ss were required to alternate in 
controlling approaches. That is, the first approach 
was to be directed by RC, the second by RC:, 
and then RC; assumed responsibility for the third 
incoming aircraft, and so on. 

The radar displays were linked to target-generator 
consoles via a special-purpose analog computer 
(Hixson, Harter, Warren, & Cowan, 1954). The 
role of pilots was assumed by well-trained experi- 
mental assistants who worked at the target-generator 
consoles. The RCs delivered instructions to pilots 
over a voice-communication channel, and the pilots 
carried out the instructions faithfully and without 
delay by appropriate manipulation of their con- 
soles. The RC-to-pilot communication protocol 
required that the RC first identify the pilot and 
then issue the command, for example, “Bravo one, 
speed 200 knots”; in return, the pilot was to im- 
mediately confirm the command, for example, 
“Roger; Bravo one, speed 200 knots.” The RCs 
gave only heading and speed commands to the 
pilots; altitude was intentionally omitted from 
consideration in order to maintain a reasonable 
level of task difficulty. 

One of the displays as it appeared at the 
beginning of a session is reproduced in Figure 1. 
The approach gate was located precisely at the 
center of the displayed airspace, and all aircraft 
entered the airspace from the eastern (right-hand) 
periphery. The display was marked with con- 
centric rings, and the distance between adjacent 
rings represented 20 mi. The distance from the 
periphery to the approach gate represented 100 mi. 
The aircraft appearing in the northeast sector of 
the scope were designated as “Alpha” aircraft, and 
those appearing in the southeast sector were referred 
to as “Bravo” aircraft. The Alpha and Bravo air- 
craft were indicated by different codes. The two 
RCs monitored the same airspace but on different 
displays, and they could speak to one another only 
over a voice-communication channel. The Alpha 
RC was assigned to the Alpha aircraft, the Bravo 
RC to the Bravo aircraft. The team-communication 
protocol required that both RCs identify them- 
selves before each conversation. All team com- 
munications were tape-recorded. A partition was 
erected between the two RCs so they could not 
see one another, and an ambient noise generator 
prohibited the RCs from hearing one another ex- 
cept by means of the inter-RC communication link. 

At the beginning of a session, four Alpha air- 
craft and four Bravo aircraft were spaced evenly 
along the eastern periphery of the airspace. An 
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Fic, 1. Reproduction of a display at the start 
of a session. 


Alpha aircraft made the first approach, a Bravo 
aircraft followed, then another Alpha aircraft, and 
so on. A successful approach occurred when an 
aircraft entered the approach gate at 200 kn. on 
a heading of 270°. A “miss” occurred if an aircraft 
in the inner circle of the display crossed the 
longitudinal axis into the western half of the 
airspace in any condition not constituting a successful 
approach. Aircraft were removed from the display 
following successful landings or misses, and as soon 
as one aircraft was removed a new one was posi- 
tioned at the eastern periphery. A safety infraction 
occurred if two aircraft came within 2 mi. of one 
another. However, a safety infraction did not lead 
to the removal of the aircraft. 

The entire operation, including the scoring and 
recording of approaches, misses, and safety infrac- 
tions, the presentation and removal of aircraft, and 
the administration of feedback, was conducted by 
a team of two experimental assistants. 

Subjects and design. The Ss were 64 male under- 
graduates who served in pairs for five consecutive 
sessions. All were paid for their services at the rate 
of $1.25 per session. Four conditions were formed 
from the factorial combination of two levels of team 
arrangement (compensatory and noncompensatory) 
with two levels of system load (low and high). 
Teams were composed of Ss who had volunteered 
for the same experimental time period, and they 
were randomly assigned to the four conditions 
(groups) with the restriction that groups be filled 
equally across the 8 wk. of data collection. 

Procedure. The first session was devoted to 
general instructions and practice, and the experi- 
mental conditions were implemented during the 
remaining four sessions. Each session required 35 
min., and the intersession interval was 24 hr. 

The required approach rate (system criterion) 
was an approach every minute for high load, and 
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one every 2 min. for low load. In the non- 
compensatory arrangement, the system criterion was 
to be satisfied on each approach. In the low-load 
condition, for example, a given approach was to 
be effected precisely 2 min. after the preceding air- 
craft passed the gate regardless of any time error 
that might have accrued over prior approaches. By 
contrast, a given approach in the compensatory 
arrangement was to compensate for the accumulated 
time error. Hence, if a time error of 20 sec. 
late had accrued over prior approaches, the next 
approach was to be 20 sec. early so that the 
average of the approach times would equal the 
system criterion. 

A clock system was mounted on the wall in front 
of the team in such a way that the RCs could keep 
continuous track of their temporal progress in 
guiding aircraft through the approach gate. Each 
clock kept time in terms of minutes and seconds 
up to 1 hr., and could be viewed by only a slight 
shift in an RC’s line of vision from the input dis- 
play. Only one clock was used in the compensatory 
condition, and the RC team was instructed to have 
an aircraft through the approach gate at every 
one (high load) or two (low load) complete ro- 
tations of the second hand. In order to avoid 
anomalies that arise when unusually large time 
errors accrue over previous landings, the clock was 
restarted after every six approaches. 

Two clocks were used in the noncompensatory 
condition: one for the Alpha RC, the other for the 
Bravo RC. When the Alpha RC made an approach 
(regardless of how early or late the approach might 
have been), the Bravo RC’s clock would start and 
a Bravo aircraft was to be at the approach gate 
after either one (high load) or two (low load) com- 
plete rotations of the second hand. Then, when the 
Bravo aircraft passed through the approach gate, 
the Alpha RC’s clock (which had been reset during 
the Bravo approach) would start again, and so on. 

The following aspects of the task characterized 
both team arrangements: (a) A small red light in- 
dicated whose turn it was to effect an approach; 
(b) the timing started upon the completion of the 
first approach of the session; and (c) immediate 
feedback was provided to the RC team of time 
errors (relative to the compensatory or non- 
compensatory criteria), misses, and safety infrac- 
tions immediately after each approach except the 
first. 

Performance measures. The experimental timing 
of each approach began when the immediately 
preceding aircraft passed through the approach 
gate. Approach-time error was defined as the dif- 
ference between actual approach time and desired 
average approach time (actual minus 60 sec. or 
120 sec. depending on the system criterion in effect). 
Thus, a +20-sec. time error indicated an approach 
that was 20 sec. late (e.g., an approach time of 
either 80 sec. or 140 sec.). Both the algebraic and 
the absolute sums of the time errors were calculated 
for each team on each of the last four sessions. 
The algebraic or constant error (CE) score was 


then divided by the absolute or variable error 
(VE) score to provide a single measure for each 
team on each session. 

The CE/VE ratio is analogous to the z or stand- 
ard score; thus, it is a way to compare CEs from 
populations differing in variability. In the present 
study the CE/VE ratio has special implications for 


the compensatory versus noncompensatory com- 
parisons. In effect, the instructions to the com- 
pensatory teams emphasized small CEs (at the 


expense of VE), while the instructions to the 
noncompensatory teams, in effect, emphasized small 
VEs (at the expense of CE). Relative to the non- 
compensatory arrangement, therefore, the com- 
pensatory arrangement was expected to reduce CE 
but to elevate VE. In short, smaller ratio scores 
were expected under the compensatory condition 
than under the noncompensatory arrangement. Fur- 
ther, this difference was expected to be particularly 
pronounced under low system load because this 
condition provides minimal constraints on com- 
pensatory activities. 

Each team was scored also in terms of total 
number of flight errors (misses and safety in- 
fractions) per session. It was anticipated that an 
RC would pay more attention to his partner’s ac- 
tivity, and would therefore be more likely to pre- 
vent his partner from committing errors, in the 
compensatory arrangement than in the noncom- 
pensatory arrangement. This fail-stop function 
should be most pronounced under high load, the 
condition in which flight error frequency should 
be maximal. Hence, error frequency was expected 
to be higher in the noncompensatory arrangement 
than in the compensatory arrangement, particularly 
under high load. 

Team-communication measures. All team com- 
munications were categorized according to the con- 
tent-analysis scheme described elsewhere (Williges 
et al., 1966). This scheme is based on five major 
categories of communication: identification, irrele- 
vant, declarative, tactical, and command messages. 
Teammates should have more reason to com- 
municate with one another in the compensatory 
arrangement than in the noncompensatory arrange- 
ment, and should have more “freedom” to do so 
under low-load than under high-load conditions. 
It follows that team communication should be most 
frequent in the compensatory low-load condition 
and should be most disruptive in the noncom- 
pensatory high-load condition. 


RESULTS 


Team performance. An analysis of variance 
of CE/VE ratios produced the following 
significant effects: system load, F(1/28)= 
12.63, p< .01; team arrangement, F(1/28) 
= 4.24, p< .05; sessions, F(3/84)= 6.49, 
p< .01; and the System Load X Team Ar- 
rangement X Sessions interaction, F(3/84)= 
4.56, p < .01. The character of these effects 
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Fic. 2. The ratio of algebraic error to absolute 
error for each experimental condition as a function 
of sessions. 


is depicted in Figure 2. Specifically, the CE/ 
VE ratio decreased across sessions, increased 
with load, and was higher in noncompensatory 
teams than in compensatory teams, par- 
ticularly on Session 1 under low system load. 
In accordance with prediction, therefore, the 
extent to which “standardized” algebraic error 
was reduced at the expense of absolute error 
was greatest in the compensatory low-load 
condition. The sessions effect arose from a 
greater intersession decline in algebraic error 
than in absolute error, which finding sug- 
gests that practice benefited CE more than it 
did VE. 

The time error per approach averaged 
11.13 sec. and 18.43 sec. under low and high 
load, respectively. These data indicate that 
the RC teams were under considerable time 
stress even under “low” load. Had the low- 
load criterion been, say, one approach every 
3 or 4 min., then the effect of team arrange- 
ment on the CE/VE ratio may have been 
even more obvious. 

The following sources of variance were 
significant in the analysis of variance of 
frequency of flight errors: load, F(1/28) 
= 13.00 p< 01; | Jsessions yf (3/84) == 
55.81, p< .001; Load x Sessions, F(3/ 
84)= 3.49, p< .05; and the Load x Team 
Arrangement X Sessions interaction, F (3/84) 
=3.33, p < .05. The nature of these effects is 
apparent in Figure 3. The frequency of flight 
errors increased with system load and de- 
creased across sessions. As was expected, 
flight errors were less frequent in the com- 
pensatory arrangement than in the non- 


compensatory arrangement, but only in the 
first experimental session under high load. 

Team communication. Team arrangement, 
F(1/28) = 10.86, p< .01, communication 
category, F(4/112)= 26.72, p< .001, and 
Team Arrangement X Communication Cate- 
gory, F(4/112)= 2.83, p< .05, were all 
that attained significance in the analysis of 
variance of communication frequency. Con- 
trary to expectation, team communication 
was not influenced by load. In agreement 
with expectation, however, communication 
frequency was higher in compensatory teams 
(an average of 105 per session) than in 
noncompensatory teams (47 per session). 
Table 1 shows the relative frequencies of the 
five communication categories for each team 
arrangement. The relative frequencies of 
declarative messages, tactical communica- 
tions, and commands were higher in com- 
pensatory teams than in noncompensatory 
teams, and the relative frequency of irrele- 
vant communications was higher in noncom- 
pensatory teams than in compensatory teams. 
Identifications were relatively frequent in 
both team arrangements. 

Correlational analyses were performed to 
assess the relationship between communica- 
tion frequency and team performance (all 
measures averaged across sessions). Signifi- 
cant correlations were obtained only with 
respect to the noncompensatory high-load 
condition, in which Condition CH/VE, r(6) 
= 81, p< .05, CE (algebraic error), 7(6) 
= .93, p< .01, and VE (absolute error), 
r(6)= .91, p < .01, were directly related to 
communication frequency. It may be recalled 
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TABLE 1 


MEAN RELATIVE FREQUENCY OF OCCURRENCE OF 
EAcH COMMUNICATION CATEGORY FOR 
Botu TEAM ARRANGEMENTS 








Team arrangement 


Communication 
category 
Compensatory | Noncompensatory 
Identification 28 oe 
Irrelevant .06 16 
Declarative aoe BD 
Tactical .20 a5 
Command 19 eS 


that noncompensatory teams were instructed, 
in effect, to maximize CE/VE by reducing 
VE at the expense of CE. At first glance, it 
might appear that, under high work load, 
team communication aided noncompensatory 
teams in their effort to maximize CE/VE. 
However, team communication augmented 
CE/VE not by reducing VE and elevating 
CE, but rather by increasing both error 
indexes, the increase apparently being more 
pronounced for CE than for VE. In actuality, 
therefore, team communication disrupted the 
performance of noncompensatory teams under 
high load. That this disruptive effect of team 
communication was restricted to the non- 
compensatory high-load condition is conso- 
nant with prediction and with the fact that 
irrelevant communications were relatively 
prominent in the communication profile of 
noncompensatory teams. 


DIscussION 


The present data lend testimony to the 
proposition that team performance can be 
augmented by the compensatory and fail-stop 
team functions. Despite the fact that the 
absolute level of system load was quite high 
even in the low-load condition, compensatory 
teams were able to partially offset accrued 
time error. Thus, the compensatory arrange- 
ment would be of service in real approach- 
control teams when the normal flow of air 
traffic has been impeded as a result of, say, 
adverse weather. However, this value of 
compensatory activity appears to be an in- 
verse function of system load. On the other 
hand, the utility of the fail-stop function is 


apparently a direct function of system load. 
Consequently, when system load is high, a 
team member may not be able to undo his 
partner’s errors once they have been com- 
mitted, but he may be able to reduce the 
number of errors that his partner in fact does 
commit. Hence, where one team function loses 
value, the other gains. 

Turning now to the communication data, 
it is apparent that team-communication fre- 
quency was augmented by compensatory ac- 
tivity but was not affected by system load. 
However, it is again noteworthy that the 
absolute level of work load was quite high 
even in the so-called low-load condition; a 
greater disparity in the two levels of load 
might have induced a disparity in com- 
munication frequency. In regard to the role 
of team communication in teamwork, the 
present findings further substantiate the pre- 
viously found disruptive effect of team com- 
munication on team performance (Briggs & 
Naylor, 1965; Johnston, 1966; Williges et al., 
1966). Of course, since this disruptive effect 
was limited to the noncompensatory high- 
load condition, it appears that team com- 
munication hinders team performance the 
most when there is the least need and the 
least freedom to communicate. When there is 
enough time and perhaps some reason to 
communicate, team communication neither 
retards nor enhances team performance. 

In summary, certain team functions such 
as team communication are potentially detri- 
mental to team output and should be re- 
stricted in system design and by the operat- 
ing procedures. Contrariwise, other team func- 
tions such as compensation and error pre- 
vention are potentially advantageous and 
should be incorporated into the design of 
those systems for which the unique features 
and effects of these team functions are 
appropriate. 
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LEARNING OF TELEPHONE NUMBERS WITH AND 


WITHOUT LETTER-EXCHANGE PREFIXES 


SHELDON J. LACHMAN 


Wayne State University 


In a group situation, 36 college students, randomly assigned to 4 groups, 
practiced and were tested for recall on 8- or 10-unit lists of telephone 
numbers with letter exchanges or with digits substituted for the letters. In 
terms of mean scores and final achievement levels, groups practicing lists 
of telephone numbers with lettered exchanges performed better than their 
counterparts who practiced corresponding lists of telephone numbers com- 
posed entirely of digits. Items in the traditional telephone identification 
system employing pairs of prefixed letter exchanges were most readily learned. 
Explanations in terms of (a) symbol isolation, (b) interference, and (c) 
familiarity effects are indicated; implications for telephone identification are 


specified. 


Number-exchange systems in recent years 
have substantially replaced letter-exchange 
systems in telephone numbers. Certain in- 
dividuals and even protest organizations, for 
example, the Anti-Digit Dialing League, have 
expressed opposition to telephone company 
policy in that regard. In part, their opposition 
has been based on the alleged increased diffi- 
culty in learning an all-digit code as opposed 
to learning a letter-digit code. 

Inquiry into the research literature did not 
yield reports of experiments in the area of 
telephone number learning. Surprisingly, re- 
quests for such research reports of Bell Tele- 
phone Laboratories also yielded no experi- 
mental reports nor citations of any reports. 

The purpose of the study now reported was 
to obtain evidence concerning the relative 
efficiency of learning lists of telephone num- 
bers having alphabetical or letter exchanges— 
two letters and five digits—as contrasted with 
learning telephone numbers composed of 
digits exclusively—seven digits. 


MertTHOD 


Subjects. The Ss were 36 college students, most of 
them sophomores. They were assigned at random 
into four groups, nine Ss in each group. 

Procedure. Group A practiced a 10-unit list; each 
unit was composed of 2 letters and 5 digits. This is 
the letter-exchange list. Group B practiced a 10-unit 
list; each unit was composed of 7 digits. This is the 
all-digit list. Group C practiced an 8-unit list, each 
unit composed of 2 letters and 5 digits. These were 
the first 8 items in the list used by Group A Ss. 
Group D practiced an 8-unit list, each unit com- 
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posed of 7 digits. These were the first 8 items in 
the list used by Group B Ss, 

Material. Learning lists were prepared by an 
extensive sampling of the 1963 Chicago Telephone 
Directory (white pages). About 500 letter-exchange 
items were selected. From the pool of items, a learn- 
ing list was developed. The attempt was made to 
obtain, for the learning list, exchanges which were 
different from those used in the Detroit area, where 
this study was conducted; this was not completely 
successful. However, no set of exchanges was re- 
peated in the list. No common letters were assigned 
to successive units. As a matter of fact, insofar as 
possible—again not completely successful—the at- 
tempt was made to avoid the use of any letter 
more than once in the series. Excluded were digit 
combinations which would make a word such as 
BE, HE, SO, AT, and IN. By such exclusion, it 
was believed that difficulty level for learning letter 
exchanges would be increased since nonword letter 
combinations would presumably be less familiar. 
Because there was the possibility of confounding O 
as a letter with O as a number, that symbol was 
also excluded. 

Materials were presented to Ss in the form in 
which they were found in the telephone directory. 
An equivalent all-digit list was transmuted from the 
letter-digit list. The learning materials are indicated 
below: 


Letter-Exchange List All-Digit List 


TR 4-9153 874-9153 
KE 6-1748 536-1748 
WH 3-1826 943-1826 
MU 5-9214 685-9214 
VA_ 6-1394 826-1394 
PL 2-6843 752-6843 
FR 6-2419 376-2419 
SA 1-8935 721-8935 
MI 2-1569 642-1569 
HU 3-2965 483-2965 
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The Ss performed under group conditions. Each 
S had a copy of a duplicated list.1 They worked 
at long tables and had abundant room for materials 
and for writing. The Ss were given 20 practice 
periods each of which was immediately followed 
by a recall period. During the recall period, re- 
sponses were written on response forms coded from 
1 to 20; on completion of the recall period, response 
forms for each S were deposited by each S into 
S-coded envelopes. Each practice period was 30 sec. 
long and each recall period was 60 sec. long. In- 
tervals between the practice-recall sequences were 
about 10 sec. in duration. 


RESULTS 


All groups and individual Ss made progress 
in the direction of learning. Groups A, B, C, 
and D obtained mean scores of 4.8, 3.5, 4.8, 
and 3.3, respectively. In each case, the groups 
practicing telephone numbers with letter ex- 
changes performed better than those having 
lists of the same length with appropriate 
numbers substituted for the exchange letters. 

An analysis of learning progress by group- 
ing the data in terms of deciles, quintiles, and 
quartiles disclosed that at each point in the 
learning sequence the letter-digit group is 
superior to its parallel all-digit group. Per- 
formance of Groups A, B, C, and D on the 
twentieth, that is, the final, trial were 7.3, 
6.0, 6.9, and 5.9, respectively. 

Individual differences in performance were 
obvious. In each group, with regard to total 
performance on the 20 trials, the score of the 
best performer was more than double that of 
the poorest performer. 


DIscussIon 


Despite “loading against’ letter-digit com- 
binations (by excluding two-letter word ex- 
changes which would ordinarily occur some- 
times by chance), results indicate that from 
the standpoint of facility in learning telephone 
numbers, the traditional system with a letter 
exchange is more efficient than the “exchange- 
less” or all-digit system. With a 10-unit list, 
on the average, about 37% more material 
is learned of the conventional letter-exchange 
phone numbers over comparable _all-digit 


1 The paired-associates technique was considered 
as an alternative method, but several problems 
concerned with that method and the primary objec- 
tive of this study suggested that the serial method 
was more desirable. 
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phone numbers. With an 8-unit list, on the 
average, about 45% more material is learned 
of the conventional letter-exchange phone 
numbers over comparable all-digit phone 
numbers. 

Two related viewpoints may be pertinent 
in attempting to explain the major finding of 
this research, namely, the superiority of 
letter-exchange learning. These are termed the 
isolation viewpoint and the interference view- 
point. 

1. The isolation viewpoint maintains that 
a small group of symbols of one kind among 
a more numerous group of symbols of another 
kind are apprehended and learned more 
readily, perhaps as a function of contrast. 
In any case, this is related to Von Restorff’s 
(1933) finding that if a syllable is inserted in 
a list of numbers or a number in a series of 
syllables the exceptional or “isolated” item 
will be better recalled. 

2. The interference viewpoint maintains 
that because of the similarity or relative 
homogeneity of items in the all-digit list, 
proactive and retroactive influences within the 
list may have interfering effects on recall. 
A variety of researches on proactive and 
retroactive effects provide research support 
for such a position; much of the literature in 
this area has been summarized by McGeoch 
and Trion (1952, pp. 404-447) and by Under- 
wood (1957). 

There is also a third possibility which is 
that letters are easier to learn than numbers; 
this may be a function of greater frequency of 
contact (experience) with letter material than 
number material under ordinary circum- 
stances. 

Assuming that the results are meaningful 
and generalizable, for the convenience of their 
subscribers, telephone companies might con- 
sider a return to the earlier more conventional 
letter-exchange system involving, if not a 
name exchange, at least a pair of letters. 
The use of letters as opposed to numbers 
provides for a distinct means of differentiat- 
ing, designating, and identifying—perhaps, in 
most cases, with less opportunity for retro- 
active and proactive inhibition. Further, one 
might have several different letter designa- 
tions for the same set of numbers. For ex- 
ample, a 74 could be PI, RI, or SI as well as 


LEARNING OF TELEPHONE NUMBERS 97 


PH, RH, and SH. This would achieve better 
differentiation than using 74 each time. 
749-1752 and 748-6945 might be _ better 
learned as SI 9-1752 and PH 8-4645. This, 
of course, assumes that in the lists to be 
learned, other combinations than 74 are most 
numerous. 

The letter exchange need not necessarily 
be the first two letters of a meaningful word 
as has usually been the case. For example, 
the 83 “exchange” might be dialed if the 
prefixes were VD, TD, or UF, none of which 
are words or word prefixes in the English 
language; but all are distinct from each other 
and all are commonly used abbreviations 
and, therefore, not completely without 
meaning. 

Three-letter combinations of words or of 
nonsense syllables are also not beyond the 
realm of possibility. 


Unfortunately, except for research in- 
volving digit span—long a standard method 
for assessing rote memory—relatively little 
research has been done with the learning of 
number material. Recently, however, Battig 
and Spera (1962) have reported association 
values for numbers from 0 to 100. 
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A concurrent validity study was made of the scores of 210 salesmen and 
16 district managers on 15 personal background dimensions, identified previ- 
ously by factor analysis. Differences between factor-score means for the 
manager and sales groups and for the upper- and lower-rated sales groups 
were significant for the factors Financial Responsibility (.001, .001), Early 
Family Responsibility (.01, .01), and Stability (.001, .05). Multiple-regression 
analyses of the factor scores against each of 5 criterion measures of on-the-job 
behavior yielded correlations of .42, .50, and .36 for the 3 major criteria of 
overall performance, and mean and maximum sales volume rank. The criteria 
route difficulty and tenure gave correlations of .27 and .30. Interpretation of 
the highest-weighted factors in these analyses indicates the logical, dynamic 
relationships between personal background and job behavior. 


The bulk of research on the prediction of 
occupational success from quantitative per- 
sonal-background-data items has taken an 
essentially empirical approach (Buel, 1965; 
Dunnette & Maetzold, 1955; Lockwood & 
Parsons, 1960; Minor, 1958; Mosel & Wade, 
1951; Scollay, 1956; Smith, Albright, & 
Glennon, 1961). It first determines the dis- 
criminative power of individual items in 
measuring a given criterion of success in 
some occupation. Then it combines those 
items which discriminate (with or without 
weighting) into an instrument which yields 
an index or “score.” Such scores have been 
individually cross-validated for a variety of 
occupations in industry and the armed ser- 
vices. They have also contributed materially 
to multiple correlations of test batteries with 
empirical criteria of performance. However, 
the approach has been criticized for its basic 
empiricism (Dunnette, 1962; Morrison, 
Owens, Glennon, & Albright, 1962) and its 
failure to shed any light on the dynamic rela- 
tionships between personal background on the 
one hand and occupational choice and suc- 
cess on the other. 

The present investigation is a sequel to a 
study designed to identify and define some 
of the underlying dimensions of personal 
background data (Baehr & Williams, 1967). 
The dimensions were identified through suc- 
cessive factor analyses of responses to a wide 
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variety of background items by an occupa- 
tionally heterogeneous sample of 680 male 
employees. The final factor analysis yielded 
15 interpretable oblique first-order factors. 
A factoring of the correlations among these 
primaries produced five uncorrelated second- 
order factors which appeared to represent 
broad behavior patterns associated with the 
needs and achievement drives of individuals. 

Two tests were run on the efficacy of the 
background dimensions, as measured by the 
first-order factor scores, in differentiating 
among the 10 occupational groups covered by 
the sample. An analysis of variance yielded 
F ratios which were virtually all significant 
beyond the .05 level of confidence, the ma- 
jority of them beyond the .001 level. A 
complete series of ¢ tests revealed that each 
pair of occupational groups showed differ- 
ences which were significant at the .05 level 
or better and, conversely, that each factor 
discriminated significantly among some of the 
pairs. 

These findings were considered an adequate 
basis for concluding that the dimensions of 
the personal background data were related 
to occupational classification. Further, as- 
suming that an individual’s occupational clas- 
sification is to some extent under his own 
control, then one can assume that the back- 
ground dimensions are also related to oc- 
cupational choice. Moreover, since the ma- 
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jority of the cases in the sample had been 
in their given occupation for some years—and 
thus must be regarded as at least moderately 
successful in it—the final assumption can be 
made that the background dimensions are 
related to occupational success. In the present 
study, this final hypothesis was experimentally 
investigated. The Ss were the 210-man sales 
force and its 16 district managers of a 
specialty-food manufacturing organization 
with branches at various locations through- 
out the United States. 


METHOD 
Variables 
The predictor variables were the unit-weight 


factor scores derived from the 15 first-order factors. 
These factors are defined as follows: 


Factor 1 (School Achievement). Academic achieve- 
ment, particularly in high school, but also, where 


applicable, at college. A general liking for, and 
adjustment to, the school environment. 
Factor 2 (Higher Educational Achievement). 


Special or technical accomplishment and qualifica- 
tion resulting in a relatively late vocational start 
and late assumption of family responsibilities This 
pattern is characterized by eventual occupational 
and financial achievement. 

Factor 3 (Drive). Inner drive to be outstanding 
in performance, to attain high goals even if this 
entails temporary setbacks, to supervise others, and 
to achieve success and advancement. 

Factor 4 (Leadership and Group Participation). 
A desire to establish contact with others as shown 
by membership and interpersonal activity in orga- 
nizations and an interest in influencing others 
through community and social activities. 

‘Factor 5 (Financial Responsibility). Ability to 
manage a personal economy of defined proportions— 
to earn, invest, save, and accumulate. 

Factor 6 (Early Family Responsibility). Early 
marriage and establishment of a family, with the 
husband ordinarily being the sole provider. Demon- 
strated achievement in handling the family’s financial 
affairs. Outside the work situation, the greatest 
interest is in family activities. 

Factor 7 (Parental Family Adjustment). Develop- 
ment of realistic, constructive attitudes and rela- 
tionships in the early family environment. In- 
cluded are relationships among siblings, between 
parents, and between the child and the parents. 

Factor 8 (Stability). Established security and 
stability in the work situation, resulting from a 
past history of good performance. The present con- 
cern is more to maintain what has been achieved 
than to plan for improvement or development. 

Factor 9 (School Activities). Major emphasis on 
active participation in athletic and extracurricular 


social activities at high school but also indications 
of good academic achievement. 

Factor 10 (Professional Successful Parents). A 
parental background characterized by a_ successful 
father, either self-employed or in one of the pro- 
fessions, and by material comfort and a happy 
home life. 

Factor 11 (Educational-Vocational Consistency). 
A preference for occupations which are _ highly 
related (or similar) and are in line with educational 
interests and training. 

Factor 12 (Vocational Decisiveness). Decisiveness 
in choosing an occupation and purposefulness in 
achieving the necessary qualifications, followed by 
an early start in the chosen occupation. 

Factor 13 (Vocational Satisfaction). Satisfaction 
with occupational choice and the expectation that 
peak performance will be some time in the future. 
There is, however, no evidence of consistency of 
application or of drive to achieve high standards 
of performance. 

Factor 14 (Selling Experience). Various kinds of 
selling experience, including door-to-door selling 
and transactions in real estate. 

Factor 15 (General Health). Generally better than 
average health over an extended period (childhood, 
adolescence, adulthood). General freedom from 
physical ailments and from loss of work time 
resulting from illness. 


The criterion indexes were five measures of on- 
the-job performance. The three major ones of these 
were directly related to success in selling. 

Paired-comparison performance rating. Three 
judges—ordinarily the district manager, the branch 
manager, and either the office manager or the 
buyer for the district—rated the salesmen for overall 
performance, using the paired-comparison technique. 
Their preferences were converted to normalized 
standard scores and then averaged to yield the com- 
posite performance measures used in this investiga- 
tion. 

Mean sales volume rank.1 For the past 10 yr. 
the organization has ranked each member of its 
sales force on the basis of his volume of sales for 
the year. The criterion measure used here is the 
average of all ranks assigned to a salesman over 
the period in which his sales volume was ranked. 

Maximum sales volume rank. (See Footnote 1.) 
This figure is the highest ranking the man received 
over the last 10 yr. or over his period of tenure if 
this were less than 10 yr. 

Route difficulty. Each branch manager used the 
paired-comparison technique to rate the sales routes 
in his region from most to least difficult in the 


1JIn the calculations presented in this paper, the 
ranks have been reflected. Thus, a high rank in- 
dicates a high sales volume. The commission re- 
ceived by salesmen is based directly on sales volume. 
For this reason, sales volume was selected as a 
performance criterion rather than pay, where the 
performance aspect is obscured by higher base pay 
for long-tenure salesmen, 
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TABLE 1 


CORRELATIONS BETWEEN CRITERION MEASURES 











nie Bs Performance Mean sales Maximum Route 
Criterion measure rating rank sales rank difficulty Tenure 
Performance rating poe ae — .20** —.13 
Mean sales rank aif’ pee .02 13 
Maximum sales rank Preble LT 5*#* .02 <LO® 
Route difficulty — .20** .02 .02 =D 
Tenure —.13 ice non —.02 
<= 05: 
ple 
¥EK D < 001. 


realization of sales potential. These ratings were 
converted to normalized standard scores. 

Tenure as a salesman. This measure was the num- 
ber of years the individual had been employed in 
the organization as a salesman. 

The correlations between these criterion measures 
are shown in Table 1. 


Subjects 


The sales personnel who were Ss in this study 
were not part of the sample used in identifying 
the personal background dimensions. The sales 
groups in the original 680-man sample were paper- 
forms salesmen and district sales managers in the 
insurance business. The 210 salesmen in the present 
investigation were employed in a food-products 
manufacturing organization supplying institutions 
and retail outlets. Their average education was 
1-2 yr. of college and their average age 42 yr. 

The 16 district managers, representing the first 
line of sales management in the organization, had 
about the same educational background as the 
salesmen. Their average age was 48 yr. Since the 
organization has a strong policy of promotion from 
within and since the district manager’s job still in- 
cludes a fair amount of personal selling, it may be 
assumed that the district managers represent the 
upper end of the performance continuum for the 
sales force. 


Procedure 


The Ss’ factor scores were secured by administer- 
ing a multiple-choice questionnaire called the Per- 
sonal History Index (Baehr, Burns, & McMurry, 
1965). 

T-test analyses. A t-test analysis was made of the 
significance of the differences between the mean 
personal-history factor scores of the group of dis- 
trict managers and of the sample sales group. An- 
other such analysis was run on the differences be- 
tween the mean factor scores of those salesmen 
with ratings of 55 and above on the standard-score 
scale for the paired-comparison assessments and 
those with ratings of 45 and below. 

Multiple-regression analyses. A multiple correla- 
tion for the personal background factors was calcu- 


lated separately for each of the five criterion 


measures. 
RESULTS 


Results of the ¢-test analyses are shown in 
Tables 2 and 3. The three factors of Financial 
Responsibility, Early Family Responsibility, 
and Stability differentiated at statistically sig- 
nificant levels both between the salesman and 
district manager groups and between the 
upper- and lower-rated sales groups. The 
scores on these factors rise successively from 
the lower-rated salesmen through the upper- 
rated salesmen to the district managers. These 
results correspond with the assumed level of 
sales performance of each of these groups. It 
is interesting to note that the three differ- 
entiating factors are among the first 8 of the 
15—those which had previously been identi- 
fied as having the most potential for opera- 
tional use (Baehr & Williams, 1967). 

Results of the multiple-regression analyses 
are given in Table 4. The computer programs 
for these analyses were written in such a way 
that a variable would be included only if its 
degree of variance produced an F ratio at the 
05 level of confidence or better. The ob- 
jectives of this programming were to prevent 
undue “forcing” of variables into the regres- 
sion equations and to lessen the effects of 
purely chance relationships of the variable 
and the criterion. 

The three major criterion measures of 
success are the paired-comparison performance 
rating, the mean sales volume rank, and the 
maximum sales volume rank. Table 1 shows 
that the latter two measures (because of their 
common source data) are highly correlated 
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TABLE 2 


SIGNIFICANCE OF THE DIFFERENCES BETWEEN MEAN PERSONAL-HISTORY Factor SCORES FOR THE 
District SALES MANAGER AND THE SALESMAN GROUPS 
































District sales managers Salesmen 
N = 16 Ne e21(0) 
Personal-history factor ( ( ) t 
M SD M SD 
School Achievement 2.44 1.37 2.19 1.56 .62 
Higher Educational Achievement 5.37 1.32 4.58 1.79 1.75 
Drive 4.75 Dede, 4.23 2.00 .99 
Leadership and Group Participation 2.81 1.67 4.78 1.67 .10 
Financial Responsibility 6.50 1.06 4.86 1.80 Stove 
Early Family Responsibility 9.56 1.54 7.95 2.43 2.60* 
Parental Family Adjustment 7.00 lois) 6.62 2.23 66 
Stability Hees aid 5.56 Del oo 
School Activities 3.31 2.02 3.28 1.87 .26 
Professional Successful Parents 4.19 1.38 S18) 1.58 1.13 
Educational-Vocational Consistency 1.69 98 1533 97 1.40 
Vocational Decisiveness DES eS Delis ete 1.91 
Vocational Satisfaction 6.12 1.69 5.80 1.80 10 
Selling Experience 6.56 1.54 6.23 1.48 85 
General Health 3.19 1.55 3.44 1.08 86 
*> <.01. 
xx 5 < .001. 

(y= .75). The performance rating shows The multiple correlation for the perform- 


moderate correlations (ry = .37 and .24) with ance-rating criterion was .42 and for the 
the mean and the maximum sales volume mean and maximum sales volume ranks, .50 
ranks. and .36, respectively. In the regression anal- 


TABLE 3 


SIGNIFICANCE OF THE DIFFERENCES BETWEEN MEAN PERSONAL-HISTORY FACTOR SCORES FOR THE UpPER-RATED 
AND THE LOWER-RATED SALESMEN ON THE PAIRED-COMPARISON PERFORMANCE INDICES 

















Upper-rated salesmen Lower-rated salesmen 
Personal-history factor een) eS?) t 
M SD M SD 
School Achievement 1.96 1.52 2.19 1.65 82 
Higher Educational Achievement 4.59 1.68 4.69 1.91 .30 
Drive 4.13 2.04 4.15 1.90 .00 
Leadership and Group Participation 2.90 1.66 2.44 1.78 1.48 
Financial Responsibility 5.54 1.43 4.11 1.79 ALODERS 
Early Family Responsibility 8.63 1.98 7.31 2.63 Selmi 
Parental Family Adjustment 6.47 2.25 6.79 2.06 80 
Stability 6.01 1.68 5.08 2.56 2.46* 
School Activities Soe 1.88 3.08 1.76 13 
Professional Successful Parents 3.79 1.46 3.60 155 Bi 
Educational-Vocational Consistency 1.42 95 1225 99 95 
Vocational Decisiveness DDD JESO 2.14 1.12 39 
Vocational Satisfaction 5.69 1.81 5.67 LP 00 
Selling Experience 6.25 1.39 5.92 1.44 127 
General Health 3.42 1.15 3.27 1.11 eal 
* 
wD S01. 


eae Os 
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TABLE 4 
MOULTIPLE-REGRESSION ANALYSES 
Accepted personal- Partial] Simple} Beta 
history factor ry, if weight 














Criterion: Paired-comparison performance rating 
(multiple R = .42) 


5. Financial Responsibility “oo 33 34 
4. Leadership and Group 
Participation 14 14 14 
13. Vocational Satisfaction 14 -06 14 
7. Parental Family Adjust- 
ment ll |) adie} |p il 
6. Early Family Responsi- 
bility ol 18 lid 


1. School Achievement —.10 | —.05 | —.10 


Criterion: Mean sales volume rank (multiple R = .50) 


. Financial Responsibility to 43 38) 
. Stability Pail 39 yp 
. General Health a 2a |e) 
. Higher Educational 

Achievement me 12 .10 


bo on CO MN 








1. School Achievement —.10 | —.08 | —.09 
Criterion: Maximum sales volume rank 
(multiple R = .36) 

5. Financial Responsibility .20 31 ead 
8. Stability m2, Pat mS 

7. Parental Family Adjust- 
ment —.11 | —.11 | —.10 
6. Early Family Responsi- 
bility .10 ply 10 
Criterion: Route difficulty (multiple R = .27) 
3. Drive 19 .16 19 
11. Educational-Vocational 
Consistency ml ml ahi 
6. Early Family Responsi- 
bility —.12 | —.09 | —.11 
15. General Health —.11 | —.11 ) —.11 


Criterion: Tenure as a salesman (multiple R = .30) 


13. Vocational Satisfaction —.20 | —.18 | —.20 
6. Early Family Responsi- 

bility —.19 | —.18 | —.18 

12. Vocational Decisiveness .16 ae 16 


yses on all three criterion measures, the 
factor Financial Responsibility carried the 
greatest weight. For both the sales volume 
criterion measures, the next greatest weight 
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was on the factor Stability. Interestingly 
enough, however, the second highest weighted 
factor for the performance-review criterion, 
which involved the judgment of the men’s 
superiors, was Leadership and Group Par- 
ticipation. The factors Parental Family 
Adjustment (negative), Early Family Re- 
sponsibility, and School Achievement each 
had weights in two of the three analyses. 
Again, it should be noted that all factors 
mentioned are among the first eight—those 
recommended for operational use. 

The two remaining criterion measures— 
route difficulty and tenure as a salesman— 
yielded multiple correlations of .27 and .30, 
respectively. These coefficients are admittedly 
modest. However, they do indicate the flexi- 
bility of the background dimensions in pre- 
dicting a number of different criteria, since 
the correlations of the two minor criteria with 
the three major ones are either zero or in- 
significantly negative. 

In the route-difficulty analyses, the factor 
Drive carried the greatest weight. This is 
logical, since a high score on Drive, as Drive 
is defined here, would be associated with a 
willingness to take on the difficult and chal- 
lenging sales routes. Enquiries to the organ- 
ization did indeed elicit the fact that the 
older, less successful salesmen were generally 
to be found on the easier routes, which de- 
manded mainly a servicing of established 
outlets rather than any real sales activity in 
opening new accounts. 

The highest weight in the tenure analysis 
is a negative one on the factor Vocational 
Satisfaction. This apparently startling result 
can be explained on the basis of the factor 
definition and the particular sales force in- 
volved. In this rather old and established 
sales group (average tenure is over 10 yr.), 
the salesmen of longest standing are gen- 
erally the men on the less difficult, servicing 
routes who no longer strive for promotion or 
for self-improvement through relocation. As 
a result, they are probably resigned to, rather 
than satisfied with, their choice of occupation 
and have no reason to feel that their period 
of peak performance is some time in the 
future. 

Results of this nature emphasize the need 
for care in interpreting personal-history data 
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for purposes of predicting on-the-job behavior. 
The “blind” application of an “index” derived 
from the items in this factor would, it is true, 
probably result in a sales force with low 
turnover. However, that sales force would 
tend to consist of older indivduals who would 
feel they were past their peak performance, 
whose aspiration levels were low, and who 
were more interested in simply maintaining 
their present position than in advancing 
in the organization. 


DISCUSSION 


On the basis of these results, the picture 
that emerges of the successful salesman or 
sales manager is one of a man with a back- 
ground of competent handling of his personal 
economy, an early vocational start with prime 
or sole responsibility for managing family 
finances, and, particularly for the managers, 
a past history of sales achievement and pres- 
ent stability in the work and family situa- 
tions. 

An interesting finding in the analysis of 
the criterion measures of on-the-job behavior 
was that the greatest weight for tenure was 
a negative one on the factor Vocational 
Satisfaction. This latter finding leads to the 
conclusion that it would be inexpedient to 
predict tenure on the grounds of vocational 
dissatisfaction unless the organization is pre- 
pared to maintain a stable, low-turnover 
sales force at the expense of having it com- 
posed of men who feel they cannot improve 
their performance either in other positions 
or in other companies. Further research 
should make it possible to define and measure 
objectively the different background di- 
mensions which characterize successful per- 
formers in a variety of occupations. 

A general aim of this study was to stress 
the need for a rational as well as an empirical 
approach to the use of personal background 
data. An attempt was made, therefore, to 
interpret the significance of the personal back- 
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ground factors in the life of the individual 
and to show their relationships to successful 
performance in different occupational settings. 
By contrast, the more frequently used em- 
pirical approach will yield, for each occupa- 
tional group studied, a frequently highly 
discriminative but uninterpretable composite 
score or index. 


REFERENCES 


BareHr, M. E., Burns, R. K., & McMurry, R. N. 
Personal history index. Chicago: Industrial Rela- 
tions Center, University of Chicago, 1965. 

Bareur, M. E., & WitiiAMs, G. B. Underlying di- 
mensions of personal background data and their 
relationships to occupational classification. Journal 
of Applied Psychology, 1967, 51, 481-490. 

Buet, W. D. Biographical data and the identifica- 
tion of creative research personnel. Journal of 
Applied Psychology, 1965, 49, 318-321. 

DunnettTE, M. D. Personnel management. In P. R. 
Farnsworth (Ed.), Annual review of psychology. 
Vol. 13. Palo Alto: Annual Reviews, 1962. Pp. 
285-313. 

DunnetrTeE, M. D., & Maerrzorp, J. Use of the 
weighted application blank in hiring seasonal 
employees. Journal of Applied Psychology, 1955, 
39, 308-310. 

Lockwoop, H. C., & Parsons, S. O. Relationships 
of personal history information to the performance 
of production supervisors. Engineering Industrial 
Psychology, 1960, 2, 20-26. 

Minor, F. J. The prediction of turnover of 
clerical employees. Personnel Psychology, 1958, 11, 
393-402. 

Morrison, R. F., Owens, W. A., Giennon, J. R., 
& Avsricut, L. E. Factored life history antecedents 
of industrial research performance. Journal of 
Applied Psychology, 1962, 46, 281-284. 

Moset, J. N., & Wave, R. R. A weighted application 
blank for reduction of turnover in department 
store sales clerks. Personnel Psychology, 1951, 
4, 177-184. 

Scottay, R. W. Validation of personality history 
items against a salary increase criterion. Per- 
sonnel Psychology, 1956, 9, 325-336. 

SmitH, W. J., ALBricHt, L. E., & Giennon, J. R. 
The prediction of research competence and crea- 
tivity from personal history. Journal of Ap- 
plied Psychology, 1961, 45, 59-62. 


(Early publication received August 11, 1967) 


Journal of Applied Psychology 
1968, Vol. 52, No. 2, 104-121 


GOALS AND INTENTIONS AS MEDIATORS OF THE EFFECTS 
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Based on the assumption that goals and intentions are the most immediate 
determinants of an individual’s behavior, it was hypothesized that monetary 
incentives would affect task performance only through or by means of their 
effects on the individual’s goals or intentions. 5 experiments were performed 
to explore this hypothesis. 2 dealt with the relationship of performance goals 
to level of performance (output) on a task as a function of incentive condi- 
tion. 3 experiments examined the relationship of behavioral intentions to task 
choice as a function of incentive. In all 5 studies, significant relationships were 
obtained between performance goals or behavioral intentions and behavior. 
However, when goal or intention level was controlled, there was no effect of 
monetary incentive on behavior. In the 3 choice studies where incentives did 
have an initial effect on choice, these choice differences were accompanied by 
equivalent differences in intentions. When the latter were partialed out, the 
original incentive effect was vitiated. The data were interpreted as supporting 


the hypothesis. 


A recent review of the literature on the 
motivational effects of financial compensation 
(Opsahl & Dunnette, 1966) stressed the need 
for more basic research on the mechanisms 
by which money motivates action. The authors 
stated that: 


. there is probably less solid research in this 
area than in any other field related to worker per- 
formance. We know amazingly little about how 
money either interacts with other factors or how 
it acts individually to affect job behavior. 
Speculation, accompanied by compensation fads 
and fashions, abounds; research studies designed 
to answer fundamental questions about the role 
of money in human motivation are all too rare 
[p. 94]. 


The present studies were designed to help 
remedy this situation. The authors’ approach 
was based on Ryan’s (1958) theory of moti- 
vation * which states that: ‘‘a very large pro- 
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Department. The authors would like to thank 
Charles D. Spielberger of Florida State University 
for his helpful criticisms of an earlier draft of 
this manuscript. 

2Primary affiliation now with the University of 
Maryland. 

3 For a more recent statement of Ryan’s theory, 
see also unpublished mimeos, 1964. Chapter I: Ex- 


portion of behavior is initiated by tasks [goals, 
intentions] and a very large proportion of tasks 
lead to the behavior specified by the tasks. ... 
A task is a necessary condition for most kinds 
of behavior [p. 79].” If tasks, goals, or in- 
tentions are a necessary condition for action, 
then incentives such as money should affect 
action only if and to the extent that they 
affect the individual’s goals and intentions. 
This was the major hypothesis tested in the 
studies to be reported here. 

While the above represents little more than 
refined common sense, it should be recog- 
nized that the use of conscious goals and 
intentions as explanatory concepts in psy- 
chology is virtually nonexistent today. It is 
much more common to view incentives (“re- 
inforcements”) as shaping behavior auto- 
matically (without the operation of conscious 
volition). The present studies provided an 
opportunity to test the validity of the latter 
assumption. 

Goals can be differentiated both qualita- 
tively (e.g., “do your best” versus “try to 


plaining behavior; Chapter II: Explanatory con- 
cepts; Chapter V: Experiments on intention, task, 
and set; Chapter VI: Intentional learning; Chapter 
VII: Unintentional learning. Cornell University, 
Department of Psychology. 


104 


EFFrects oF MoNETARY INCENTIVES ON BEHAVIOR 


improve”) and quantitatively (e.g., “try for 
a score of 50” versus “try for a score of 
70”). In the latter case, goal level would be 
synonymous with “level of aspiration.” Two 
of the present investigators have reported a 
number of studies showing strong relation- 
ships between S’s goals and level of perform- 
ance on laboratory tasks. These results in- 
dicate that hard goals produce a higher per- 
formance level (greater output) than easy 
goals (Bryan & Locke, 1967b; Locke, 1966a, 
1966b; Locke & Bryan, 1966b, 1967a, 1967b) ; 
and that specific (quantitative) hard goals 
lead to a higher performance level than a goal 
of “do your best” (Bryan & Locke, 1967a; 
Locke, 1967; Locke & Bryan, 1966a, 1966b, 
1967a, 1967b). A number of studies by 
other investigators also support the assump- 
tion that goals and intentions are important 
regulators of task performance (DeNike, 
1965; Dey & Kaur, 1965; Dulany, 1962; 
Mace, 1935; Stedry, 1962). 

Several recent studies of nonmonetary in- 
centives have suggested that such incentives 
affect choice behavior only through their 
effects on the individual’s conscious goals and 
intentions. For instance, Spielberger, Berger, 
and Howard (1963), Spielberger, Bernstein, 
and Ratliff (1966), and Spielberger, Levin, 
and Shepard (1962) found that the effect 
of verbal reinforcements on verbal behavior 
was a direct function of S’s awareness of the 
concept to be learned and the desire of S to 
get the reinforcement. Dulany (1962) and 
Holmes (1966) found that the individual’s 
verbal responses on such tasks were a direct 
function of his behavioral intentions. 

Locke and Bryan (1966b), using post- 
experimental goal questionnaires, found that 
knowledge of score had no effect on perform- 
ance but that goals were significantly related 
to output. Locke (1967) noted that previous 
studies in this area often confounded goal- 
setting and knowledge effects. The Ss given 
knowledge of score were typically given a spe- 
cific hard goal to aim for, while Ss not given 
knowledge of score were typically told to do 
their best. When these two factors were sepa- 
rated by independent experimental manipula- 
tion of goals and knowledge of score, Locke 
(1967) found that specific hard goals pro- 
duced a higher level of performance than a 
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goal of “do your best,” but that there was 
no effect of knowledge of score per se on 
performance. 

Goal setting as a mediator of the effects of 
time limits on work speed was investigated 
by Bryan and Locke (1967b). It was found 
that time taken to complete a task expanded 
to fill the time available (in line with Parkin- 
son’s Law) or contracted to get done in the 
time allowed, but only to the extent that Ss 
set different goals under the different time 
limits. 

Finally, Meyer, Kay, and French (1965) 
found that participation in performance ap- 
praisal interviews was relatively unlikely to 
lead to improved performance unless specific 
improvement goals were set during the in- 
terviews. The authors write: 


While subordinate participation in the goal- 
setting process had some effect on improved per- 
formance, a much more powerful influence was 
whether goals were set at all... . 

The average percent accomplishment estimate for 
those performance items that did get translated into 
goals was 65, while the percent estimate for those 
items that did not get translated into goals was 
about 27 [pp. 126-127]! 


To the authors’ knowledge, no studies have 
been made of goals and intentions as medi- 
ators of the effects of monetary incentives, 
although preliminary findings by Spielberger, 
Ratliff, and Bernstein (1966) suggested that 
the desire of Ss to earn money on a verbal 
learning task played a role in determining its 
effects on performance. Whyte (1955) has 
shown in real-life settings that the effects of 
monetary incentives on output are by no 
means automatic. A series of behavioristically 
oriented experimental studies of monetary 
incentives by Toppen (1965a, 1965b, 1965c, 
1966) found the following relationships be- 
tween monetary reinforcement schedule and 
performance output on a lever-pulling task: 
(a) larger reward magnitudes and higher re- 
inforcement frequencies led to higher output, 
(6) piece-rate payment yielded a higher out- 
put than time-rate payment, and (c) decreas- 
ing the magnitude of the reward over time 
resulted in performance decrement. Although 
Toppen asserts repeatedly in these studies 
that humans respond “just like animals” to 
reinforcement, his own findings and experi- 
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mental procedures consistently contradict this 
claim. Toppen repeatedly refers to the effects 
on performance of the Ss’ “feelings of obliga- 
tion” to #, their “expectations” and “in- 
terests,” their “ulterior motives,” and their 
“desire to do well,” and he uses elaborate 
instructions and procedures to try to control 
and eliminate such effects. His very efforts in 
this respect reflect the importance and 
ubiquity of conscious factors in human per- 
formance. 

What Toppen ignored was the fact that be- 
tween the offering of a monetary incentive 
and an individual’s overt action, a complex 
sequence of conscious and _ subconscious 
processes occur. The possibility that these 
processes might determine /ow an individual 
will react to the incentive was evidently not 
considered. For instance, before an external 
object (which does not physically touch S) 
can affect an individual’s actions he must 
(ordinarily): perceive and identify it, that is, 
be aware that it exists and of whai it is; fol- 
lowing perception and identification comes an 
appraisal of the object against some standard 
of value, that is, an estimate of the extent to 
which the object will be beneticial or harmful 
to him. As a result of this appraisal (which 
may be entirely subconscious), he may experi- 
ence an emotion: “the psychosomatic form in 
which man experiences his estimate of the 
beneficial or harmful relationship of some 
aspect of reality to himself |Branden, 1966, 
p. 5].” On the basis of his emotional reaction, 
his conscious values, and his perception of the 
task and situation, the individual will usually 
(explicitly or implicitly) set himself a goal 
or end toward which he will direct his be- 
havior and in terms of which its appropriate- 
ness will be judged. He will develop an in- 
tention to act in a certain way. 

It is assumed that the individual’s specific 
goals and intentions are the end result or 
culmination of the complex intervening mental 
processes described above and the most direct 
determinants of actual performance. The pres- 
ent studies were not concerned with the 
question of why particular goals or intentions 
were developed by particular individuals, but 
only with the relationship between such goals 
or intentions, once established, and perform- 
ance. To use Ryan’s (1958) terminology, the 
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authors were concerned with specific or im- 
mediate level explanation. 

To demonstrate that goals and intentions 
were direct determinants of performance and 
mediators of the effects of incentives on be- 
havior, it would have to be shown: (a) that 
goals and intentions were related to behavior 
regardless of incentive condition, but (0d) 
that the reverse was not true—that incentives 
did not affect behavior unless they affected 
goals and intentions. The above requirements 
can be expressed more precisely in terms of 
three specific subhypotheses which were the 
ones tested in the present studies: 

Hypothesis 1. Goals and intentions will be 
related to behavior regardless of incentive 
condition; that is, goals and intentions will 
be related to behavior, both within and 
across different incentive conditions. 

Hypothesis 2. When incentive differences 
do correlate with behavior differences, these 
differences will be accompanied by corre- 
sponding differences in goals or intentions. 

Hypothesis 3. When goal or intention dif- 
ferences are controlled or partialed out, there 
will be no relationship between incentive 
condition and behavior. 

Hypothesis 1 is simply a restatement of the 
general assumption that goals and intentions 
are direct determinants of behavior. If this 
assumption is valid, then the relationship 
should hold up regardless of incentive condi- 
tion. Hypotheses 2 and 3 are directly con- 
cerned with the role of goals and intentions 
as mediators of the effects of incentives. If 
incentives do affect behavior, they should 
affect goals and intentions at the same time 
(Hypothesis 2); if goal or intention differ- 
ences are controlled, incentives should have 
no effect (Hypothesis 3). 

The above three hypotheses were tested 
with respect to two different dimensions of 
behavior: level of performance or quantity 
of output, and direction of behavior or choice. 
Two experiments were performed in which all 
Ss worked on the same tasks under different 
incentive conditions, and level of performance 
(productivity) was the dependent variable. 
The relevant motivational concept in this sit- 
uation is the individual’s performance goal or 
aspiration level. If monetary incentives affect 
performance automatically then different 
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amounts of incentive should produce different 
levels of performance regardless of the level 
of the Ss’ goals (e.g., the higher the incentive, 
the higher the performance level). On the 
other hand, if conscious goals mediate the 
effects of incentives, then incentive differences 
should only produce performance-level differ- 
ences if and to the extent that they produce 
goal-level differences. 

Three experiments were performed in which 
Ss had a choice as to the difficulty of the 
tasks they would work on, and choice diffi- 
culty was the dependent variable. Incentives 
were offered for solving a task (problem) 
successfully regardless of the difficulty of the 
task. This meant that Ss were “reinforced” 
with money more frequently for choosing 
easy tasks than for choosing hard tasks. 

The relevant motivational concept in a choice 
situation is the individual’s behavioral in- 
tention (Dulany, 1962), the choice or re- 
sponse that the individual intends to make. 
If monetary incentives affect task choice 
automatically, then Ss who are paid for suc- 
cess only should be more likely to choose 
easy tasks as the payment for success in- 
creases, regardless of their intentions. On the 
other hand, if behavioral intentions mediate 
the effects of incentives on task choice, then 
incentive differences should produce task- 
choice differences only if and to the extent 
that they produce intention differences. 

The first two experiments deal with level 
of performance and the last three with choice. 
The first, fourth, and fifth experiments were 
designed and carried out by the first author; 
the second was designed and carried out by 
the second author; the third study was the 
responsibility of the third author. 


EXPERIMENT 1 


Method 
Subjects 


The Ss were 127 paid, undergraduate volunteers 
(71 males and 56 females) recruited from the Uni- 
versity of Maryland. All Ss received $3 for participa- 
tion in addition to whatever money (if any) 
they earned in the experiment. 


Task 


The task was giving uses for objects and was 
introduced as a test of creativity. On each trial, S 
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was given a common object (e.g., a cardboard box), 
the task being to list possible uses for the object. 
There were two practice trials followed by three 
blocks of seven experimental trials each. Trials 
lasted 1 min. The Ss wrote their responses on answer 
sheets. During the experiment, Ss kept track of their 
own scores with the instruction that the protocols 
would be rescored after the experiment. The E 
did rescore them, eliminating only those responses 
that were not uses (e.g., if S said “to sell” an ob- 
ject without telling why he sold it, it would not 
be considered a use). Thus, S’s score was the total 
number of uses given without regard to quality. 


Procedure and Conditions 


After the rules were explained to Ss, they were 
given the two practice trials and told to do their 
best. On the first block of seven experimental trials, 
all Ss in Conditions A through E (see below) were 
told to “work at what you would consider to be 
a normal pace for an eight hour day, assuming no 
time limits and that you would not be paid for 
production.” This block was of no direct relevance 
to the present study and will not be discussed 
further. The experimental conditions were intro- 
duced before Blocks II and III. The Ss in each con- 
dition were run together. 


Condition A: Assigned goals; all-or-none incen- 
tive (N=19). Before Block II, Ss were assigned 
the goal of attaining, on each trial in that block, 
a score equal to one more than their mean score 
on the two practice trials. They were offered a 5¢ 
bonus for each trial on which they reached this 
goal. 

Before Block III, Ss were assigned a goal of 
attaining, on each trial, a score equal to one more 
than their best score in Block II. They were offered 
a 10¢ bonus for each time they reached or ex- 
ceeded this goal. 

Condition B: Assigned goals; no incentive (N= 
17). This condition was identical to Condition A, 
except Ss were not offered any bonus for goal 
attainment. 

Condition C: Self-set goals; piece-rate incentive 
(N = 22). The incentives were set for this group 
so that if they performed at the same level as 
Group A, they would receive, on the average, the 
same total pay on each block as Group A had 
received. Thus Ss were offered 6/10 of a cent per 
“use” on each trial on Block II and 4/10 of a cent 
per “use” on Block III. So that their goal level could 
be determined, Ss were told to indicate what their 
goal would be before each block. They were told 
that the goal would have nothing to do with their 
pay, which depended upon the number of uses 
given, but that they should indicate the goal (score) 
they were actually going to try for on each of the 
seven trials in that block. 

Condition D: No goal setting; piece-rate incentive 
(N= 21). This condition was identical to Con- 
dition C, except that Ss did not set any explicit 
goals each block. 
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Condition E: Self-set goals; no incentive (N= 
12). This condition was identical to Condition C, 
except that no incentive for performance was offered. 

Condition A’; Assigned goals; all-or-none incentive 
(N=21). This condition was similar to Condi- 
tion A above, except that the goals in this case 
were set higher than for Group A. On Block I the 
A’ condition Ss were told to do their best. Each 
S’s Block II goal, for each trial, was one more than 
his best score on Block I, and his Block III goal, 
for each trial, was one more than his best score 
on Block II. The incentives were 10¢ and 25¢ for 
reaching or exceeding the goals on Blocks II and III, 
respectively. 

Condition B': Assigned goals; no incentive (N= 
15). This condition was identical to Condition A’ 
above, except that no incentives were offered for 
goal attainment. Thus this condition holds the same 
relationship to Condition A’ as Condition B holds 
to A above. 


All Ss were administered goal-description question- 
naires at the end of the experiment to determine 
whether they had been trying for their assigned or 
self-set goals and to what degree they had been 
trying for the incentives. The responses indicated 
that nearly all Ss were trying for their goals and 
were trying to maximize their earnings. 


Resulis and Discussion 


The data were initially analyzed using 
Groups A through E only. There were no 
significant performance differences among 
these five groups in raw means either on the 
practice trials (indicating the groups were of 
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equal initial ability) or on Blocks II and III. 
However, the within-cells variance for these 
groups was inflated due to wide differences in 
ability of the Ss within each group. Thus 
subsequent analyses were done using only 
improvement scores: (a) from the practice 
trials to Block II, and (0) from Block II 
to Block III. 

The performance means for each group for 
Blocks II and III are shown in Figure la 
and the mean goals for the same groups and 
blocks are shown in Figure 1b (no goals are 
shown for Group D, since they did not set 
explicit goals). In parentheses beside each 
point in Figure 1b are shown the percentage 
of times (trials) on which Ss in that group 
reached or beat their assigned or self-set goals 
on that block. It is evident that the higher 
the goals, the less likely Ss were to succeed 
in reaching them. 

The improvement scores from the practice 
trials to Block II showed no differences 
among the five groups. Since there were no 
differences among these groups on the practice 
trials, and since the goal levels of all groups 
on Block II were similar (see Figure 1b), 
this finding is consistent with Hypothesis 3. 

The improvement scores from Block II 
to Block III tell quite a different story, how- 
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Fic. la and b. Performance means and mean goal-level scores for Groups A 
through E, by block (Experiment 1). 
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ever. If all five groups are included in the 
analysis, the F ratio for the difference in 
performance improvement means is 5.44 (df 
= 4/86), which is significant at the .001 
level. It is clear from Figure 1b that the as- 
signed goal groups (A and B) had consider- 
ably higher and harder goals on Block III 
than the self-set goal groups (C and E). 

To determine the relative importance of 
goals and incentives in regulating perform- 
ance, Groups A, B, C, and E were put into 
a 2 X 2 analysis-of-variance design, the two 
factors being assigned goals versus self-set 
goals (A and B versus C and E), and mone- 
tary incentive versus no monetary incentive 
(A and C versus B and E). The first analysis 
was done using goal improvement scores 
(from Block II to III) as cell entries to 
determine differences in goal level among 
these groups. Using the unequal N method 
described by Winer (1962, pp. 241ff), a sig- 
nificant goal effect was obtained (F = 93.48, 
df =1/66; p< .001) in favor of the as- 
signed goal group. There was no difference in 
goal improvement between the incentive and 
no-incentive groups and no interaction effect. 
If performance differences were confined to 
groups having significant goal-level differ- 
ences, one would expect the only significant 
performance improvement difference to be 
for this same comparison. A similar 2 x 2 
analysis of variance using performance im- 
provement scores (Block II to III) as cell 
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entries yielded a significant goal effect, the 
assigned goal groups (A and B) showing 
greater performance improvement than the 
self-set (C and E) goal groups (F = 8.02, 
df = 1/66; p< .01); there was no effect of 
incentive and no interaction effect. 

Group D did not set explicit goals and 
their mean performance improvement was 
significantly lower than that of Groups C 
and E combined (¢ = 2.82, p< .01). Evi- 
dently the procedure of goal setting itself en- 
hanced performance as compared to no goal 
setting, regardless of incentive condition. 

There was no difference in the goal-level 
changes of Groups A and B (assigned goal 
with and without incentive) nor was there a 
difference in performance between these two 
groups. Similarly, the mean goal levels of 
Groups C and E (self-set goal with and with- 
out incentive) were almost identical and so 
were mean improvement scores. Thus there 
were no performance differences among 
groups offered different amounts of incentive 
providing their mean goal levels were the 
same, 

Added credence is given to the belief that 
incentive differences will not produce per- 
formance differences if the goal levels are the 
same, by data from Groups A’ and B’. The 
performance data for these groups for Blocks 
II and III are shown in Figure 2a, and the 
goal level and success data are shown in 
Figure 2b. Again there was no significant 
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Fic. 2a and b. Performance means and mean goal-level scores for A’ and B’ 
groups, by block (Experiment 1). 
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difference between the two groups either in 
goal-level changes or in performance im- 
provement from Blocks II to III. This 
replicates the similar findings for Group A 
versus B and Group C versus E, above. 

An estimate of the overall between-group 
relationship of performance level to goal level 
was obtained by ranking the 12 performance 
means plotted in Figures la (omitting Group 
D) and 2a, and the 12 mean goal scores plot- 
ted in Figures 1b and 2b. The rank-order 
correlation between these pairs of means was 
.85 which is significant at the .01 level. The 
overall relationship between change in goal 
level and performance improvement from 
Blocks II to III was obtained using the 70 
Ss in Groups A, B, C, and E as the units of 
analysis. The cosine-pi estimate of ret (di- 
chotomizing both variables at the median) 
was .55 which is significant at the .01 level. A 
similar relationship was not found among Ss 
in Condition A’ and B’, however. This is 
probably due to the fact that the range of 
goal improvement scores was less among these 
Ss. Improvement was harder for them since 
they were already at a high level of per- 
formance on Block II. As a result, the im- 
provement scores may have been less reliable 
for these Ss. 

The results of this experiment supported 
Hypotheses 1 and 3. Hypothesis 1 was sup- 
ported by the finding of a significant effect 
of goal level on performance level using 
both group means and individual change 
scores as the units of analysis. Hypothesis 3 
was supported by the finding that, among 
groups which had the same goal level, there 
was no effect of incentive on performance. 
The same goal level produced the same level 
of performance regardless of whether incen- 
tives were offered for goal attainment or not. 
Hypothesis 2 could not be tested in this study 
since there was no initial effect of incentive 
on performance. 

The next study was also concerned with 
level of performance, but was much simpler 
in design. It was decided to compare the 
effects of a piece-rate incentive with those 
of a time-rate incentive. It will be recalled 
that Toppen (1965c) previously found the 
former to produce a higher level of perform- 
ance than the latter. 
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EXPERIMENT 2 


Method 
Subjects 


The Ss were 30 paid, male, undergraduate volun- 
teers recruited from the University of Maryland. 
All Ss were run individually. The Ss were paid as 
indicated below. 


Task 


The task was assembling a number of identical 
objects from Tinker Toy sets. The object to be 
copied resembled a wind vane and was designed 
by the second author to insure a maximum utiliza- 
tion of all the Tinker Toy parts. The S was 
supplied with enough parts to make about 26 
wind vanes, and E could take some apart if S 
made more than this in the time allowed. Boxes 
of like parts were placed in front of S to make 
assembly easier. The S had to make a complete 
wind vane before beginning another (no assembly 
lines were permitted). 


Procedure and Conditions 


The study was introduced as an experiment on 
the development of “manual dexterity” and “manip- 
ulative abilities.” The S was shown the model and 
asked to make two wind vanes just like the model 
in as little time as possible. The model was placed in 
front of S and he could refer to it as needed. The 
E recorded the time taken to make each vane. At 
this point the experimental conditions were in- 
troduced. 

Piece-rate incentive. Half the Ss were told that 
they would get no basic (minimum) pay but would 
receive 12¢ for every vane they put together in the 
next 50 min. This rate was chosen to allow Ss to 
make $3, on the average, in the 50-min. time limit. 
The S was given his fastest assembly time (of the 
two practice vanes) and told how many he could 
expect to make in 50 min. if he worked at that 
same pace. He then had to indicate to EZ the num- 
ber of vanes he would try for and the amount of 
money he would try to make. 

During the experiment, & left the room but could 
observe S through a one-way mirror. The E 
recorded the number of vanes completed every 
5 min. and told S how much time had elapsed 
every 10 min, At the halfway (25 min.) point, 
S was given a 2-min. rest while E counted 
the completed vanes in front of S and asked 
what his final goal would be; that is, S set a new 
final goal. 

Hourly rate incentive. The procedure for this 
group was basically the same as for the piece-rate 
group, except that Ss were told they would get $3 
for participation regardless of output. The Ss made 
goal ratings and were given feedback as with the 
piece-rate group. 


Results and Discussion 


The two incentive groups did not differ 
significantly in initial ability as measured by 
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Fic, 3a and b. Performance means and mean goal level for incentive groups, 
by period (Experiment 2). 


the speed with which they completed the 
fastest of their two practice models. Per- 
formance for the two groups during the first 
and second 25-min. periods of the experi- 
ment, measured in terms of the total number 
of models completed, is shown in Figure 3a. 
The goal levels of each incentive group for 
the first and second 25-min. periods are 
shown in Figure 3b. Since Ss set their initial 
goals in terms of total output (performance 
for the whole 50-min. period), the first-period 
goals were obtained by dividing their initial 
total performance goals by two. Second- 
period goals were obtained by subtracting 
each S’s first-half performance score from 
his revised total output goal (made just be- 
fore the second half of the experiment began). 
Since this revised total goal was made in 
full knowledge of first-period performance, it 
necessarily included S’s second-half perform- 
ance goal. It is clear from Figures 3a and b 
that Ss thought they could do more and 
actually did more in the second half than in 
the first half. 

The percentages of times the Ss in each 
incentive group beat their Period 1 and 2 
goals are shown in parentheses in Figure 3b. 
Note that within periods, higher goals are 
associated with fewer successes, whereas 
across periods higher goals are associated with 
greater success. The latter finding is no doubt 


a result of learning: as Ss became more 
proficient in the second period, they were 
able to set higher goals and to reach these 
goals more often as compared with the first 
period. 

It appears from Figure 3a that the piece- 
rate group completed more models during 
each of the two periods than did the hourly 
group. However, ¢ tests on these two pairs of 
means showed these differences to be well 
within the range of chance expectancy. 
Similarly, the piece-rate group, as shown in 
Figure 3b, appeared to set higher goals for 
themselves in both periods than did the 
hourly group, but neither of the differences 
exceeded chance level. A ¢ test on the differ- 
ence of the changes in productivity from the 
first to the second period showed no difference 
between the two incentive groups, nor was 
the difference in goal changes significant. 

The Ss in both incentive groups were com- 
bined in order to compute correlations be- 
tween ability, first- and second-half per- 
formance, and first- and second-half goals. 
These correlations are shown in Table 1. All 
of the first-order correlations between the 
experimental! variables were significant at the 
01 level. Although first-half goal correlated 
significantly with first-half performance, when 
ability was partialed out the significant rela- 
tionship disappeared (the partial correlation 
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is shown in parentheses). Second-half goal 
was significantly related to second-half per- 
formance, and the relationship remained sig- 
nificant even with first-half performance 
partialed out (r = .57, p < .01; see Table 1). 
Evidently, at the outset the Ss had not had 
enough experience with the task to base their 
goals on anything other than their initial 
ability scores, whereas their goals for the 
second half were set more in accordance 
with their motivation. 

This relationship between second-half 
goal and second-half performance can be 
demonstrated in another way. The Ss were 
regrouped according to the size of the differ- 
ence between their second-half goals and their 
first-half performance. The 15 Ss setting the 
highest goals for the second half in relation 
to their first-half performance were compared 
with the 15 Ss setting the lowest second-half 
goals in relation to their first-half perform- 
ance. The difference in the mean discrepancy 
between the two selected groups was sig- 
nificant at the .01 level (¢ = 6.92), indicating 
that the division was successful. The first- and 
second-half performance of the two dis- 
crepancy groups is shown in Figure 4. The 
difference in performance improvement from 
the first period to the second period was 
significant at the .05 level (¢ = 2.68) in 
favor of the high-discrepancy group. 

The results generally support Hypotheses 1 
and 3. The first hypothesis was supported by 
the significant relationships found between sec- 
ond-half goal level and second-half perform- 
ance for all Ss combined and by the sig- 
nificant difference in performance improve- 
ment of the two discrepancy groups. Hy- 


TABLE 1 


CORRELATIONS BETWEEN GOALS AND PERFORMANCE FOR 
ALL SuBJECTS COMBINED 




















cout Goal a Performance,] Performance, 
half half first half second half 
Ability 718 mu .79 .60 
Goal, 
first half -71 -63 (16>) 62 
oal, 
second half 93 -90 (.57¢) 
Performance, 
first half 85 


Note.—Experiment 2; N = 30. 

8 All r’s over .46 are significant at p < .01. 
b With ability partialed out. 

© With first-half performance partialed out. 
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pothesis 3 was supported by the finding that 
when the goal level of the different incentive 
groups was the same (not significantly differ- 
ent), the effect of incentive condition on 
performance was not significant. Once again, 
the incentives failed to produce any initial 
performance differences so that Hypothesis 
2 could not be tested. 


EXPERIMENT 3 


Method 
Subjects 


The Ss were 22 Trenton State Teacher’s College 
females who were members of an_ introductory 
psychology class. The Ss volunteered for the experi- 
ment and were each paid $1 for participation at the 
end. However, they were told at the outset that 
they would get only what they earned during the 
experiment. 


Task and Procedure 


The task was word unscrambling. There were 
6 practice trials followed by 3 blocks of 10 experi- 
mental trials each. On each trial, S was allowed to 
choose the length of word she would try to un- 
scramble; S could choose any word length from 
3 to 11 letters each time. The S had to make her 
choice before actually seeing the word but was 
told that words of the same length were equivalent 
in difficulty. After making her choice, S turned the 
page in her work booklet and worked on the word 
length she had chosen for 45 sec. On the first block 
of 10 trials, Ss were not offered any incentive. On 
the second block, they were offered 2¢ for each 
word correctly solved regardless of the length of the 
word. On the third block of 10 trials, Ss were 
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offered 10¢ for each word correctly solved regard- 
less of the length of the word. 

Before each block of 10 trials and before each 
trial of each block, S described her behavioral in- 
tention for that block or trial by indicating, on a 
5-point scale, whether she would choose “the easiest 
word(s) possible,’ “fairly easy word(s),” “words 
that are neither too easy nor too hard,” “fairly hard 
word(s),” or “the hardest word(s) possible” on that 
block or trial. The intention scale filled out before 
each block of words will be called the B-intention 
(for block) scale. The intention scale filled out be- 
fore each trial will be called the T-intention (for 
trial) scale. 


Results and Discussion 


The S’s intention ratings were quantified 
on a 5-point scale (5 for the “hardest words” 
alternative, to 1 for the “easiest words” al- 
ternative). The dependent variable was the 
length of the word, in letters, chosen by S 
on a given trial or the mean word length 
chosen on a given block. 

The mean within-S correlation (using 
Fisher z transformations for averaging) be- 
tween T-intentions and word-length choice 
across the 30 trials was .81 (range = .12-—.94; 
Mdn= 80). The mean within-block, be- 
tween-Ss correlation between B-intentions 
and mean word choice was .60 (p< .01). 
The corresponding mean within-block correla- 
tion of mean word choice with the mean 
T-intention was .77 (p< .01). B-intentions 
and mean T-intentions correlated 69 (p< 
.01), on the average, within blocks. 

The mean word-length choices and inten- 
tion ratings of the Ss for each of the three 
blocks and the ¢ ratios for change between 
blocks are shown in Table 2. Clearly, the 
higher incentives led to easier word choices. 
The proportions indicating the number of 
trials out of 10 on which Ss succeeded in 
solving their chosen words were .60, .62, and 
.71 for Blocks J, II, and III, respectively. 
The changes in word-length choice were ac- 
companied by equivalent changes in T-inten- 
tion ratings. The B-intention ratings did not 
completely parallel those for word choice and 
mean T-intentions, but were in the same 
direction. 

Across-block correlations between B-inten- 
tions, T-intentions, incentive, and word choice 
were obtained by treating the data from each 
block as though they were from different 
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TABLE 2 


MEANS AND ? RATIOS FOR CHANGES IN INTENTIONS 
AND WorD-LENGTH CHOICE 














Means 
Change 
Block I Block II | Block TIT 
(0¢) (2¢) (10¢) 
B-intentions 2.91 2.68 2.45 
T-intentions 3.20 2.84 Deas 
6.32 5.68 5.08 


Word-length choice 


t ratios for mean change 





Blocks Blocks Blocks 

I-II II-III I-III 

B-intentions 1.31 1.05 DRaOG 
T-intentions B.a4ee Sto aae SoORs 
Word-length choice] 3.98** 3.035% tiles 





Note.—Experiment 3; N = 22. 
Tipe 05, 
ee pene Olle 


individuals (thus, the V for observations was 
3 X 22 or 66). The incentive was coded 0, 1, 
or 2 (for the O¢, 2¢, and 10¢ conditions, 
respectively). This gave maximum predict- 
ability to the incentive variable since the 
mean word-choice changes were about equal 
between Blocks I and II, and II and III, 
respectively (see Table 2). The across-block 
correlations between B-intentions, mean T-in- 
tentions, incentive, and mean word choice are 
shown in Table 3. Also shown are the in- 
tention—word-choice correlations with  in- 
centive partialed out and the incentive—word- 
choice correlations with intentions partialed 
out. For purposes of significance testing, an 
N of 22 was used in order to be conserva- 
tive (in view of the fact that the 66 observa- 
tions were not independent). The results in- 
dicated that both B-intentions and T-inten- 
tions were significantly correlated with word 
choice both before and after amount of in- 
centive was partialed out. In contrast, while 
amount of incentive was related significantly 
to word choice before partialing, the relation- 
ship was vitiated when T-intentions were 
partialed out, or when both B-intentions and 
T-intentions were partialed out. 

The results of this experiment support all 
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TABLE 3 
RAW AND PARTIAL CORRELATIONS OF INTENTIONS AND INCENTIVE WITH WORD CHOICE 
Word choice 
T-intentions| Incentive Partial out 
Raw 
Incentive | B-intentions | T-intentions B- and 
T-intentions 

B-intentions .69** —.248 Ol Ahi -- — — 
T-intentions —.50* 84** sore = — — 
Incentive —.51* — — 48* —.21 —.22 








Note.—Experiment 3, N = 66obs; 22 Ss. 


8 Significance levels are based on N = 22. A negative correlation indicates the higher the incentive, the shorter the word 


lengths chosen or the “‘easier’’ the intention. 
*p < .05. 
¥E > < 01. 


three of the hypotheses described earlier. 
(1) Intentions were related to choice within 
individuals, between individuals within-blocks 
(where incentive was constant), and between 
individuals across blocks (where incentive 
varied). (2) When incentive differences did 
produce choice differences, these differences 
were accompanied by equivalent differences 
in T-intentions (though not necessarily in 
B-intentions). (3) Finally, when across-block 
differences in intentions were partialed out, 
there was no relationship of amount of in- 
centive to choice. 

The next experiment was similar in design 
to this one except that only the B-intention 
scale was used and only two rather than 
three incentive conditions were included. In 
addition, on the 0¢ block in the next experi- 
ment, different instructions were given to 
different Ss and their effects on performance 
and intentions observed. 


EXPERIMENT 4 


Method 
Subjects 


The Ss were 70 University of Maryland, paid, 
undergraduate volunteers. Twenty-seven Ss were 
female and 43 were male. All Ss received $3 for 
participation plus whatever they earned in the 
experiment. 


Task 


The task was word unscrambling as in the previ- 
ous experiment. In the present case, there were 3 
practice trials followed by 2 blocks of 10 experi- 
mental trials each. Again S could choose any word 
length from 3 to 11 letters on each trial. The Ss 


chose their word length before actually seeing 
the word, but were told that all words at any 
given length were of equivalent difficulty. 


Procedure and Conditions 


Block I. The Ss were divided at random into 
three instructional groups on Block I. Twenty-three 
Ss were told to “try and achieve as much success 
as possible’; 23 Ss were told to “try and achieve 
as great a sense of personal accomplishment or 
personal achievement as possible’; and 24 Ss were 
told to “try and overcome the greatest possible 
challenges (even if you do not often make it).” 
No incentive was offered for success on this block. 

Before the first experimental trial, Ss filled out 
5-point behavioral intention scales, similar to those 
used previously, to indicate their intended word- 
choice difficulty for the next 10 trials (possible re- 
sponse categories ranged from “very easy words” 
to “very hard words”). 

After this, Ss worked on 10 different word choices 
for 45 sec. each, choosing whatever word length 
they wanted before each trial. The Ss were told 
that they were not committed to their intention 
ratings. 

Block II. On Block JI, all Ss were treated 
identically. All were offered 4¢ for each word cor- 
rectly solved in this block regardless of the length 
of the word. Before this block, Ss filled out be- 
havioral intention scales; they also indicated how 
much money they would try to earn on the coming 
block (level of aspired earnings). Again, Ss worked 
for 45 sec. on each word choice, selecting whatever 
word length they wanted before each trial. 


Results and Discussion 


The behavioral intention ratings were 
quantified on a 5-point scale (5 for the “very 
hard words” alternative, to 1 for the “very 
easy words” alternative). The Block I in- 
structions were quantified on a 3-point scale 
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(S for “challenge,” 3 for “achievement,” and 
1 for “success”). The latter quantities were 
based on the authors’ a priori notion of the 
difficulty of the choices these instructions 
would lead to if Ss followed them. The 
dependent variable was the mean word length 
chosen on a given block. 

In Block I, instructions correlated .67 
(p < .01) with mean word choice, while in- 
tentions correlated .88 (p< .01) with word 
choice. The correlation between intentions 
and word choice with instructions partialed 
out was .76 (p< .01); on the other hand, 
the correlation of instructions with word 
choice with intentions partialed out was only 
.08 (ns). Block II intentions correlated .81 
(p < .01) with word choice on that block, 
while aspired earnings correlated .60 (p< 
.01) with the latter variable. 

Figures 5a-c show the mean total word- 
length choices and mean intention ratings 
on each block, for the Ss in each Block I 
instruction condition (‘‘success,” ‘‘achieve- 
ment,” and “challenge”’). Looking first at 
Figure 5a, the group trying for “success” 
on Block I chose easy words on that block 
(mean empirical probability of success = .76) 
and did not change their word choices sub- 
stantially on Block II. The intention-level 
means for these Ss are also similar for Blocks 
I and IJ. Neither the word choice nor the 
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intention means changed significantly between 
blocks. 

Figure 5b shows corresponding data for 
the Block I “achievement” condition Ss. 
These Ss chose moderately difficult words on 
Block I (mean empirical probability of suc- 
cess = .48) and easier words on Block II. 
The ¢ ratios for mean between-block changes 
were significant at the .001 level for both 
the word-length and intention-level measures 
(’s = 4.87 and 4.83, respectively). 

The corresponding data for the Block I 
“challenge” Ss are shown in Figure 5c. These 
Ss chose relatively hard words on Block I 
(mean empirical probability of success = 
.33) and much easier words on Block II. The 
between-block mean changes in word-length 
choice and intention level were both signifi- 
cant at the 001)level (/’s = 7.73 and 9:72, 
respectively ). 

The effect of controlling for intention level 
on the relationship of incentive to choice can 
be seen in Figure 6. Mean word-length choice 
is plotted as a function of incentive for each 
of three levels of intention (Intention Level 1 
corresponds to the “‘very easy words” alterna- 
tive; Level 2, to the “fairly easy words” 
alternative; Level 3, to the ‘‘words that are 
neither too easy nor too hard” alternative; 
not enough Ss checked the “fairly hard” 
and “very hard” words alternatives to make 
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separate plots possible for them). The curves 
show a marked effect of intention level on 
word choice, but little effect of incentive on 
choice. None of the incentive effects is sig- 
nificant within any intention level. 

The effects of controlling for intention can 
also be shown by means of partial correla- 
tion techniques. For this purpose, the inten- 
tion, word-choice, and incentive scores from 
each block were treated as though they came 
from different individuals (making an WN for 
observations of 2 X the number of individuals 
involved). The incentive variable was coded 0 
for Block I (where the incentive was 0¢) and 
1 for Block II (where the incentive was 4¢). 
Thus, the correlations of incentive with the 
other variables were point-biserial coeffi- 
cients. The correlations of intention and in- 
centive with word choice, and relevant partial 
correlations, are shown in Table 4 for each 
Block I instruction group and for all Ss 
combined. 

The results show that intention was sig- 
nificantly related to word choice both before 
and after incentive was partialed out, but 
that incentive was not related to word choice 
when intention was partialed out. This find- 

4It was believed that a biserial coefficient was not 
justified in this case, since the incentive classifica- 
tion did not represent coarse grouping but the 
real value of the two incentive conditions (O¢ and 
4¢). In addition, the results of Experiment 5 in- 
dicated that the difference between “some incentive” 
and “no incentive’ was considerably greater than 
the difference between different amounts of in- 
centive (e.g. 1¢ vs. 2¢). This implies a genuine 
underlying dichotomy. 


E. A. Locks, J. F. Bryan, AND L. M. KENDALL 


60 ee ail Level - 3.0 
0 


Intention Level - 2.0 


50 ests 


Mean Total Word Length Choice (I0 Trials) 


Intention Level - 1.0 





O¢ 4¢ 
Incentive 


Fic. 6. Word choice as a function of amount of 
incentive, for three levels of intention (Experiment 4). 


ing was true for each Block I subgroup as 
well as for all Ss combined. 

Like the previous experiment, these results 
support all three of our hypotheses: (1) in- 
tentions were significantly related to choice 
both within and across blocks; in addition, 
level of aspired earnings was correlated with 
choice on Block II where money was offered 
for success; (2) significant incentive effects 
on choice were accompanied by corresponding 
differences in intentions; (3) when the effect 
of intention level was controlled or partialed 
out, there was no relationship of amount of 
incentive to choice. A further finding of in- 
terest was that intentions predicted perform- 
ance better than instructions. When Block I 
instructions were partialed out, there was still 
a significant relationship of intention to word 


TABLE 4 


CORRELATIONS OF INTENTION LEVEL AND AMOUNT OF INCENTIVE WITH MEAN WorRD-LENGTH CHOICE, 
ACROSS BLOCKS 








Correlation of mean total word-length choice with 





Subjects r Intention level 
‘partial incentive 
Success hse? .18** 
Achievement pSilncs shee 
Challenge ROSE RSH iee 
All subjects .89** ESOnc 





Note.—Experiment 4. 


N 
Amount of incentive 
I pbis Ypbie, partial intention| Subjects | Observations 
— .088 —.10 23 46 
—.51* —.13 23 46 
— .76** 01 24 48 
— .48** —.10 70 140 


® Negative sign indicates the higher the incentive the easier (shorter) the word-length choice, Since there were only two 
values of incentive, 0¢ and 4¢, r’s for incentive are point-biserial coefficients, 


po <.05 {peed on WN for Ss). 
**  <.01 (based on N for Ss), 
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choice on that block, but there was no rela- 
tionship of instructions to choice when inten- 
tions were partialed out. 

While these results strongly replicated those 
of the first study of choice, both studies 
shared possible methodological flaws. First, 
the effects of order were confounded with the 
effects of amount of incentive, since the 
larger incentive always came later. Although 
this would not affect the within-block cor- 
relations, it could affect the mean changes 
in choice. Second, in both previous studies 
there were five intention categories, but only 
two or three levels of incentive. Thus, it 
might be argued that incentives did not get 
a “fair shake” because they had less chance 
to vary than did intentions. The final experi- 
ment on choice was designed to remedy these 
difficulties. 


EXPERIMENT 5 


Method 
Subjects 


The Ss were 64 University of Maryland, paid, 
undergraduate volunteers. Thirty Ss were female 
and 34 were male. The Ss received $3 for participa- 
tion plus whatever they earned in the experiment 
(if anything). 


Task 


The task was word unscrambling as in the 
previous study. The Ss were given nine practice 
words (one at each length from 3 to 11 letters) 
followed by five blocks of five experimental trials 
(words) each. On each experimental trial, S could 
choose any word length from 3 to 11 letters. 


Procedure and Conditions 


Fifty Ss were in the variable-incentive condition: 
on each of the five blocks, each S was offered a 
different incentive, either 1¢, 2¢, 3¢, 4¢, or 5¢, 
for each word correctly solved on that block, regard- 
less of the length of the word. Each S was in all 
five conditions, but the order of the 5 levels of 
incentive was counterbalanced across Ss. Before 
each block of five trials, an instruction sheet in- 
formed Ss of what their incentive would be for the 
coming block. They then indicated on a 5-point be- 
havioral intention scale (like that used in the 
previous experiment) what their intended choice 
difficulty would be for the coming block of five 
trials. Next, Ss made their first word choice and 
worked on their chosen word length for 45 sec. 
before making their choice for the next trial and 
so on. The Ss were told that they were not 
committed to their intention ratings. 

Fourteen Ss were in the no-incentive condition. 
These Ss were not offered any incentive on any 
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block, but filled out an intention scale like that 
of the variable-incentive Ss before each block of 
five trials. At the end of the fifth block, all Ss 
were asked to describe, in their own words, what 
their overall intentions had been during the ex- 
periment. 


Results and Discussion 


The behavioral intention ratings were 
quantified on a 5-point scale as previously, 
and the criterion was again the mean word- 
length choice of each S on each block. The 
incentive conditions were quantified on a 
S- or 6-point scale (depending upon whether 
or not the no-incentive Ss were included) ac- 
cording to the amount of incentive offered. 

The within-incentive correlations between 
intentions and word-length choice were .78, 
63, ./9, .85, and’.79 for the 1¢, 2¢, 3¢, 4¢, 
and 5¢ blocks, respectively (p < .01 for all 
r’s). The corresponding figure for the no- 
incentive Ss was .59 (p < .05), using blocks 
as the unit of analysis, and .61 (p< .05), 
using individuals (sums across blocks) as the 
unit. 

Figure 7 shows the relationship of amount 
of incentive to intentions and word choice. 
The dotted lines connecting the no-incentive- 
group curves with the variable-incentive- 
group curves indicate that the two groups of 
Ss were different. There was a slight but sig- 
nificant tendency within the variable-incentive 
condition for Ss to choose easier words when 
a larger incentive was offered (F = 2.78, 
p< 05; df= 4/196). A similar effect was 
found using intentions as the dependent vari- 
able (F = 3.00; p < .05; df = 4/196). There 
was also a significant difference between the 
mean word choices (for all trials combined) 
of the variable-incentive Ss and the no-incen- 
tive Ss, with the latter choosing harder words 
(¢ = 4.13, p< .01). A similar difference was 
found in the mean intention scores of the 
variable-incentive and no-incentive groups 
(t = 3.33, p< .01), with the latter having 
“harder” intentions. The proportions of suc- 
cess in solving the chosen word lengths were 
SZ eOoweO medi yecoand «26 for theiO¢, l¢, 
2¢, 3¢, 4¢, and 5¢ incentive conditions, 
respectively. 

The effect of controlling for intention level 
is shown in Figure 8 where word choice is 
plotted as a function of incentive for each of 
three levels of intention. The intention levels 
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correspond to the three “easiest” alternatives 
on the intention scale, except that the top 
category includes all intentions of 3 or higher. 
There was a marked effect of intention level 
on word choice, but very little effect of in- 
centive within any given intention level. 
There was a tendency for no-incentive Ss 
with intentions of 3 or more to choose harder 
words than variable (1¢ to 5¢) incentive Ss 
with the same intentions (¢ = 3.15, p < .01), 
but the ¢ ratio may have been inflated due to 
the fact that the observations within each 
group were not independent. (Since the unit 
of analysis was the block, rather than the 
individual, many Ss appeared more than once 
in the analysis.) 

Correlations between amount of incentive, 
intentions, and word choice were obtained 
using independent observations by the fol- 
lowing procedure: for each S, his intention, 
mean word choice, and corresponding in- 
centive scores were chosen for just one of the 
five blocks that he worked on. The data were 
chosen so as to equalize the representation 
of each of the five incentive levels and so as 
to control for order effects. The same pro- 
cedure was used to select data from the no- 
incentive condition, except that only order 
had to be controlled. Thus, the correlations 
between intentions, incentive, and word choice 
are based on only one-fifth of the total data, 
but no S was entered twice in the analyses 
as in the previous two experiments. The rele- 
vant raw and partial correlations are shown 
in Table 5, for the variable-incentive group 
separately and for the variable- and no- 
incentive groups combined. 
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Fic. 7. Mean word choice and intention level as a 
function of incentive (Experiment 5). 
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Fic. 8. Word choice as a function of amount 
of incentive, for three levels of intention (Experi- 
ment 5). 


The correlations between intentions and 
word choice were highly significant (p < .01) 
both before and after partialing out incentive. 
In contrast there was very little relationship 
of amount of incentive to word choice before 
partialing out intentions, and no relationship 
after partialing intention level. 

These findings clearly support those of 
the previous two studies, even with order 
effects controlled and using six different in- 
centive levels: (a) there was a significant 
relationship between intentions and choice 
both within and between incentive conditions; 
(6) significant incentive effects on choice were 
accompanied by corresponding mean inten- 
tion differences; and (c) when intention 
level was partialed out, there was little or no 
effect of incentive on choice. 

A further finding of interest in this study 
emerged from the analysis of Ss’ postex- 
perimental descriptions of their general in- 
tentions in the experiment. Two independent 
raters rated each intention description on a 
1-5 scale according to the choice difficulty it 
suggested (e.g., an intention “to make as 
much money as possible’? would get a score 
of 1, while an intention to “get a sense of 
accomplishment” would be given a score of 
3). The correlation between the score as- 
signed by the two raters was .90, for those 
51 protocols to which both raters felt con- 
fident enough to assign scores. To obtain 
summary scores for these Ss, the two scores 
were summed. Summary scores were obtained 
for an additional 8 Ss, whose ratings were 
only assigned scores by one rater, by dou- 
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TABLE 5 
CORRELATIONS OF INTENTION LEVEL AND AMOUNT OF INCENTIVE WITH MEAN 
Worp-LENGTH CHOICE, ACROSS BLOCKS 
Correlation of mean total word-length choice with 
Subjects Intention level Amount of incentive N 
f. Tpartial incentive r Tpartial intention 
Subjects in 1-5¢ condi- Ogne coon — Ole 04 50 
tion only 
All subjects (1-5¢ and no- efota Aone — .29% —.11 64 
incentive condition 
combined) 


Note.—Experiment 5, 





a Negative sign indicates the higher the incentive the easier (shorter) the word-length choice. 


mp = .05. 
EDT < 01. 


bling the latter score. For these 59 Ss, sum- 
mary postperformance intentions correlated 
.68 (p < .01) with summary preperformance 
intentions (the latter were obtained by sum- 
ming the five intention ratings Ss made 
during the experiment). Summary preper- 
formance and _ postperformance intentions 
correlated .83 (p< .01) and .68 (p< .01), 
respectively, with mean word-length choice 
on the five blocks combined. 

The former correlation was significantly 
greater than the latter (¢ for nonindependent 
r’s = 2.60, p < .05), indicating that the post- 
performance measures were poorer “pre- 
dictors” of actual behavior than preperform- 
ance measures. This would seem to con- 
tradict the claim frequently made by be- 
havioristically oriented investigators that cor- 
relations between postperformance measures 
and performance are spurious because the 
former are only descriptions (or rationaliza- 
tions) of behavior. If this were true, the post- 
performance measures should be at least as 
highly related to behavior as preperformance 
measures, if not more so. 


CONCLUSION 


The results of the five studies reported 
here are consistent with the hypothesis that 
goals and intentions are important determi- 
nants of behavior, and with the hypothesis 
that they are the mechanism by which 
monetary incentives influence behavior. All 
five studies showed significant relationships 
between goals or intentions and behavior 


within and/or between different monetary 
incentive conditions. Further, in each of the 
five studies it was demonstrated that if goal 
or intention level was controlled or partialed 
out, there was no effect of amount of in- 
centive on behavior. This was demonstrated 
in the studies of performance level by show- 
ing that the same goal level produced the 
same performance level regardless of whether 
incentives were offered for performance or 
not. The same fact was demonstrated in the 
studies of choice by showing that when be- 
havioral intention level was partialed out, 
the initial effects of incentives on choice 
were vitiated. Finally, the three studies of 
choice demonstrated that when incentives 
did affect behavior, these behavior differences 
were associated with equivalent differences in 
behavioral intentions. 

The three studies of choice yielded a more 
complete test of the theory than did the 
two performance-level studies since the de- 
pendence of incentive effects on intention 
differences could be demonstrated directly. 
However, the fact that monetary incentives 
did not show any initial effect in the per- 
formance-level studies suggests the lesser 
potency of these incentives as compared with 
performance goals. 

The first study suggests the possible im- 
portance of the manner in which incentives 
are arranged. All-or-none incentives made 
contingent upon reaching a given performance 
level were more effective, in this case, than 
the same payment rate made on a piece-rate 
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system. The reason seems clear: the goals 
Ss set themselves under the piece-rate sys- 
tem were lower than those that were assigned 
under the all-or-none system. There is no 
reason to believe that this will inevitably be 
the case, but it does point up the potential 
importance of the manner in which incentive 
systems are structured. Essentially, a piece- 
rate system lets the individual set his own 
overall work and pay goals, while an all-or- 
none payment system requires that he reach 
a specific level of performance before he gets 
anything. 

Interestingly, back in 1911, Taylor im- 
plicitly recognized the potential difference 
between the two types of incentive system as 
well as the relationship of pay to goal setting. 
Under what Taylor called the traditional 
system of management by “initiative and 
incentive” the worker set his own pace, chose 
his own goal, determined his own work 
methods, and was paid accordingly, typically 
on a piece-rate system. Taylor’s scientific 
management system introduced, among other 
things, performance standards or specific 
work goals for workers. Payment of the bonus 
(or the higher rates of pay which accom- 
panied the introduction of his system) was 
contingent upon reaching assigned work goals. 
This goal was inevitably higher than the level 
of performance which had previously existed. 
Employees were told that they were expected 
to earn so much money (which meant pro- 
duce so much) if they wished to remain on 
the job. Taylor called these “high-priced 
men.” 

Opsahl and Dunnette (1966) recognize the 
possible importance of goal setting in arguing 
that it might be beneficial to integrate pay- 
incentive systems with goal setting in indus- 
try, for example, by having pay increases 
dependent upon fulfillment of work goals (p. 
104), or by having individuals actually set 
pay goals. 

It may be hypothesized that assigned all- 
or-none goals or goals formally agreed upon 
by management and the worker will be more 
effective than self-set goals only to the extent 
that: (a) the former goals are set at a 
higher level than the latter, and (b) the 
former procedure results in a greater or more 
intense commitment of the individual to the 
goal. 
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This brings up the problem of the 
specific psychological role that monetary in- 
centives play in motivation. Although such 
incentives appear not to influence perform- 
ance directly, they may serve a number of 
other important functions: 

1. Incentives can influence the nature of 
the intentions one develops on a task. If the 
payoff for succeeding at a hard task is large, 
the individual may be more likely to set a 
hard goal and take a big risk than if the 
payoff is small; or if the payoff is greater for 
succeeding at easier tasks (as in Experiments 
3-5), this may lead him to establish easier 
intentions and to choose easier tasks. 

2. Monetary incentives may persuade an 
individual to accept goals or tasks that are 
assigned to him. In Experiment 1, the 
“demand characteristics” (Orne, 1962) of 
the experimental situation were sufficient to 
persuade the assigned-goal, no-incentive Ss 
to accept their assigned goals, but such co- 
operation could not be taken for granted in 
most industrial situations. Workers will 
generally not accept a work goal or task 
unless it achieves other valued goals, such as 
to make money. 

3. Monetary incentives may affect the 
degree of commitment of an individual to his 
goal or behavioral intention. Commitment 
may be expected to influence the degree of 
persistence an individual will show in the face 
of difficulty and frustration, the degree to 
which he will retain the goal if it conflicts 
with other goals (e.g., to be liked by co- 
workers), and the probability of his abandon- 
ing the goal altogether and “leaving the 
field” in the face of alternatives. 

It must be stressed that whatever the 
effects of monetary incentives on performance, 
their ultimate impact should be a function of 
the degree to which the individual values 
money as compared to other incentives and 
his perception of the degree to which a given 
course of action is seen as a means of attain- 
ing this value (i.e., the perceived instrumen- 
tality of behavior in gaining the value; 
Vroom, 1964). 

While the present research did not ex- 
plore the evaluation process itself, the find- 
ings did indicate that attention should be 
paid to the goals and intentions which result 
from it. Many incentive programs are 
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based on the implicit assumption that incen- 
tives will affect behavior more or less auto- 
matically. The present findings show such an 
assumption to be vastly oversimplified and 
highly misleading, even in the context of 
laboratory experiments in which there is a 
high degree of control over the situational 
variables. 
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EFFECTS OF CHANGES 


IN JOB-SATISFACTION 


LEVELS ON EMPLOYEE TURNOVER 


CHARLES L. HULIN 1 


University of Illinois 


The results of a program designed to increase the job satisfaction and decrease 
the turnover rate among a large group of female clerical workers are presented. 
Results indicated that levels of satisfaction were increased, with the largest 
increases occurring in the satisfaction variables stressed in the program. A sig- 
nificant decrease in turnover (from 30% to 12%) was also observed. Several 
alternative explanations for increased satisfaction and decreased turnover were 
considered, due to the lack of a completely adequate control group. None was 
considered adequate to explain the particular pattern of results obtained. 


Previously reported data (Hulin, 1966) 
indicated the presence of an abnormally high 
turnover rate among the female clerical 
workers employed by the corporate offices of 
a large Canadian manufacturing firm. The 
turnover rate among the female employees of 
this company (30%/yr) was not only high 
on an absolute basis but was high relative 
to 15 other large firms located in Montreal 
who had a mean annual turnover rate of only 
20%. The company involved ranked either 
first or second in turnover rate during each 
of the last 10 yr. among this group of com- 
panies. Many of the ecological factors which 
contribute to high turnover rates (general 
economic conditions of the area, labor market 
conditions, competition from other firms) and 
the characteristics of females in the work 
force (age, marital status, previous experience, 
domestic responsibilities, pregnancy, mar- 
riage) were uncontrolled but were common 
to the other firms in the area who had much 
lower turnover rates. Therefore, considera- 
tion was given to the role job satisfaction and 
dissatisfaction played in contributing to the 
high turnover rate. 

The relationship between job satisfaction 
and turnover was studied (Hulin, 1966). 
Job-satisfaction questionnaires were adminis- 
tered to 345 female clerical workers employed 
at this company. The questionnaire measured 
satisfaction with six aspects of work. A year 
after the questionnaires were administered, 

1 The writer would like to thank James Anderson 
and Susan Pfeifer for their assistance on the data- 
analysis stages of this research and Milton Blood 


who read and commented on an earlier draft of the 
paper. 
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43 of the girls had quit who had completed 
the questionnaires and who could be identified 
on the basis of birth date, department, lan- 
guage, and marital status. A number of girls, 
who quit during the year following the survey, 
had started work after the survey in July 
1964, a number did not report sufficient 
biographical information on the question- 
naire so positive identification was possible, 
and some had changed departments. 

For each of the 43 terminators, 2 control 
Ss were drawn from the sample of girls who 
had remained on the job. These controls were 
selected so they matched the terminators on 
age, years of education, and mother tongue. 
A comparison of the mean satisfaction scores 
of these two groups indicated the two groups 
were significantly different in terms of group 
centroids. Correlational analyses indicated a 
multiple correlation between the satisfaction 
variables and turnover of .34 (p< .01) and 
a significant Pearson correlation between an 
unweighted sum of the satisfaction variables 
and turnover of —.27 (p < .01). 

On the basis of these results the question 
was raised as to whether the turnover rate 
for this company could be reduced by in- 
creasing the job satisfaction of the clerical 
workers. In order to answer this question a 
program was initiated by the company to at- 
tempt to increase job satisfaction in the hope 
of reducing turnover. It is not the purpose of 
this article to analyze the program per se. 
However, a brief description may be in order. 
The results of the scores on the Job Descrip- 
tion Index (JDI) indicated that dissatisfac- 
tion of the clerical workers was occurring in 
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all areas. (The JDI is a cumulative-point, 
adjective checklist type of scale. It has under- 
gone extensive validation research and has 
been described elsewhere—Quinn and Kahn, 
1967, p. 456; Vroom, 1964, p. 100. The JDI 
was constructed to measure five separate 
aspects of job satisfaction: satisfaction with 
work done, with the pay, with the supervisor, 
with promotional opportunities and policies, 
and with co-workers.) Analysis of their re- 
sponses to open-ended questions substantiated 
this finding and also indicated that their dis- 
satisfaction with pay was directed as much 
at the wage and salary administration as it 
was at wage /evels. Examination of the work- 
ers’ responses on the satisfaction with promo- 
tions scale indicated that the girls felt dis- 
satisfaction was due to their feelings of being 
on “dead-end jobs” with little or no chance 
for promotion and not due to their feelings 
about the fairness of the policies. These anal- 
yses led the company to revise their wage 
and salary administration policies to make 
them more nearly constant across departments 
and to institute regular salary reviews and 
formalize the merit-raise procedure. Also a 
policy was initiated of allowing and encourag- 
ing intracompany transfers so the girls could 
move from one department to another where 
they would have a better chance of being 
promoted. In conjunction with this latter 
procedure the girls were encouraged to make 
changes in their own job so that they could 
have a more responsible job within their 
present job classification. The most compelling 
reason for choosing these job aspects to con- 
centrate on in the program was because they 
are the job characteristics which are most 
fuliy under the control of the company. The 
entire program was put into effect in October 
1964. 

The purpose of the present article is to 
analyze the changes which have occurred in 
this company since July 1964 regarding satis- 
faction scores, turnover rates, and the relation- 
ship between satisfaction and termination 
decisions. 


MrtHop 
Research Setting 


The company in which this study was carried out 
has been previously described (Hulin, 1966; Hulin 
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& Smith, 1967). Briefly, it is a large international 
manufacturing company which employs approxi- 
mately 400 female clerical workers in the corporate 
offices. During the 3 yr. (1961, 1962, 1963) preceding 
initial study the turnover rate among the female 
clerical staff was 30.3%, 30.0%, and 30.0%. This 
company’s rate of turnover was stable and had 
shown no tendencies in either direction over the 
past 10 yr. Turnover rate was computed by obtain- 
ing the ratio of number of quits per year to 
average number of female employees where average 
number of employees was equal to the average of 
the number at the start of the year and the number 
at the end of the year. 


Subjects 


The entire female clerical staff was asked to 
participate in the job-satisfaction survey which was 
to be conducted by the company. The workers were 
informed of this by their supervisors and by a letter 
from the personnel department. They were told that 
the questionnaires were completely anonymous and 
that their individual responses would never be re- 
vealed to the company. The questionnaires were 
given to the staff members by departmental rep- 
resentatives after the representatives had met with 
the investigator and had the questionnaires ex- 
plained to them. The questionnaires were sealed in 
envelopes and returned either to the investigator 
at the offices or sent through the mail directly to 
the University of Illinois. Of the 350 members of the 
clerical staff present during the week of the survey, 
298 (85.1%) participated in the survey. (Not all of 
the staff was present due to sickness, vacation, 
travel, etc.) 


Variables 


The job satisfaction of these clerical workers was 
measured by the Job Description Index (JDI). In 
addition to the five satisfaction variables, measures 
were also obtained of each worker’s age, education 
level, job level (obtained by matching reported 
job title to the job-evaluation scale of the company), 
socioeconomic background (measured by placing the 
occupation of the father on an 11-point scale from 
professional to unskilled worker), and mother 
tongue. Measures of all variables were obtained 
during the first week of July 1966. By the first 
week of April 1967, 32 girls had quit, of whom 20 
could be positively identified. However, the data of 
4 of these 20 had to be eliminated from subsequent 
analysis because 3 of them were summer students 
and 1 of them did not report enough biographical 
information to yield measures of the control vari- 
ables so that it was possible to obtain control Ss. 


RESULTS 


The turnover rate among female clerical 
workers during 1966 was 12%. This is sig- 
nificantly lower than the turnover rate of 
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TABLE 1 


MEAN 1964 anv 1966 JDI SatTIsFAcTION SCORES 











| Mean 1964 | 1964 Mean 1966 1966 
Variable scores percentile scores percentile t 
(n = 345) | equivalents (n = 298) equivalents 
Satisfaction with: 

Work | 35.33 44 36.11 47 85 
Pay | 15.01 14 32.83 66 1027755 
Promotions 10.78 48 24.58 74 10.38** 
Co-workers 41,53 39 | 43.49 47 2.45* 
Supervision 40.85 48 | 43,23 60 | Zia 
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30% in 1964 before the changes in the com- 
pany’s programs and policies were initiated. 
The turnover rate during 1965 was 18% in- 
dicating that there has been a steady decrease. 

As in the 1964 survey, mean satisfaction 
scores were computed for each of the five job 
areas. Table 1 shows these mean satisfaction 
scores as well as those obtained in 1964. 
t tests of the significance of the difference 
between the 1964 and 1966 means were 
computed. The results of these ¢ tests are 
shown in Table 1. Of the five job areas, 
four—pay, promotion, supervision, and co- 
workers—showed a significant increase in 
satisfaction from 1964 to 1966. Promotion 
and pay satisfaction showed the biggest in- 
creases. For both these variables the increase 
was significant beyond the .001 level. The 
mean satisfaction with work did not increase 
significantly from 1964 to 1966. While this 
analysis (multiple ¢ tests done on correlated 
dependent variables) is not exactly correct 
and yields an underestimate of the a level for 
the results of the study, it is the most nearly 
correct analysis considering the hypothesis of 
the study. The concern was not with changes 
in the total set of dependent variables but 
with where the changes occurred. Further, 
a T’ analysis, the multivariate analogue of the 
t test (Cooley & Lohnes, 1962), would not be 
appropriate either since the variance-co- 
variance matrices from the 1964 and 1966 
samples were not equal. This, of course, vio- 
lates one of the assumptions of the TJ? 
analysis. 

The columns headed “1964 and 1966 per- 
centile equivalents” present the percentile 


equivalents of mean scores based, of course, 
on the appropriate numbers of Ss. These per- 
centile scores were computed with reference 
to norms which have been constructed on the 
basis of approximately 650 female workers 
from the United States work force. 

For each of the 16 terminators who could 
be identified, 2 control Ss were drawn who 
were matched in terms of age, years of educa- 
tion, job level, and mother tongue. A linear 
multiple-regression analysis was then per- 
formed on this group of 16 terminators and 
32 control Ss. A linear multiple correlation 
of .45 (p < .05) was obtained. An unweighted 
sum of the five satisfaction scores was cor- 
related —.46 (‘yis) (f < .01) with the turn- 
over. 

The multiple correlation of .45 and the 
“pis Of —.46 predicting turnover indicate that 
even with turnover rates as low as 12% these 
termination decisions bear a relationship to 
the satisfaction of the workers. Further, the 
multiple R of .45 is not significantly different 
than the multiple R of .34 which was obtained 
in the 1964 analysis (Hulin, 1966). This 
indicates that a turnover rate of 12% in this 
setting does not represent the base rate which 
can be expected due to non-job-related 
variables. 


DISCUSSION 


The results of this study indicate that there 
was a decrease in turnover of 18% after the 
changes in the company’s pay and promotion 
policies were instituted. In addition, there 
was a significant increase in satisfaction with 
four out of five of the job areas. Considering 
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these results, one is tempted to conclude that 
the company program initiated in 1964 
brought about an increase in the job satisfac- 
tion of these female clerical workers, and 
that this increase led to a reduction in turn- 
over in 1966. However, since this was not a 
true experimental study (Campbell & Stanley, 
1966, refer to it as a “pre-experimental” 
design which has confounded extraneous vari- 
ables), several alternative explanations must 
be considered. 

One explanation which could account for 
the decrease in turnover without attributing 
it to increased satisfaction is history. A change 
in the labor-market conditions from 1964 to 
1966 may have occurred. It could have been 
much easier to get a job in 1964 than it was 
in 1966. If a girl was dissatisfied with her 
job, she could quit and easily find another 
job. However, an examination of the condition 
of the labor market for these 2 yr. indicates 
little change. One would expect that changes 
in the labor market would be reflected in the 
turnover rates of the other companies located 
in Montreal. The average annual turnover 
rates for the other 15 large companies in 
Montreal were 18.4%, 20.2%, and 20.0% in 
1961, 1962, and 1963. These turnover rates 
have not decreased during 1964, 1965, and 
1966. The average turnover rate among these 
15 companies was 24.3% in 1966. An explana- 
tion for the decreased levels of turnover based 
on changes in the labor market conditions 
can be rejected. 

One could explain the increases in job- 
satisfaction scores from 1964 to 1966 as a 
regression to the mean phenomenon. How- 
ever, this argument loses potency when the 
percentile equivalents of the 1964 and 1966 
satisfaction scores are examined. Only the 
nonsignificant change in work satisfaction 
(forty-fourth to forty-seventh percentile) and 
the change in satisfaction with co-workers 
(thirty-ninth to forty-seventh percentile) 
would fit this explanation. Futher, the change 
in turnover rate should not be attributed to 
a regression phenomenon since the turnover 
rate had been stable for over 10 yr. For the 
5 yr. preceding the study the turnover rate 
fluctuated between 30.0% and 31.3%. To 
argue that the rate suddenly regressed to 12% 
stretches the notion of statistical regression. 
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Another explanation which could account 
for the results of this study is that there were 
changes in the composition of the female 
clerical workers employed at this company 
from 1964 to 1966. A certain pattern of these 
employee characteristics might predispose the 
worker to remain on the job and also to be 
more satisfied with her job. This pattern could 
be more frequently found among the 1966 
clerical staff. If this explanation is correct 
then one would expect to find differences be- 
tween the composition of the 1964 and 1966 
clerical staff. However, a comparison of the 
compositions of the two staffs in terms of 
age, job level, years of education, and mother 
tongue revealed no such differences. 

It would also be difficult to postulate 
random factors which could explain the par- 
ticular pattern of results obtained. To argue 
that one set of uncontrolled factors accounted 
for the large increases (¢ ratios of 10.27 and 
10.38) in pay and promotion satisfaction, 
which were the two job aspects concentrated 
on in the program, and small increases in the 
three areas not stressed and that other un- 
controlled factors simultaneously led to de- 
creased turnover strains credibility. 

Other problems of internal validity resulting 
from this type of study design do not seem 
particularly relevant. Factors such as motiva- 
tion, instrument decay, mortality, or selection 
would not seem to account for the findings. 

The final remaining weakness of this design 
would be in the use of reactive measures 
(Campbell & Stanley, 1967; Webb, Campbell, 
Schwartz, & Sechrest, 1966). However, if the 
simultaneous reduction in turnover and in- 
crease in satisfaction are due purely to ob- 
trusive measurement in combination with an 
otherwise ineffective program, then the mil- 
lennium has indeed arrived. For if this ex- 
planation were true, then a combination of 
obtrusive measurement and any poorly de- 
signed, inappropriate program will solve turn- 
over problems. This does not seem reasonable. 

The major remaining question has to do 
with the relationship between job satisfaction 
and turnover under the conditions of reduced 
turnover rates. With a turnover rate of 30% 
in 1964, a multiple correlation of .34 was 
obtained. With a turnover rate of 12% in 
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1966, a multiple correlation of .45 and a 
biserial correlation of —.46 were obtained. 
The expectation was that a lower relationship 
between satisfaction and turnover would be 
obtained with reduced turnover levels. These 
findings indicate that the difference in the 
multiple correlation is nonsignificant. Even 
with a turnover rate of 12% a significant 
amount of the variance in individual termina- 
tion decisions is attributable to differences in 
satisfaction. 
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Using a forced-choice test of achievement motivation (v Ach), called the 
Sentence Completion Test (SCT), 100 Ss were classified into High v Ach 
and Low v Ach groups. All Ss took the Marlowe-Crowne Social Desirability 
scale and the Berdie Adjective Check List. It was found that Ss of High 
v Ach group more frequently endorsed achievement-related adjectives of the 
checklist. The correlation between SCT scores and frequency of endorsing 
achievement-related adjectives turned out to be significant even after the 
effects of social desirability and acquiescence were partialed out. The cor- 
relation between the later 2 response-set variables was found to be high. 


Implications of the results are discussed. 


The adjective checklist method has recently 
aroused some interest among workers en- 
gaged in personality research. The adjective 
checklist has some advantages over some 
other self-report techniques of personality 
assessment. It takes only a few minutes for 
most Ss to check the list even though it may 
contain as many as 200 items. A distinct ad- 
vantage of this method is that Ss usually are 
not reluctant to perform this kind of task. 
According to Gough (1955), the adjective 
checklist method has three advantages. As 
with the association technique, a wide seg- 
ment of personality can be readily measured. 
The familiar words in the list provide a 
meaningful task for the respondent. Lastly, 
the endorsement of various items assures 
analytic ease and may stimulate research 
workers to derive various scoring keys from a 
particular checklist such as Gough’s (1952) 
Adjective Check List (ACL). 

Heilbrun (1958) developed 15 rationally 
derived need scales for the ACL. In one of 
his validational studies (Heilbrun, 1959) he 
found that high-achieving Ss, as defined by 
relatively high grade-point averages (GPAs) 
with estimated intelligence held constant, en- 
dorsed more frequently those statements 
which reflected need for achievement. How- 
ever, in this investigation no attempt was 
made to control the social desirability factor 


1 The author would like to thank Chandra Kishore 
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(Edwards, 1957), although Heilburn (1958) 
himself had demonstrated earlier that fre- 
quency of endorsement of adjective need clus- 
ters is highly related to judged social desir- 
ability of the adjective characteristics of the 
need. Edwards (1953) has reported a high 
positive correlation (7 = .87) between social 
desirability value of a trait name and the 
probability that the trait will be endorsed. 
Quite recently, there have been several 
empirical and theoretical papers directed 
specifically toward the problem of possible 
relationships between social desirability and 
endorsement of trait names of closely related 
self-descriptive statements from Q-sort pools 
and inventories like the Minnesota Multi- 
phasic Personality Inventory (MMPI). A re- 
view of such studies will be found in Edwards 
(1957). Jackson and Messick (1961) found 
that nearly three-quarters of the common 
variance and more than half of the total 
variance in scores on MMPI could be at- 
tributed to social desirability and acquies- 
cence. According to Edwards, Walsh, and 
Diers (1963), “The probability of a True 
response to a personality item is a linear in- 
creasing monotonic function of the social de- 
sirability scale value of the item [p. 255].” 
Kenny (1956) determined the mean social 
desirability rating (SD) based on grouped 
data for each of the 25 trait-descriptive ad- 
jectives and found that the SD values cor- 
related as high as .81 with the probability of 
item endorsement established on the basis 
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of self-rating obtained from a_ separate 
sample. 

The data reported above strongly suggest 
the need to control the SD factor in studies 
designed to test the hypothesis of differential 
patterns of item endorsement by groups classi- 
fied in terms of a particular personality di- 
mension. Groesbeck (1958), for example, 
found that persons who were high in both 
n Achievement and n Affiliation scales, rated 
themselves significantly higher on the Effec- 
tive Intellectual Functioning scale (developed 
by Gough from MMPI items) than groups 
having a different configuration of the above 
two motives. Martire (1956) found that among 
Ss classified according to strength and gen- 
erality of achievement motivation, people with 
low need for achievement (n Ach) endorsed 
to a lesser extent a number of trait names re- 
lated to achievement, sociability, emotional 
maturity, etc., than Ss who obtained high 
n Ach scores whether achievement-related 
cues were present or not. Martire’s (1956) 
data further suggest that “each group tends 
to see as characteristic, to some degree, those 
traits which they consider important [p. 
370].” To what extent the reported differences 
between the criterion groups with respect to 
self-description are free from the effects of 
social or personal desirability of the trait 
names endorsed is a problem that needs 
systematic study. A possible difference due 
to acquiescent response set or ‘“‘Yes-saying” 
has also not been assessed so far. Any at- 
tempt to determine the relationship between 
endorsement frequency of self-descriptive 
adjectives and scores on a particular per- 
sonality dimension would have to be con- 
trolled both for SD and acquiescence. 

The present investigation is a preliminary 
effort in this direction. The primary purpose 
of the study was to test the hypothesis that 
people with high verbalized need for achieve- 
ment (v Ach) tend to endorse more achieve- 
ment-related trait names than persons with 
low v Ach even after the need to present 
oneself in a socially approved fashion and 
acquiescent response set are held constant. 


METHOD 
Subjects 


The Ss used were 60 undergraduate and 40 
postgraduate students between the ages of 18 and 
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25 yr. The sample consisted of 37 males and 63 
females studying in different colleges and graduate 
departments of Patna University, Patna, India 
during the 1964-65 session. The mean age of the 
sample was 20.39 yr. The Ss were all unpaid volun- 
teers. 


Instruments 


The adjective checklist used in this investigation 
was the Femininity Adjective Check List (BACL) 
developed by Berdie (1959). This checklist contains 
a total of 148 adjectives printed in two columns 
with space next to each item for checking if S 
thought that particular adjective applied to him. 
Berdie (1959) has derived a femininity scale on 
the basis of Ss’ endorsement of 61 items included 
in the list of 148 adjectives. The test-retest re- 
liability of the femininity scale, as reported by 
Berdie (1959), is .81 for a sample of 95 men. 
Out of these 148 items of BACL, only 21 adjectives 
were regarded unanimously by 10 graduate students 
in psychology to be directly related to different 
aspects of achievement motivation. Endorsement of 
each of these 21 items by S was credited with 1 
point. 

The v Ach scores were obtained from the 
Sentence Completion Test (SCT) which was de- 
veloped by Mukherjee (1964) to assess specifically 
consciously recognized achievement values among 
American college students. The SCT has also been 
found usable for Indian college students (Mukherjee 
& Verma, 1966) and has recently been used in a 
number of studies conducted in India. Details con- 
cerning its construction, administration, and various 
psychometric properties are reported by Mukherjee 
(1964). Basically, the SCT consists of 50 forced- 
choice triads (one item reflecting achievement-related 
sentiment and the other two pertaining to other 
aspects of manifest needs), selected in such a 
manner as to minimize the SD factor. Examples 
of the forced-choice triads included in SCT appear 
elsewhere (Mukherjee, 1968). The S’s score on SCT 
is the number of times he has chosen a statement 
reflecting any 1 of the following 10 aspects of 
achievement motivation: (a) hope of success, (bd) 
fear of failure, (c) high standard of excellence, (d) 
sense of competition, (e) optimism, (f) perseverance, 
(g) concern for creative work, (kh) preference for 
difficult and challenging tasks, (7) interest in making 
future plans, and (j) identification with a successful 
authority. The test-retest reliability of the SCT after 
an interval of nearly 2 mo. has been found in various 
studies to range from .71 to .83. The test-retest cor- 
relation has been found to be almost .75 for a sample 
of 100 Indian male graduates to whom the test 
was readministered after a month and a _ half 
(Mukherjee & Sinha, 1967). Previous studies 
(Mukherjee, 1964, 1965a, 1965b, 1965c, 1968) have 
demonstrated the concurrent and construct validities 
of the SCT as a measure of v Ach. A person high 
in v Ach, as measured by the SCT, is one who by 
word habitually indicates a desire to compete suc- 
cessfully with a standard of excellence. There is 
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some indirect evidence that the SCT also measures 
to some extent the degree of fear for failure 
(Mukherjee & Sinha, 1967; Pande & Botkawar, 
1967). 

The tendency to respond in a socially approved 
manner was measured by a very slightly modified 
form of the Marlowe-Crowne Social Desirability 
scale (M-C SD scale). The M-C SD scale (Crowne 
& Marlowe, 1960) consists of 33 items which were 
so designed following the Lie scale approach that 
they do not have any psychopathological implica- 
tions. The scale is supposed to measure a “need for 
social approval.” Scores on this scale have been 
correlated with methodologically independent, ob- 
jective, life-situational or laboratory measures of 
personality in a number of studies reviewed re- 
cently by Crowne and Marlowe (1964), and the 
data suggest that the approval motive measured 
by the M-C SD scale emanates from a more basic 
need to maintain and defend an idealized version 
of the self (Crowne & Marlowe, 1964, p. 190). 

The SD scale administered to Ss of the present 
investigation was slightly different from the original 
M-C SD scale in the wordings of only two items. 
The only major modification was with respect to 
Item 27 which now reads “I never act unless I 
have fully considered the possible consequences.” 
The item in the original form appears as “I never 
make a long trip without checking the safety of 
my car.’ There was also a slight change in the 
wording of Item 29. Instead of “the urge to tell 
someone off,” the modified version used “the urge 
to rebuke someone.” The items of the modified 
form of the M-C SD scale have been found to be 
internally consistent (Mukherjee, 1967). 


Procedure 


Mimeographed copies of the SCT were dis- 
tributed in class to all Ss. The Ss were urged to fill 
out the SCT answer sheets in their leisure time and 
return them within a week. All Ss turned in the 
answer sheets before time. They filled out the 
questionnaire in leisure time in accordance with 
the instructions contained on the first page of the 
booklet. The SCT answer sheets were scored 
manually following a predetermined scoring key 
developed on the basis of the discrimination index 
calculated for each item. If S encircled the alterna- 
tive of a particular triad which had the highest 
discrimination index, he received 1 point for that 
item of the SCT, otherwise zero. Fifty Ss who 
scored 20 or above in the SCT were classified as 
High v Ach group and the remaining scoring 19 
or less formed the Low v Ach group. There were 
30 females in the High group and 33 in the Low 
group. The same 100 Ss who made up the High 
and Low groups also completed the Berdie Ad- 
jective Check List (BACL) approximately 3 wk. 
after they filled out the SCT. They were in- 
structed to check those adjectives which they con- 
sidered most descriptive of themselves. They were 
urged to be candid and honest in their self-evalua- 
tion. After 3 wk., all Ss were given the modified 
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M-C SD scale with standard instructions, that is, 
the directions suggested by Crowne and Marlowe 
(1964, p. 23). 

In the present study, “acquiescence” was con- 
ceptualized as a general tendency to agree but was 
measured in terms of the frequency of accepting 
many heterogeneous characteristics as descriptive of 
the self. The method used to assess acquiescent re- 
sponse set or “yes-saying’” was to sum the number 
of adjectives endorsed by each S. For each S a fre- 
quency count was also made of the total number 
of achievement-related adjectives endorsed, and this 
measure was correlated with scores on the M-C SD 
scale and the SCT. 


RESULTS 


The relative frequency (proportion) of 
endorsement for each of the 21 achievement- 
related trait names is given separately in 
Table 1 for the two criterion groups classified 
on the basis of SCT score. Phi correlations 
estimated for each of these items following a 
nomograph given in Guilford (1954, p. 431) 


TABLE 1 


PROPORTION OF SUBJECTS IN HicH AND Low v AcH 
Groups EnporsING Various ACHIEVEMENT- 
RELATED TRAIT NAMES AND OTHER 
RELATED STATISTICS 























Proportion of 
endorsement Phi 
Adjective coefi- 2 : 
. statistic 
High | Low | Cent 
v Ach | v Ach 
Ambitious 74 52 .26 DOs 
Assertive 50 28 .20 Dion 
Bold 58 40 a7 1.82 
Capable .60 .26 34 3.43** 
Confident .64 32 30 Sheol 
Determined .50 28 19 Dene 
Enterprising 42 18 a) 2.62* 
Forceful .16 04 .20 2.00* 
Foresighted 34 16 22 2.08* 
Imaginative .88 62 18 1.98* 
Independent 38 .16 “25 2.47* 
Industrious 44 BLD, 28 eae 
Initiative 40 18 ez 2.42* 
Painstaking 48 .22 aT Dear 
Persevering 56 28 DiI DEOoia 
Serious 56 24 BSL See 
Sincere 84 oy BSD 3.64** 
Spontaneous 54 28 EoD) 2000s 
Steady 76 48 aii 2.88** 
Straightforward .64 34 .26 3.04** 
Vigorous 18 10 08 Tel 
*p<.05 
**p <.01 
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TABLE 2 


Propuct-MoMENT INTERCORRELATIONS AMONG 
MeEAsureES OF Vv Acu, SocrAL DESIRABILITY, 
ACQUIESCENCE, AND FREQUENCY OF 
ENDORSING ACHIEVEMENT-RELATED 
Trait NAMES 











Endorsement Total 
of achieve- ete M-C 
Variable ment- d a iD) 
orse 
related scale 
au ments 
adjectives 
Sentence Com- 
pletion Test 56 09 mls) 
Frequency of 
endorsement of 
achievement 
adjectives MT 20 
Total no. endorse- 
ments of adjectives 67 
Note.—N = 100. r = .254 required for significance at the 


-O1 level. For .05 level, the required value is .195. 


appear in the third column of Table 1. A 
critical ratio test based on Z statistic was 
also run for each item in order to test the 
significance of the differences between two 
independent proportions of item endorsement. 
The results of the test of significance (Z 
statistics) are shown in the last column of 
the same table. 

In Table 2 are given the product-moment 
intercorrelations among the major variables 
of the present study. From the intercorrela- 
tions, a few first-order partial correlations 
were also computed and they are as follows: 
112.3 = .99, 112.4 = .95, Ti3.4 = —.014, ro3.4 = 
.049, the first two of which are significant at 
the .01 level. In order to determine the 
intrinsic relationship between achievement 
values (v Ach) and frequency of endorsing 
achievement-related traits, a second-order 
partial correlation was computed in which the 
effects of acquiescence as well as social de- 
sirability were eliminated. The obtained 
coefficient 712.34 turned out to be .56 which 
is significant at the .01 level. 


Discussion 


The main purpose of the study was to ex- 
amine whether or not persons classified as 
having higher achievement values (v Ach) 
endorse achievement-related traits more fre- 
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quently than those who were found to have 
a low achievement need. The findings of the 
investigation clearly demonstrate that for 
purposes of self-description the High v Ach 
group chose more frequently the following: 
(a) ambitious, (5) assertive, (c) capable, 
(d) confident, (e) determined, (f) enter- 
prising, (g) forceful, (%) foresighted, (7) 
gracious, (j) imaginative, (k) independent, 
(1) industrious, (7) initiative, (7) painstak- 
ing, (0) persevering, (~) rational, (q) serious, 
(r) sincere, (s) spontaneous, (¢) steady, (2) 
straightforward, and (v) thoughtful. The Ss 
of the Low v Ach group described themselves 
more frequently as “slow” and “submissive” 
than Ss of the High v Ach group. 

In a previous self-rating study, Mukherjee 
(1965c) also found that Ss having high 
achievement values tended to rate themselves 
significantly higher on items pertaining to 
perseverance, industriousness, and diligence. 
No significant differences were found with 
respect to dimensions such as sociability, intel- 
ligence, emotional stability, etc. The findings 
of the present study are therefore much in 
line with the previous results. It is generally 
believed that persons with high need for 
achievement tend to be independent and this 
is supported by the study reported by Cran- 
dall, Katkovsky, and Preston (1960). Winter- 
bottom (1958) has also reported some data 
which suggest that high achievement motiva- 
tion is associated with early maternal en- 
couragement for independence training. Such 
a relationship may also be true for achieve- 
ment values (v Ach). Thus, the higher fre- 
quency of endorsement on the part of the 
High v Ach group of trait names such as 
independence, initiative, and spontaneity as 
found in the present study indicates more 
or less the same results established in the area 
of need for achievement (n Ach). 

It is interesting, however, to note that 
the differences between the two criterion 
groups did not turn out to be statistically 
significant so far as their endorsement of the 
following trait names is concerned: active, 
anxious, bold, conventional, dependent, vigor- 
ous, etc. Mukherjee and Sinha (1967) re- 
ported a significant negative relationship be- 
tween v Ach and manifest anxiety. It is pos- 
sible that the connotations attached to these 
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adjectives by the Ss of the present study may 
be at variance with the conceptual meaning 
of the respective dimensions employed in 
deriving the various tools used in the studies 
reported above. 

The first important observation emerging 
out of the present study is that the persons 
having high scores on the SCT endorsed sig- 
nificantly more of the total 21 achievement- 
related items of the checklist than did those 
who scored low on the SCT. The correlation 
between SCT scores and frequency of en- 
dorsing achievement-related items also turned 
out to be significant even after the effects 
of both acquiescence and SD were partialed 
out, as was hypothesized. 

In the light of the present confusion over 
what it is that personality assessment in- 
struments are measuring, it is relevant to 
question whether the significantly higher en- 
dorsement of achievement-related traits by 
the High v Ach group reflected basic person- 
ality differences or were merely artifacts of 
response styles. Christie and Lindauer’s 
(1963) recent review points out that most 
existing scales and inventories are affected 
by both SD and tendency to agree. The re- 
sults of the present study clearly show that 
acquiescent response set does not seem to 
strongly influence scores either on the BACL 
or on the SCT. In so far as SD is concerned, 
it has already been reported that the SCT 
and the M-C SD scale are essentially un- 
correlated (Mukherjee & Sinha, 1967). The 
correlation between the two variables also 
turned out to be insignificant in the present 
study. It seems then that the attempt to 
minimize the SD factor in the process of con- 
structing the SCT has not been futile. 

Thus, one implication of the findings of the 
present study is that achievement-oriented 
self-perception and acquiescence set are not 
highly related. The study also reveals that 
the correlation between SD and tendency for 
“ves-saying” is high enough (cf. Solomon & 
Klein, 1963) to support the contention of 
Webster (1962) that “these two ‘dimensions’ 
are functionally integrated into a larger 
syndrome within personality [p. 789].” 

The results also indicate that, as a group, 
the achievement-oriented assessments college 
students make of themselves are not very 
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different from the ones made by an objective 
test of personality. This suggests that the 
BACL is an instrument sensitive enough to 
differentiate between those with high achieve- 
ment values and those with low achievement 
values. 


The present study also affords additional 
evidence of the construct validity of the SCT 
as a measure of achievement orientation by 
relating individual differences in value achieve- 
ment to preference for endorsing achieve- 
ment-related adjectives on the Berdie Check 
List. The findings in general support the 
hypothesis proposed: that what has been 
described as typical achievement value is a 
significant variable affecting one’s preference 
with regard to self-descriptive trait names 
having achievement connotations. In conclu- 
sion, the results indicate that the forced- 
choice measure of verbalized need for 
achievement called SCT can not only success- 
fully isolate the presumed differential strength 
of achievement values in groups but can also 
successfully predict the probability of en- 
dorsement of achievement-related trait names 
for a group even when the SD factor operates. 
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EFFECTS OF SIGNAL AND STIMULUS RATES ON 
VIGILANCE PERFORMANCE 
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2 experiments investigated the relative effects of critical signal rate, normal 
noncritical signal rate, and critical signal probability on detection in a visual 
vigilance task. In all conditions Ss had to monitor a clock display for a period 
of 54 min. The results of both experiments suggested that normal signal rate 
may be of greater importance in a vigilance situation than either critical signal 
rate or critical signal probability. In general, both percentage of detections and 
percentage of stimuli falsely responded to were inversely related to normal 


signal presentation. 


Although numerous experiments have in- 
dicated that probability of detection is di- 
rectly related to critical signal rate, recent 
studies of vigilance behavior have suggested 
that additional rate variables, such as non- 
critical signal and total stimulus rates, may be 
of equal or even greater importance in de- 
termining vigilance performance. Colquhoun 
(1961) observed that many of the studies 
which found a direct relationship between 
performance and frequency of critical signal 
presentation (Jenkins, 1958; Kappauf & 
Powe, 1959) had utilized tasks in which an 
increase in the number of critical signals was 
simultaneously accompanied by a decrease 
in normal or noncritical signals. Accordingly, 
Colquhoun investigated the separate effects of 
both critical and normal signal rates and 
concluded that detection efficiency was de- 
termined by the probability that a stimulus 
will be a critical signal, that is, number of 
critical signals divided by the total number 
of stimuli presented. Subsequent studies 
(Jerison & Pickett, 1964; Jerison, Pickett, & 
Stenson, 1965; Johnston, Howell, & Gold- 
stein, 1966) have confirmed the general find- 
ings of Colquhoun by obtaining differences in 
performance which were directly related to 
critical signal probability. On the other hand, 
the results from a recent study by Jerison 
(1965) have suggested that the main variable 
affecting performance may be the number of 
normal signals (noncritical) and not critical 
signal probability. Jerison’s (1965) findings 
indicated that under certain conditions per- 


1 The author is also at the State University of 
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centage of detections decreased as the num- 
ber of noncritical signals increased, even 
where critical signal probability was equal. 

Since each of the above studies (Colquhoun, 
1961; Jerison et al., 1965) utilized only two 
or three groups and thus only two comparison 
levels of each of the relevant variables, the 
discrepancy between findings might be due to 
the particular levels which were investigated. 
Thus, the present experiments were designed 
to provide further insight into the relative 
effects of critical signal rate, normal signal 
rate, and critical signal probability. This was 
accomplished in Experiment I by investigating 
the effects of critical signal and total stimulus 
rates at various signal probabilities and in 
Experiment II by using a constant critical 
signal probability level derived from several 
combinations of critical and noncritical signal 
rates. 


EXPERIMENT [| 
Method 


Subjects. Data are reported on a total of 72 
undergraduate males who participated as part of 
the requirements of the introductory course in 
psychology at Syracuse University. Data from nine 
additional Ss were discarded due to equipment 
failure (V=4), not responding (V=3), or con- 
tinuous random responding (N = 2). 

Apparatus. The visual display was a Type S-1 
Standard Electric clock with a 1-rps sweep rate 
which had been modified as follows: (a) the in- 
strument face was reversed and painted white; (6) 
clock face diameter was reduced from 34 to 2% in. 
by painting the outer rim of the glass display sur- 
face white; and (c) the black sweep hand was 
painted white toward the center of the display, 
leaving only a 3-in. black portion at the tip. 
All three modifications were based on preliminary 
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testing and were designed to increase the difficulty 
of the vigilance task. 

As in other clock-type vigilance tasks, pulses 
were presented at fixed rates which allowed the 
sweep hand to move or jump a specified distance. 
The normal or noncritical signal consisted of a 9° 
movement, while the critical signal to be detected 
by S had a sweep-hand movement of 20°. Thus, 
S’s task was to detect a “double jump” which was 
actually 2.2 times the distance of the normal non- 
critical signals. Detection responses were to be made 
on a telegraph key located directly in front of S. 

Two identical clocks were modified and placed 
in adjoining rooms to allow for simultaneous testing 
of two Ss. Each clock was positioned at one end 
of a table with an acoustically tiled wall behind it. 
The S was located at the other end of the table 
approximately 4% ft. from the clock. Further, to 
reduce possible noise and vibration transmission, 
each clock was placed upon acoustic-absorbing 
material. Finally, to insure complete acoustic isola- 
tion and a uniform acoustic environment, Ss wore 
earphones and received 75 db. of white noise through- 
out the testing session. 

All programming and recording equipment were 
located in an adjoining control room. The rate of 
stimulus presentation was controlled by Grason- 
Stadler Programming modules, while critical signal 
rate and onset were controlled by a Gerbrands tape 
programmer. An Esterline Angus 20 pen Model 
recorder and a Grason-Stadler Print-Out counter 
were used to record each signal and each S’s re- 
sponse. Formal instructions and white noise were 
prerecorded and played back through Ss’ earphones 
by a Wollensak tape recorder. 

Signal schedules. Stimuli were presented at two 
fixed rates of 30 and 60 stimuli per minute (ie., 
normal plus critical signals), providing a total of 
1,620 and 3,240 stimuli, respectively, for the 54-min. 
session. The preprogrammed film tapes provided 
that 27, 54, or 108 of the stimuli presented in each 
test session were critical signals while the rest were 
normal signals. Each program tape was set up with a 
time base of 2 min., such that within each critical 
signal rate the same number of signals were presented 
in each 2-min. block, that is, one, two, or four 
critical signals per 2 min. Critical signals were pro- 
grammed randomly within the restrictions imposed 
by the 2-min. blocks. Table 1 presents the six 
combinations devised from the three critical signal 
and the two total stimulus rates and, for each 
condition, further illustrates the probability of a 
stimulus being a critical signal. 

Procedure. The Ss were randomly assigned to 6 
groups with 12 Ss per group. Each group served un- 
der a different combination of total stimulus rate 
and critical signal rate. All Ss served for 54 min., 
thus providing a measure of performance for three 
18-min. time blocks. 

At the beginning of the testing session, Ss were 
given informal instructions which drew attention to 
the clock and key and which provided stimulus 
samples. The Ss continued to receive sample stimuli 
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TABLE 1 


CRITICAL SIGNAL PROBABILITY FOR EACH OF 
THE 6 EXPERIMENTAL CONDITIONS 


Critical signals 


Total stimuli 
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1620 .0166 0333 .0666 
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until they could discriminate between the normal 
and critical signals. At this time Ss were told to 
put on their earphones and were given the formal 
pretaped instructions for the vigilance task. The 
formal taped instructions provided further examples 
of the normal and critical signals and instructed Ss 
that their job was to press the telegraph key as 
soon as a Critical signal was detected. The Ss were 
further informed of the white noise and were re- 
quested to remain seated and to continue the task 
until E entered the room to terminate the session. 

All watches were removed before the testing 
session began so Ss did not receive any time cues, 
although they did know that the session would last 
“about an hour.” 


Results 


A single response-time criterion was used 
for all groups even though the duration of the 
interval between successive stimuli was in- 
versely dependent upon rate of stimulus 
presentation. Since Ss tended to pace their 
responses to the off-intervals, the single cri- 
terion was based upon the duration for the 
groups with the slowest stimulus rate. Thus, 
a response to a critical signal was considered 
as being correct only if it occurred within a 
2-sec. interval following signal presentation. 
All responses which occurred outside of this 
interval were considered as false detections. 

Correct detections. The number of critical 
signals detected by each S during each of the 
three 18-min. time periods was converted to 
percentage correct by dividing the number 
of signals detected by the number of signals 
presented. A Hartley’s test of homogeneity of 
variance (Winer, 1962) was run using the 
between-groups effects of critical signal rate 
and total stimulus rate and indicated that the 
assumption of homogeneity was tenable and 
that all groups could be run in a single anal- 
ysis of variance. The analysis of variance of 
the data indicated that the only significant 
effects were total stimulus rate, F (1/66)= 
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14.25, p < .01, and time periods, F (2/132) 
= 36.62, p < .01. No consistent differences in 
performance were attributable to critical 
signal rate or to the interactions between the 
variables. 

Figure 1 presents mean percentage of cor- 
rect detections for each of the six groups 
during each of the three 18-min. time periods. 
The main effect of time periods is clearly 
shown by the fact that all groups made less 
detections in each successive period. Similarly, 
the data underlying the significant effect of 
total stimulus rate indicate that performance 
was poorer with the faster rate of stimulus 
presentation as compared to the slower rate. 
The lack of interaction between these variables 
suggests that, in general, the increase in total 
number of stimuli did not affect the rate of 
vigilance decrement but, in effect, lowered 
performance in all time periods. 

Of further relevance to the evaluation of 
the various rate parameters in vigilance per- 
formance are the findings that, although in 
the predicted direction, the effects due to 
critical signal rate were not sufficiently large 
or consistent to achieve statistical significance. 
These findings plus the significant effect of 
total stimulus rate suggest that the rate of 
noncritical or normal signal presentation may 
be the most important of the variables under 
question. That is, the combined results in- 
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Fic. 1. Percentage of detections as a function of 
critical signals, total stimuli, and 18-min. time 
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dicate that a fourfold increase in critical sig- 
nals and signal probability with a relatively 
small decrease in normal signals produced no 
consistent effect, whereas holding critical 
signals constant and varying the number of 
normal signals produced a significant effect. 
Finally, even in conditions of equal prob- 
ability (see Table 1), differences appeared as 
a function of normal signal rate. 

False detections. The number of false detec- 
tion or commission responses varied con- 
siderably with individual Ss making as few 
as 1 or as many as 320 commissive errors. 
Further, 25% of the Ss made 65% of the 
total false detections. Thus, high variability 
obscured any general between-group trends. 
The only trend which was consistent was 
that commissive errors decreased with time, 
Tia 52s) == lOcie eae Ls 

Conversion of the false-report data into 
percentages (i.e., number of commissive 
errors divided by number of normal stimuli 
that could be responded to with the 2-sec. 
detection criterion) further suggested that 
the proportion of normal signals responded to 
was inversely related to total stimulus rate, 
F (1/66)= 9.46, p < .01. Mean percentages 
of responses were 5.68% and 2.57%, re- 
spectively, with the slow and fast stimulus 
rates. No other trends were consistent 
except the decrement with time (as noted 
above). In general, the results closely parallel 
the findings for percentage of correct detec- 
tions; that is, the general performance trends 
are identical with a greater percentage of 
normal signals being responded to in those 
conditions which also led to the highest levels 
of correct detections. 


EXPERIMENT II 


The findings of Experiment I suggested 
that noncritical signal rate may be of greater 
importance in a vigilance situation than either 
critical signal rate or critical signal prob- 
ability. Since all three factors could not be 
compared directly in Experiment I, a second 
experiment was run to further evaluate the 
relative effects of each of the variables. This 
was accomplished at various stimulus rates 
by simultaneously varying the number of 
normal and critical signals in a manner which 
maintained a single critical signal probability 
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level. Three predictions could be made with 
this design: (1) if probability level is the 
most important factor, then rate of stimulus 
presentation should have no effect upon per- 
formance; (2) if noncritical signal rate is the 
most important variable, then detection per- 
formance should vary inversely with rate of 
stimulus presentation; and (3) if critical 
signal rate is of greatest importance, then 
performance should vary directly with stim- 
ulus presentation rate. 


Method 


The Ss consisted of 20 male and 20 female under- 
graduate students at Syracuse University. Data 
from seven additional students were discarded due 
to equipment failure (V =6) or refusal to complete 
the session (V = 1). 

Five male and five female Ss were randomly as- 
signed to four stimulus rate groups which were 
derived from different combinations of normal 
signal rate and critical signal rate. These combina- 
tions, which will be written in the form “critical 
signal rate/normal signal rate,” are 216/6264, 108/ 
3132, 54/1566, and 27/783. All combinations have 
the identical signal probability of .033; that is, the 
ratio of total critical signals to total stimuli is 
.033 in all four combinations. 

The basic apparatus, task instructions, and 
methods of making the stimulus tapes were identical 
to those used in Experiment JI. In all cases, S’s 
task was to depress the telegraph key upon detec- 
tion of a critical signal. Similarly, all Ss served 
for a total of 54 min. 


Results 


As in Experiment I, the criterion for a 
correct detection response was determined by 
the group which had the longest duration be- 
tween the presentation of succeeding stimuli, 
that is, Group 27/783 with 810 total stimuli 
per 54 min. and 1 every 4 sec. Thus, in the 
present experiment the cutoff time was 4 sec., 
and all responses were counted as being cor- 
rect if made within 4 sec. after a critical 
signal was presented. All other responses were 
considered as being false detections. 

Correct detections. The Hartley test in- 
dicated homogeneous variances for the four 
experimental groups. Analysis of variance of 
percentage of detections indicated that stim- 
ulus rate groups, F (3/32)= 6.41, p< .01, 
time periods, F (2/64)= 18.52, p < .01, and 
the Time Periods X Sex interaction, F (2/64) 
= 3.29, p< .05 were the only significant 
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TABLE 2 


PERCENTAGE OF CRITICAL SIGNALS CORRECTLY DE- 
TECTED AND PERCENTAGE OF NORMAL SIGNALS 
WHICH RECEIVED COMMISSION RESPONSES 








Group Correct detections} False responses 
216/6264 43.52 1.21 
108/3132 44.64 1.45 

54/1566 72.62 8.99 

27/783 61.86 fas 





effects. As in Experiment I, the interaction 
between time periods and stimulus rate was 
not significant. 

Table 2 illustrates the data underlying 
the findings for the stimulus rate groups and 
indicates that performance was inversely re- 
lated to the rate at which the normal or 
noncritical signals were presented. More 
specifically, the table suggests the two slowest 
critical signal/normal signal combinations led 
to better performance than obtained with the 
two faster rates. Further, the lack of in- 
teraction with time periods suggests that these 
differences were apparent in all three 18-min. 
blocks. Figure 2 presents the data for the 
Time Periods X Sex interaction and suggests 
that although there was a general decrement 
in performance over time, the rate of decre- 
ment was greater for males than females. 

False detections. The Ss exhibited a large 
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Fic. 2. Percentage of detections for male and female 
subjects as a function of 18-min. time periods. 
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amount of variability in the number of false 
detections, with 20% (eight Ss) making 65% 
of the commissive errors. Further, although 
the number of commissive errors for the 54/ 
1,566 group was more than twice that of the 
other groups, this difference is attributable 
to the error total of one S, which, in fact, 
was equal to or greater than the total number 
of errors for each of the other three groups. 
Thus, it did not appear that the groups 
differed in terms of number of false detections. 
Finally, although number of false reports 
appeared to decrease with time, an analysis 
indicated that this decline was not significant. 

Conversion of number of false reports into 
percentages by dividing by the total number 
of normal signals which could be responded 
to with the 4-sec. response criterion indicated 
that stimulus rate groups was a significant 
effect, F (3/36)= 3.75, p< .05 with this 
measure. That is, since the number of com- 
missions appeared to be similar, the fewer 
the number of signals presented, the higher 
the percentage which was responded to by S. 
Table 2 indicates that the general trend of 
these data closely resembles that for correct 
detections. 


DIscussION OF EXPERIMENTS I AND IT 


In general, the present results suggest that 
both percentage correct detections and _ per- 
centage of stimuli falsely responded to are 
inversely related to the number of normal 
signals which are presented in a_ session. 
Further, the direction of the results in both 
experiments indicates that under the present 
conditions neither number of critical stimuli 
nor the probability of a stimulus being a 
critical signal had any significant effect upon 
detection performance. Taken together, these 
findings cannot be explained by either the 
expectancy or the arousal theories which 
would lead to predictions which are op- 
posite to the present results. That is, predic- 
tions from arousal theory would lead one to 
expect less arousal and thus poorer perform- 
ance with low rates of normal signal presenta- 
tion. Similarly, the use of an expectancy hy- 
pothesis would predict, in Experiment IT, 
overall detection performance to be directly 
related to critical signal rate and amount 
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of decrement over time to be inversely re- 
lated to critical signal rate. The only ex- 
planation which appears to be compatible 
with the present results is a decision-theory 
model of vigilance which suggests that detec- 
tion performance is directly related to ob- 
serving responses which are, in turn, de- 
pendent upon the various signals rates (Jeri- 
son, 1965; Jerison & Pickett, 1963, 1964). 
Although the present results are not in agree- 
ment with the suggestion that signal prob- 
ability is the most effective variable, they are 
in agreement with the assumption that the 
cost of an observing response increases di- 
rectly with normal signal rate, while the 
expected value of that response varies in- 
versely (Jerison, 1965). If this latter as- 
sumption can be accepted then the present 
results for both percentage of correct and per- 
centage of false detections can be explained 
by a decrease in observing responses, attention 
to the display, and detection responses due 
to an increase in the number of normal signals 
with a consequent increase in cost of ob- 
serving and decrease in value of observing. 
Although the present findings are con- 
trary to expectations from the study by 
Colquhoun (1961), a number of possible dif- 
ferences were present in the experiments and 
might account for the discrepancies in re- 
sults. Colquhoun’s use of a slow normal 
signal rate was one possible difference, al- 
though Jerison (1965) found that almost all 
signals were detected with few normal signals 
regardless of the probability. A second pos- 
sible reason might be due to Colquhoun’s 
training conditions which, in contrast to the 
present study, provided pretest practice trials 
using the same signal probabilities as were 
experienced in the testing session. A third 
difference and one of possible greater im- 
portance is the fact that Colquhoun actually 
utilized three types of stimuli with the third 
being a blank display. This third type of 
stimulus, in effect, informed Ss that neither 
a critical nor noncritical signal had appeared 
and thus may have affected the nature of 
responses. The final and probably most likely 
contributor to the discrepancy is that 
Colquhoun’s single comparison of equal-signal 
probabilities, which produced no significant 
differences in percentage of detections, was 
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made using 0.5 as the probability level 
whereas the highest probability of a stim- 
ulus being a critical signal was .06 in the 
present experiments and .04 in the studies 
by Jerison (1965). In view of the large dif- 
ference in signal probabilities, it appears pos- 
sible that dual processes may be present such 
that the normal signal rate may be the deter- 
miner of performance at low probabilities, 
while at high probabilities either (a) the 
probability itself determines performance, or 
(b) detection performance has reached an 
asymptote and is not affected by further 
changes in rate or probability. These factors 
require further research before any general 
conclusions may be drawn. 

Finally, the effect of males versus females, 
although of lesser importance than the other 
results, suggests that females are better able 
to maintain performance over a long period 
of time. This interaction is in general agree- 
ment with the results of a study by Whitten- 
burg, Ross, and Andrews (1956), although the 
exact shape of the interactions is different due 
to different time measurements. That is, the 
present study suggested that males were 
superior in the first 18-min. period with a 
reversal occurring in the second and third 
period, whereas Whittenburg et al. using 
half-hour time periods found no differences 
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during the first hour with the females being 
superior to male Ss during the last hour. 
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PARTICULARISM AND ORGANIZATIONAL POSITION: 
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The Stouffer-Toby Role Conflict Scale, a measure of particularism (the value for 
institutionalized obligations of friendship), was administered to 230 employees 
in 13 bank branches in Mexico and the United States. Findings suppert the 
hypothesis that for both bank officers and line employees Mexicans are sig- 
nificantly more particularistic than Mexican-Americans, who in turn are sig- 
nificantly more particularistic than Anglo-Americans. The hypothesis that, by 
ethnic group, officers are less particularistic than line employees was partially 
supported. The cultural development of particularism and the potential impact 
of that value upon organizational behavior are discussed. Some suggestions are 
offered to management and to researchers concerning the importance of under- 
standing the interactions among specific value orientations and _ specific 
organizational behaviors, and concerning the implications of such interactions 
for employee satisfaction and organizational integrity. 


Rapid industrialization of “underdeveloped” 
countries, continued expansion of international 
business operations, and increased use of 
indigenous labor by foreign companies 
heighten the relevance of cross-cultural stud- 
ies of individual behavior in complex organ- 
izations. Though such studies are relatively 
few, their findings invariably indicate the 
pervasive influence of cultural values upon, 
for example, the pace and pattern of company 
development, the emergence of management 
and supervisory styles, and the varieties and 
intensities of motivation for, attitudes toward, 
and alienation from work (Fayerweather, 
1959; Haire, Ghiselli, & Porter, 1963; Mc- 
Clelland, 1961; Whitehill, 1964; Whyte, 1963; 
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Science Foundation through a Cooperative Graduate 
Fellowship awarded to the author, and by the Na- 
tional Institute of Mental Health, Grant No. 5-R11- 
MH-544-2, Arnold Meadow, director. Special 
acknowledgment is made to Arnold Meadow, Sal- 
vatore Zagona, William MacKinnon, Richard Coan, 
Stuart Queen, and Jerry L. L. Miller of the Uni- 
versity of Arizona for their helpful comments. With- 
out the cooperation of John Henderson, vice-presi- 
dent of the Valley National Bank, Tucson, Arizona, 
the Banco Nacional de Mexico, Nogales, Sonora, 
and the Bureau of Business and Public Research, 
University of Arizona, this study would not have 
been possible. 

2 Now in the Department of Sociology, University 
of Texas. 
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Whyte & Williams, 1963; Zurcher, Meadow, 
& Zurcher, 1965). 

Ideally, the investigator seeking to ap- 
preciate the dynamics of cross-cultural dif- 
ferences in organizational behavior would him- 
self experience the life styles and work settings 
of his subjects (Ss), and then come to grips 
with the complex interaction between whole 
personalities and whole organizations. Con- 
siderably less ambitious, the purpose of this 
paper is to isolate a single cultural value, 
and to observe how it varies between and 
within specific work organizations. 

The term “value” is taken to mean “a 
selective orientation toward experience, im- 
plying deep commitment or repudiation, which 
influences the ordering of choices between 
possible alternatives in action [Kluckhohn, 
1961, p. 18].” Values are learned in one’s 
parent culture, and “culture” is “the dis- 
tinctive way of life of a group of people, 
their complete design for living [Kluckhohn, 
1951, p. 86].” The value to be studied in 
this paper is “particularism,” or the value 
for institutionalized obligations of friendship. 
Particularism is one component of Parsons 
and Shils’ (1959) general theory of action 
and is juxtaposed with “universalism,” or 
the value for institutionalized obligations to 
society (pp. 76-77). According to Parsons 
and Shils, before acting in a specific social 
situation the individual will be influenced by 
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a particularistic or a universalistic orienta- 
tion, and that orientation will have been 
culturally determined (p. 79). Using varying 
vocabularies, several other social scientists 
have, based upon their comparative observa- 
tions of societies in differing stages of social 
complexity, conceptualized dichotomies or 
continua bounded on one side by a predomi- 
nant value for concrete interpersonal ties and 
on the other by a predominant value for 
abstract societal expectations (Becker & 
Meyers, 1942; Lewis, 1951; Loomis & Beegle, 
1950; Miner, 1952; Redfield, 1941). Mem- 
bers of the less complex, more “primitive” 
societies are reported to tend toward par- 
ticularism, members of complex societies 
toward universalism. 

In that context, Mexico has been described 
as manifesting (modally) a cultural value for 
particularism. Parsons and Shils (1959), 
Lewis (1951), Redfield (1947), and Cohen 
(1961) report the influence of the Mexican 
social structure, particularly the extended 
kinship system, upon the individual’s assess- 
ment of events in his daily life. Government, 
laws, and other abstract societal expectations 
are observed to be distant, ephemeral, and 
untrustworthy to the Mexican, and not suffi- 
cient basis for the evaluation of other human 
beings. What seems more important is the 
depth and significance of interpersonal ties 
with blood relatives, compadres, and friends. 
Another person is accepted by the individual 
not in terms of office held, title displayed, or 
any other such abstraction, but in terms of 
immediate interpersonal experience with the 
evaluating individual. The essential questions 
are: What kind of person is he to me? Can 
he be trusted? 

On the other hand, the United States has 
been described (modally) as a universalis- 
tically oriented society. The high level of 
technology, “‘scientism,” and bureaucracy, the 
keen emphasis upon competition, achievement, 
and mobility encourage more shallow inter- 
actions with other human beings and a greater 
dependence upon collective societal definitions 
of others (Buber, 1958; Fromm, 1940; 
Riesman, 1950). The essential questions are: 
What (group, company, department, etc.) 
does he belong to? How high up is he? 
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If these observations on the two societies 
are accurate, differences in selected cultural 
values, specifically particularism, should be 
empirically demonstrable. That the present 
discussion of Mexico and the United States 
has been in terms of modal value orientations 
admits to the fact that, for example, groups 
in rural America may be more particularistic 
than groups in highly urbanized Mexico City. 
Thus, as with all cross-cultural research, 
special effort must be made to establish 
comparability of samples. In this study, com- 
parability was approached by drawing Ss 
from similar work organizations in Mexico 
and the United States. The structures of the 
work organizations were such that employees 
could be classified as “officer” or “line.” It 
was therefore hypothesized that otherwise 
comparable Mexican officers would be more 
particularistic than Mexican-American officers, 
who would in turn be more particularistic 
than Anglo-American officers. A similar di- 
rectional difference in particularism was hy- 
pothesized for line employees. 

Since the work organization was bureau- 
cratic in nature, the assumption would be 
that movement up the organizational ladder 
would be associated, by a selection or a 
socialization process, with a shift away from 
the particularistic value orientation. Some in- 
dications of that relationship can be found in 
the work of Merton (1940) and Whyte 
(1948). In this study it was therefore hy- 
pothesized that Mexican officers would be 
less particularistic than Mexican line em- 
ployees, Mexican-American officers less par- 
ticularistic than Mexican-American line em- 
ployees, and Anglo-American officers less par- 
ticularistic than Anglo-American line em- 
ployees. 


METHOD 
Subjects 


The work organization chosen as the setting for 
this study was the bank, because it provided an 
accessible and cross-culturally comparable bureau- 
cratic organization from which to draw Ss. Mexican 
and American banks are almost identical in formal 
organization and operating procedures (Beteta, 
1963), the former being based upon the structure 
of the latter. 

Three experimental groups were used, the num- 
ber of Ss for all three groups totaling 230. The 
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first group was composed of native-born, Spanish- 
speaking (unilingual) Mexican citizens employed 
by two bank branches in Nogales, Sonora. The 
second group was composed of bilingual (Spanish- 
English) Mexican-Americans employed by two bank 
branches in Nogales, Arizona (bilingualism and 
United States citizenship are conditions for em- 
ployment in these branches). The third group was 
composed of English-speaking (unilingual) Anglo- 
Americans employed by nine bank branches in 
Tucson, Arizona. All 11 United States banks are 
branches of the same parent organization. Both of 
the Mexican banks are branches of the same Mexican 
parent organization. All employees of the banks 
(up to the position of vice-president and including 
at least one vice-president for each cultural group) 
who were at work on that day were administered 
the questionnaire. The final sample included all of 
these employees, with the exception of 1 Anglo- 
American who was removed from the Mexican- 
American group, and 14 Mexican-Americans who 
were removed from the Anglo-American group. The 
employees of the Nogales, Sonora, banks were all 
Mexicans. All three groups approximated each other 
for proportion of officers (about one to every six 
line employees), for employee longevity (M/dn = 2-4 
yr.), and for age (Mdn=20-29 yr.). The ratio of 
males to females was found to be disproportionate 
among the groups (22:16 Mexican; 14:29 Mexican- 
American; 66:83 Anglo-American), but subsequent 
analysis demonstrated sex not significantly to be 
related to particularism. Officers (N=35) were 
vice-presidents, managers, assistant managers, and 
all other employees who had “officer” in their 
formal titles (i.e., loan officer, personnel officer, etc.). 
Line employees (V=195) were tellers, clerks, 
typists, secretaries, including first-line supervisors 
(ie., head teller, chief clerk, etc.). 


Measures 


As part of a questionnaire packet that contained 
biographical and employment history questions and 
some attitude toward work scales, each S was 
given the Stouffer-Toby Role Conflict Scale. The 
Role Conflict Scale is a four-item Guttman instru- 
ment designed to sound the degree of particularism 
by presenting four situations involving conflicts be- 
tween obligation to a friend and more general social 
obligations, and forcing the individual to choose 
one set of obligations or the other (Stouffer & 
Toby, 1951). For example, Item 1 states: 


You are riding in a car driven by a close friend, 
and he hits a pedestrian. You know he was going 
at least 35 miles an hour in a 20-mile-an-hour 
speed zone. There are no other witnesses. His 
lawyer says that if you testify under oath that 
the speed was only 20 miles an hour, it may 
save him from serious consequences. 

What right has your friend to expect you to 
protect him ? 
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Check one: 


——— My friend has a definite right as a friend 
to expect me to testify to the lower figure. 
He has some right as a friend to expect me 
to testify to the lower figure. 

He has no right as a friend to expect me to 
testify to the lower figure. 








What do you think you’d probably do in view of 
the obligations of a sworn witness and the obliga- 
tion to your friend? 


Check one: 





Testify that he was going 20-miles-an-hour. 
Not testify that he was going 20-miles-an- 
hour. 





The response “My friend has a definite right . . .” 
is considered to be particularistic. The response 
“He has no right . . .” is considered to be uni- 
versalistic. If the respondent marks “He has some 
right . . .” the response is scored particularistic if 
in the second part of the question he says he would 
favor the friend in action; universalistic if he says 
he would not favor the friend. Scale scores may 
vary from 0 to 4, with a score of O indicating a 
minimum and a score of 4 indicating maximum 
particularism. Only particularistic responses are 
scored, 1 point for each item. 

The Role Conflict Scale was translated into 
Spanish for use with the Mexican S$ group. Both 
the English and the Spanish scales maintained 
Guttman scalability throughout the present study. 
The coefficient of reproducibility was .924. 


Procedure 


The mimeographed questionnaire form was ad- 
ministered, either before or after customer hours, to 
the on-the-job employees by the investigator. Prior 
to the administration, the employees were gathered 
together and told by the investigator that their 
anonymous responses were considered to be con- 
fidential, would not be seen by any other bank 
personnel, and would be used only for research 
purposes. The employees were then given copies 
of the questionnaire, and went to their own work 
spaces to fill them out. When finished with the 
forms, the employees returned them to the investi- 
gator. Members of management simultaneously filled 
out the forms and did not influence the line em- 
ployees in any apparent way. The same procedure 
was used in every bank branch, and all the data 
were collected within a time span of 20 working 
days. 


RESULTS 


Table 1 presents the distribution of bank- 
officer particularism scale scores for each 
enthnic group. A Kruskal-Wallace one-way 
analysis of variance indicated that particular- 
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ism was significantly related to ethnic mem- 
bership (H = 35.08, df = 2, p < .0005). Fur- 
ther analysis by Fisher’s Exact Probability 
test (Ss who scored O versus those who did 
not) and inspection of Table 1 indicate the 
direction of the overall results: Mexican 
versus Mexican-American officers (p = .045); 
Mexican versus Anglo-American officers (p = 
.0004); Mexican-American versus Anglo- 
American officers (p = .008). Thus, as hy- 
pothesized, the particularism of the bank 
officers is influenced by cultural background— 
Mexicans significantly more particularistic 
than Mexican-Americans, who in turn are 
significantly more particularistic than Anglo- 
Americans. 

Table 2 presents the distribution of line- 
employee particularism scale scores for each 
ethnic group. As with the officers, a Kruskal- 
Wallace one-way analysis of variance in- 
dicated that line-employee particularism was 
significantly related to ethnic membership 
(H = 9.92, df = 2, p < .0005). Further anal- 
ysis by the Median Test (one-tailed) and 
inspection of Table 2 indicate the direction of 
the overall results: Mexican versus Mexican- 
American line employees (x? = 14.44, df = 1, 
pb < .0005); Mexican versus Anglo-American 
line employees (x? = 24.80, df=1, p< 
.0005 ) ; Mexican-American versus Anglo-Amer- 
ican line employees (x? = 3.9, df=1, p< 
.025). Again as hypothesized, the particular- 
ism of the bank line employees is influenced 
by cultural background—Mexicans signifi- 
cantly more particularistic than Mexican- 
Americans, who in turn are significantly more 
particularistic than Anglo-Americans. 

With ethnic groups combined, officers were 
demonstrated to be significantly less partic- 


TABLE 1 


OFFICER PARTICULARISM BY CULTURAL ORIGIN 








Scale score 
Calturalorsn i na mV 


Mexican 0 


2 2 1 0 5 
Mexican-American 4 0 0 1 1 6 
Anglo-American 2 Ae |e 0 0 OM e242 
Total 25 5 2 2 1 35 
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TABLE 2 


Line-EMPLOYEE PARTICULARISM BY CULTURAL ORIGIN 








Scale score 
Cultural origin 


Mexican 5 5 6 





7 | L05iisS 
Mexican-American yf |) 4 4 OF eo7 
Anglo-American 82 | 23) 10 8 2 ales 
Total 104 | 40 | 20 | 19 } 12 | 195 
ularistic than line employees (H = 3.56, 


df=5, p< .05). However, the hypothesis 
that officers would be significantly less par- 
ticularistic than line employees for each ethnic 
group was supported only for the Anglo- 
Americans (7 = 10.31, df= 1, p< .0005). 
Inspection of Tables 1 and 2 suggests that the 
distribution of Mexican and Mexican-Amer- 
ican scores tends toward the hypothesized 
difference between officers and line employees, 
but the small Ws, particularly for the officers, 
make interpretation tenuous and obscure 
statistical analysis. 


DIscUSsION 


The hypothesis that a single value, par- 
ticularism, varies cross-culturally for officers 
and line employees in a specific type of work 
organization has been supported. This find- 
ing encourages the conclusion that value 
orientations are central factors to be con- 
sidered in the social psychology of work, 
but raises further questions about the relation 
of such values to measurable organizational 
behavior. Zurcher et al. (1965) report find- 
ings with the present S group that indicate 
a significant positive relation between par- 
ticularism and alienation from work in the 
bank, which is interpreted to be a universal- 
istic organization. The same authors point 
to an active social club in the Mexican- 
American S banks and suggest that those 
informal ties may be the result of particular- 
istic needs. McClelland (1961) reports Mex- 
ican businessmen to manifest a significantly 
higher need for affiliation (a particularistic 
parallel) and a significantly lower need for 
achievement than do Anglo-American busi- 
nessmen. Fayerweather (1959) observes that 
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Mexican businessmen seem to be more con- 
cerned with the maintenance of personal al- 
liances than with bureaucratic efficiency and 
if forced to choose between those alternatives 
will choose the former. Whyte and Williams 
(1963, p. 6) conclude that the Peruvian 
worker “looks upon the supervisor as a total 
human being, without thinking about the 
various aspects of his job performance,” 
and that this “fits in with the emphasis upon 
the person and personalism” that has been 
noted in Latin American culture. 

Further research, certainly interdisciplinary 
in nature, is needed to delimit more specifi- 
cally the degree to which particularism or 
other values vary cross-culturally and the 
impact they have upon organizational be- 
havior—that is, performance, supervision, 
policy formulation, absenteeism, turnover, 
etc. Especially useful would be data illustrat- 
ing the generation of conflicts in personnel 
expectations from disparities in value orienta- 
tions, when and if such conflicts are functional 
or disfunctional for the individual and for the 
organization, and the means, if appropriate 
and desirable, for the amelioration of those 
conflicts. 

The present study was not able clearly to 
demonstrate for each of the three ethnic 
groups that officers were significantly less 
particularistic than line employees. Insufficient 
officer WV may account for the lack of statis- 
tically significant differences between officer 
and line Mexicans and Mexican-Americans— 
a difference which was markedly demonstrated 
for the considerably larger Anglo-American 
group. On the other hand, it may be that the 
value for particularism is not importantly 
altered by or a requisite for managerial status 
in the Mexican and Mexican-American banks. 
The data are not conclusive for either inter- 
pretation. The significant difference estab- 
lished in the Anglo banks supports the original 
hypothesis, but introduces an_ interesting 
question. What proportion of the difference in 
particularism can be accounted for by the 
screening and selecting process of promotion, 
and what proportion can be accounted for by 
the process of officer role-enactment following 
promotion? 

The data and interpretations presented in 
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this paper suggest to management in com- 
plex organizations, particularly those organ- 
izations with international branches or sub- 
sidiaries, that differences in the cultural back- 
ground of personnel may yield differences in 
their value orientations—value orientations 
which may influence organizational behavior. 
Also, management should not assume that a 
set of policies, organizational expectations, or 
leadership styles established and proven suc- 
cessful in one culture will inevitably be suc- 
cessful in another. 

The findings of this study suggest to the 
researcher that there is continued need for 
interdisciplinary studies of the interactions 
among specific cultural values and _ specific 
organizational behaviors. 
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ACHIEVEMENT VALUES AND SCIENTIFIC PRODUCTIVITY 


BISHWA NATH MUKHERJEE 1 


Nagpur University, Nagpur, India 


50 college science teachers were tested on Mukherjee’s Sentence Completion 
Test (SCT), a forced-choice scale for measuring achievement values (v Ach). 
The hypothesis of a positive relationship between v Ach and scientific produc- 
tivity was tested. Results indicated a significant positive correlation between 
the 2 variables. Most of the teachers having advanced research degrees were 
also found to have high v Ach scores. The study provides additional evidence 
of the construct validity of the SCT as a measure of v Ach and demonstrates 
the importance of v Ach in the prediction of scientific productivity. 


The purpose of the present study is to 
investigate the relationship between achieve- 
ment values (v Ach) of university teachers 
in the science faculty and their productivity 
in written scientific work defined in terms of 
the number of publications in scientific 
journals, research reports, technical memos, 
and papers contributed to scientific meetings 
and conferences (Taylor, Smith, & Ghiselin, 
1963). The research is based on the premise 
that achievement values can best be viewed 
as those principles that guide one’s conduct 
in making substantive contribution to the 
field or profession in which one is engaged. 
It was therefore predicted that achievement 
values of science teachers and the number of 
their scientific publications are positively cor- 
related. In addition to verifying the above 
hypothesis, the present study aimed at ex- 
ploring the relationship, if any, between 
achievement values and academic degrees 
earned. 


MeEtTHOD 
Subjects 


Fifty male teachers of various science depart- 
ments of Patna University, Patna, India volunteered 
as Ss for the study. Their ages ranged from 22 to 
55 yr. with a mean of 31.86 yr. Out of the 50 
teachers who were sampled for the study, 28 
teachers were found to participate in both under- 
graduate and graduate programs. Of the remain- 
ing, 17 were primarily engaged in teaching at the 
undergraduate level and 5 were exclusively connected 
with graduate programs. 

1The author wishes to convey his heartfelt 
thanks to Braj Kumar Mishra, lecturer in psy- 
chology, Sindri College, Sindri, India for collecting 
the data reported in this article. 


Measurement of Achievement Values 


Achievement values were measured by the Sentence 
Completion Test (SCT), a forced-choice scale, de- 
veloped by Mukherjee (1964). The SCT consists 
of 50 forced-choice triads (one item reflecting 
achievement-related sentiment and the other two 
pertaining to different aspects of manifest needs) 
selected in such a manner as to minimize the “social 
desirability” factor. A few examples of the types 
of forced-choice triads constituting the SCT are 
given below: 


Lee ikem 
A. to be faithful to my friends 
and colleagues. 
B. to be very systematic in my 
work, 
C. to do my best in whatever 


work I undertake. 


A. gay and irresponsible pleas- 
ure-seekers. 

B. mental confusion and chaos. 

C. those situations which are 
not competitive. 


3. I have a general tendency ....... 
A. to continue a work till it 
is finished. 
B. to analyze others’ judgments 
critically. 
C. to be polished in my manners. 


The S’s score on SCT is the number of times he 
has chosen a _ statement reflecting achievement- 
related values. Details concerning the construction, 
administration, scoring, and various psychometric 
properties of the SCT have been reported by 
Mukherjee (1964, 1965c). The test-retest reliability 
of SCT after an interval of nearly 2 mo. has 
been found in various studies to range from .71 
to .83. The test-retest correlation has been found 
to be almost .75 for a sample of 100 Indian male 
graduates to whom the test was readministered after 
a month and a half (Mukherjee & Sinha, 1967). The 
Kuder-Richardson reliability estimate for the SCT 
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was .716 for a sample of 248 American college 
students enrolled at the Jeffersonville and Blooming- 
ton Campus of Indiana University. Previous studies 
(Mukherjee, 1964, 1965a, 1965c, 1965d, 1966) have 
demonstrated the concurrent and construct validities 
of SCT as a measure of v Ach. A factor analysis 
of SCT (Mukherjee, 1965b) revealed that the test 
covers some of the important dimensions of cog- 
nitive expectations regarding achievement, such as, 
aspiration, ego-ideal, perseverance, identification with 
a successful authority, preference for difficult tasks, 
etc. 


Procedure 


Mimeographed copies of SCT along with separate 
answer sheets were distributed randomly to 50 
science teachers regardless of their age and fields 
of specialization. They were requested to complete 
the SCT during their leisure time and return the 
answer sheet within a couple of days. All answer 
sheets were turned in within 2 days. On the answer 
sheet, each S provided information regarding his 
(a) present academic rank, (b) length of service 
as a college teacher, (c) academic degrees, (d) level 
of teaching, (e) age, and (f) publications. Papers 
published in scientific journals only defined each 
teacher’s level of scientific productivity. Papers con- 


tributed to various scientific meetings were also 
considered, irrespective of whether they were 
published or not. 

RESULTS 


Based on the total scores on SCT, Ss were 
classified into two criterion groups, High 
v Ach and Low v Ach. The 25 Ss scoring 22 
and above were arbitrarily designated as the 
High group and the remaining 25 Ss scoring 
21 and below formed the Low v Ach group. 
Table 1 presents the means for the two 
groups on some of the selected variables. 

The data on number of scientific publica- 
tions were examined by ¢ test (one-tailed) 
for differences between means and by cor- 
relational analysis. However, the Student ¢ 


TABLE 1 


MEANS FOR THE Two CRITERION GROUPS 
ON A FEW SELECTED VARIABLES 








High v Ach | Low v Ach 


Variable group group 

(N=25) (N=25) 
SCT score 23.96 16.80 
No. publications 8.20 3.80 
Teaching experience in years 8.70 9.49 
Age in years 31.00 Suma 
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test could not be applied directly since the 
High v Ach group had a significantly higher 
variance in publication variable. So in order 
to achieve homoscedasticity, the number of 
papers (Y;) contributed by each S was sub- 
jected to a transformation of the form: log 
(Y; +1). The means of these transformed 
scores for the High and Low v Ach groups 
were .7001 and .3665, respectively. The 
unbiased variances turned out to be .2541 
and .2478, respectively. A ¢ test indicated 
that the difference between the two criterion 
groups with respect to transformed scores 
on the publication variable was statistically 
significant (t = 2.36, p < .02). Computation 
of the product-moment correlation between 
SCT scores and number of publications also 
indicated a significant positive relationship 
(P82 Pp Oe): 

In order to test the hypothesis of no as- 
sociation between v Ach as measured by SCT 
and degrees earned, a chi-square test was 
run. The x? turned out to be 4.024 which 
for 1 df is significant beyond .05 level. When 
the data on level of teaching were subjected 
to a chi-square analysis, it was found that 
most of the teachers belonging to the High 
v Ach group participated in the graduate 
program whereas most Ss having low v Ach 
scores were exclusively engaged in under- 
graduate teaching (yi? = 7.20, p < .01). The 
hypothesis of no association between v Ach 
and academic rank could not be rejected 
(xu? = 2.10, p> .10). Similarly, ageyand 
v Ach scores were found to have no relation- 
ship (x12 = .32). A chi-square test also re- 
vealed that the two criterion groups did not 
differ significantly with respect to length of 
teaching experience (y17 = .72). 


DISCUSSION 


The study demonstrates that college teach- 
ers in science having high achievement values 
show greater scientific productivity than their 
colleagues whose v Ach scores are low. Con- 
versely, the more productive Ss of the present 
study have attitudes which suggest that they 
strive for a long record of professional 
achievement, for more distant goals, and a 
stronger desire to compete successfully with 
a standard of excellence than the less pro- 
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ductive Ss even though the opportunities 
for scientific research are comparable for 
both the groups. Since the above relationship 
was predicted in advance, the study provides 
additional evidence of the construct validity 
of the SCT as a measure of achievement 
values. 

Direct validational evidence is also fur- 
nished by some of the other data reported 
here. That the relationship between SCT 
scores and academic degrees earned should 
be positive is quite consistent with the con- 
cept of achievement values according to 
which a person having high achievement 
orientation is one who shows self-actualizing 
tendency (Maslow, 1954), exercises his 
talents fully and enthusiastically to achieve 
higher goals that are related to his further 
self-development. It is no wonder that teach- 
ers having higher scores on the SCT were 
found more frequently to teach at the grad- 
uate level than those who have low scores on 
the SCT. The finding holds true even when 
factors such as age, teaching experience, and 
academic rank are held constant. The result 
is interpreted to mean that Ss of the High 
v Ach group have a higher level of aspira- 
tion and a keen interest in undertaking 
difficult tasks. The study lends conclusive 
evidence for the hypothesis that the produc- 
tive scientist has a higher achievement value, 
and thus individual differences with regard 
to scientific productivity may be accounted 
for not only in terms of ability and op- 
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portunity but also in terms of their achieve- 
ment values. 
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DIFFERENTIAL EFFECT OF NOISE ON TASKS OF 
VARYING COMPLEXITY * 
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The method of simultaneous tasks was used to test the hypothesis that the 
deleterious effect of noise on performance varies as a function of task com- 
plexity. 48 Ss performed on 1 of 2 complexity levels of a 4-choice RT task 
and, at the same time, performed a secondary auditory monitoring task. All 
Ss performed in both quiet and in noise. Performance indexes were RT and 
secondary-task errors. Noise produced a significantly greater increase in sec- 
ondary-task errors when the secondary task was paired with the complex 
primary task than when it was paired with the simple primary task. Sec- 
ondary-task performance provided a more sensitive measure of both task 
complexity and the effect of noise than did the RT measure. 


The purpose of this experiment was to test 
the hypothesis that the deleterious effect of 
noise on performance increases as a function 
of increasing task complexity. There have 
been numerous studies conducted to investi- 
gate the effect of environmental noise on per- 
formance. Some have indicated a decremental 
effect of noise (e.g., Broadbent, 1953, 1954, 
1958a; Jerison, 1959; Jerison & Wing, 1957; 
Sanders, 1961; Woodhead, 1964), while oth- 
ers have indicated an incremental effect (e.g., 
Kirk & Hecht, 1963; Wilbanks, Webb, & 
Tolhurst, 1956). A substantial number of 
studies, however, have revealed no significant 
effect whatsoever (e.g., Broadbent, 1957; Hy- 
man, 1950; Jerison, 1957; Kryter, 1950; 
Kurz, 1964; Wilbanks, Webb, & Tolhurst, 
1956). If research which has remained un- 
published as a consequence of achieving nega- 
tive results were to be made available (e.g., 
Chisman, 1963), the weight of evidence indi- 
cating that noise does not significantly affect 
performance would be considerably increased. 

One explanation for the prevalence of nega- 
tive results in laboratory studies involving 
noise might be that Ss were able to draw upon 
unused perceptual capacity and thereby main- 
tain their performance despite the distracting 
noise. In order to detect changes in unused 
capacity associated with performance in 


1 This paper is based upon a thesis completed by 
the first author under the direction of the second 
author and submitted to the University of Iowa in 
partial fulfillment of the requirements of the mas- 
ter’s degree. 

2Now at the University of Connecticut. 
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noise, the present study utilized a procedure 
called the “method of simultaneous tasks.” 
The procedure involved having S perform, 
simultaneously with the primary experimental 
task, a secondary task which effectively oc- 
cupied that portion of S’s perceptual capacity 
which was not occupied by the requirements 
of the primary task. Thus, performance on 
the secondary task provided a measure of un- 
used capacity, and any effect of noise suffi- 
cient to require the use of this excess capac- 
ity to maintain primary-task performance 
was evidenced as a decrement in secondary- 
task performance. According to Poulton 
(1965), this method was first employed by 
Bornemann (1942a, 1942b) although it was 
apparently also developed independently by 
Bahrick, Noble, and Fitts (1954). 

Of special relevance to the present research 
are two studies which utilized the method of 
simultaneous tasks. Poulton (1958) used 
secondary-task performance as a measure of 
the amount of attention required by a per- 
ceptual-motor task and was able to demon- 
strate a difference in difficulty between two 
versions of the task which was not apparent 
by comparing performance on the two tasks 
when performed alone. Broadbent (1964) 
employed the method to investigate the rela- 
tionship between attention required to a re- 
action-time (RT) task and degree of stimu- 
lus-response (S-R) compatibility. Using a 
secondary, mental-arithmetic task, he com- 
pared the performance of Ss working on com- 
patible and incompatible versions of an RT 
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task. The incompatible task caused more in- 
terference with the secondary task than did 
the compatible task demonstrating that a 
perceptual-motor task may be made more 
difficult by making the S-R code less compati- 
ble, and that this difference in difficulty may 
be measured indirectly by observing perform- 
ance on a simultaneous, secondary task. 

There are probably many reasons for the 
apparent conflicting results concerning the 
effects of noise on performance. One impor- 
tant factor seems to be that some tasks are 
more vulnerable to noise than others. Accord- 
ing to Broadbent (1958b), noise produces 
brief failures of perception which, in turn, 
result in the marked decrements observed on 
vigilance tasks and tasks which require con- 
tinuous handling of information. Jerison 
(1957) suggests that the primary factor de- 
termining the effect of noise on performance 
may be the “flexibility of attention” required 
by the task. Noise seems to have a decre- 
mental effect on vigilance tasks which involve 
several sources of information whereas tasks 
which require attention to only one informa- 
tion source are not affected. 

Task difficulty may also be important in 
determining whether noise has a decremental 
effect on performance. Broadbent (1954) 
found a noise-induced decrement on a “‘diffi- 
cult” watch-keeping task (Twenty Dials Test) 
but not on an “easy” task (Twenty Lights 
Test). Park and Payne (1963), however, 
found no evidence of an interaction between 
noise conditions and the level of difficulty of 
division problems. 

The present experiment was concerned with 
the variable of difficulty or, more properly, 
task complexity. In this study, Ss performed 
two tasks simultaneously. The overall com- 
plexity or perceptual load imposed by these 
simultaneous tasks was manipulated by alter- 
ing the S-R compatibility of the primary task, 
a four-choice RT task. Performance on a sec- 
ondary, auditory-monitoring task was used to 
measure Ss’ available excess perceptual ca- 
pacity. It was hypothesized that the introduc- 
tion of noise would increase the total per- 
ceptual load, reduce Ss’ reserve capacity, and 
thereby result in a performance decrement on 
the secondary task. It was further hypothe- 
sized that this noise-induced performance dec- 
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rement on the secondary task would be more 
marked under the condition of high task com- 
plexity. 


METHOD 


All Ss performed two tasks simultaneously under 
quiet conditions and also while exposed to an inter- 
mittent, annoying noise. The primary task was of 
the perceptual-motor type and was presented in 
either of two levels of complexity. The secondary 
auditory-monitoring task was held constant under 
these two complexity levels and was used to detect 
changes in overall perceptual load. 


Primary Task 


The primary task was a four-choice reaction-time 
(RT) task in which Ss, after being alerted by a 
green warning light, responded as rapidly as possible 
to the onset of one of four red stimulus lights by 
pressing the appropriate one of four switches which 
were located directly below each of the lights. In 
the simple condition, the correct switch corresponded 
to the light directly above it. In the complex condi- 
tion, there was no spatial correspondence between 
light and switch. In this latter condition, Ss were 
instructed to respond: (1) to the left light by press- 
ing the second switch from the left; (2) to the sec- 
ond light from the left by pressing the switch at the 
extreme left; (3) to the second light from the right 
by pressing the extreme right switch; and (4) to 
the right light by pressing the second switch from 
the right. The four stimulus lights were arranged 
3% in. apart in a horizontal row, located at a dis- 
tance of approximately 24 in. from S’s eyes, slightly 
below shoulder level. 

Presentation of warning and stimulus lights in the 
RT task was controlled automatically by three 
Hunter interval timers. The first timer activated the 
warning light for a duration of 0.5 sec. The second 
timer provided a 1.0-sec. interval between warning 
light and stimulus light and also activated the stim- 
ulus light. The third timer provided a 5.0-sec. delay 
following the onset of the stimulus light, and then 
automatically initiated a recycling of the three 
timers. A Hunter Klockounter began at the onset of 
the stimulus light and stopped when S pressed one 
of the switches, thus providing a measure of RT. 
Pressing the switch also extinguished the stimulus 
light. The order of presentation of stimulus lights 
was controlled manually by E according to a pre- 
determined sequence. The lights were presented in 
this same sequence on every block of trials, each of 
the four stimulus lights appearing 23 times. Signal 
lights on E’s side of the apparatus permitted detec- 
tion of errors in S’s choice of switch. Since pressing 
any of the four switches caused the stimulus light 
to be extinguished, S received no information con- 
cerning the correctness of his responses. 

The S pressed the switches with the index finger 
of his preferred hand. To control hand-travel time, 
S began each trial from a rest position which was 
marked on the table 7 in. in front of the RT 
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apparatus, on an imaginary line bisecting, and per- 
pendicular to, a line passing through the four switch 
knobs. 


Secondary Task 


The secondary task was similar to one used by 
Brown (1965). A continuous sequence of digits was 
presented to S through a headphone placed over his 
left ear. He was instructed to listen for odd-even- 
odd sequences of digits, and to respond immediately 
by saying “yes” whenever such a sequence was de- 
tected. The digits were recorded on one channel of 
a stereophonic magnetic tape at regular intervals of 
1.5 sec. One odd-even-odd sequence was located 
randomly within every 15-sec. period of the 10-min. 
tape, so that the tape contained a total of 40 odd- 
even-odd sequences. The second channel contained 
signals, audible only to £, indicating the location of 
each odd-even-odd sequence. The tape was played 
on a Sony Model 464 stereophonic recorder. The S 
listened to the digits channel through a Grason- 
Stadler Model TDH-39 earphone (equipped with 
MX 41-AR rubber cushion) connected directly to 
the output of the appropriate internal playback 
amplifier of the recorder, while E listened to the 
marker signals through headphones connected 
through a Sony external power amplifier to the 
playback-preamplifier output of the second channel. 
The playback-preamplifier gain of the channel con- 
taining the signals was reduced to a minimum to 
prevent leakage into the digits channel. This ar- 
rangement permitted constant synchronization of 
the digits with the signals to Z, and provided com- 
plete consistency throughout the experiment in the 
scoring of errors on the secondary task. 


Noise 


The noise consisted of recorded 0.5-sec. bursts of 
the sound produced by a bandsaw cutting aluminum. 
Acoustical frequency analysis revealed strong peaks 
at approximately 1800 and 1900 cps. A total of 90 
bursts (92 db. SPL) were presented in the course of 
each 10-min. experimental run. These bursts were 
recorded on the tape containing the digits. The 
bursts were located randomly, but with the restric- 
tion that they never overlap any of the secondary- 
task digits. The masking effect of noise, therefore, 
was not a factor in this experiment. The digits on 
the “noise” tape were copied directly from the 
“quiet” tape, using a second recorder, so that the 
only difference between the two tapes was, in fact, 
the presence of noise on the “noise” tape. 


Subjects 


The Ss were 24 males and 24 females from the 
elementary psychology courses at the University of 
Towa. All Ss reported having normal hearing. 


Experimental Design 


A mixed design involving one within- and three 
between-S dimensions was employed (Lindquist, 
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1953). Twelve men and 12 women performed on the 
simple RT task, and 12 men and 12 women per- 
formed on the complex RT task. Within each com- 
plexity condition, 12 Ss (6 men and 6 women) per- 
formed first in quiet and then in noise, while the 
remaining 12 Ss (6 men and 6 women) performed 
first in noise and then in quiet. Complexity level and 
sequence were assigned randomly to Ss within each 
sex group. 


Procedure 


Each S served in one experimental session con- 
sisting of one run in quiet and one in noise. Each 
run was 10 min. in duration. The S listened first to 
a 1-min. tape demonstrating the correct method of 
responding to the digits task. He was then given 
two consecutive 1-min. practice sessions on the digits 
task alone. If he failed to respond without error 
during the first min. of practice, the tape was stopped 
and the instructions repeated. All Ss succeeded in 
performing without error during the second min. of 
practice. Next, the RT-task instructions were read, 
followed by the appropriate instructions for the first 
experimental condition, that is, quiet or noise. In 
order to establish the relative primacy of the RT 
task, S was instructed repeatedly that he should do 
as well as he could on the digits task, but that he 
should not let the digits task interfere with the RT 
task. After completing the first run, the instructions 
for the second condition were read, and S was again 
cautioned not to let the digits task interfere with 
the RT task. The entire session lasted approximately 
45 min. 


Scoring 


Records were kept of RT, errors on the RT task, 
and errors on the digits task. Omissive errors (fail- 
ure to report the occurrence of an odd-even-odd 
sequence in the 1.5-sec. interval after the comple- 
tion of the sequence) and commissive errors (in- 
appropriate “yes” responses) on the digits task were 
recorded separately. The first 12 RT trials of both 
experimental runs were considered practice trials and 
were not included in the calculation of RT medians 
and error totals. 


RESULTS 


Performance data from both primary and 
secondary tasks were analyzed to determine 
the effects of noise on performance. 


Primary Task 


An analysis of variance * indicated no sig- 
nificant effect of noise on RT (787 vs. 784 


3 Summaries of analyses of variance and other sum- 
mary tables have been deposited with the American 
Documentation Institute. Order Document No. 9747 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washing- 
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TABLE 1 
TotTaL ERRORS ON SECONDARY TASK AS A FUNCTION OF SEQUENCE, NoIsE, TASK COMPLEXITY, AND SEX 
Males Females 
Task Sequence Condition Condition he 
Quiet Noise M Quiet Noise M 
Quiet-Noise 7.67 11.83 9.75 10.33 13.67 12.00 10.88 
Simple Noise-Quiet 8.67 17 9.92 leds) 10.33 8.83 9.38 
M 8.17 11.50 9.83 8.83 12.00 10.42 10.13 
Quiet-Noise 14.17 21.00 17.58 11.67 18.33 15.00 16.29 
Complex Noise-Quiet 8.83 17.83 13333 10.33 17.00 13.67 13.50 
M 11.50 19.42 15.46 11.00 17.67 14.33 14.90 
Combined mean for both 9.83 15.46 12.65 9.92 14.83 12.38 
complexity levels 




















msec.). There was a tendency for males to 
respond faster than females (766 vs. 806 
msec.), and for RT on the complex task to be 
slower than on the simple task (808 vs. 763 
msec.), but neither of these effects was sig- 
nificant.* The only significant factor affecting 
Roleewas orders” #(1/40) = 11.76, p< .01, 
with the first run being slower than the sec- 
ond (805 vs. 766 msec.). No errors were made 
on the primary task during the experimental 
runs. 


Secondary Task 


Table 1 shows the effects of sequence, 
noise, task complexity, and sex on the total 
number of secondary-task errors, that is, 
omissive plus commissive errors. It may be 
seen that mean error totals for all sex-se- 
quence groups were higher in noise than in 
quiet, and that, in general, a greater number 
of errors was made under the complex task 
condition than under the simple condition. An 





ton, D. C. 20540. Remit in advance $1.25 for micro- 
film or $1.25 for photocopies and make checks pay- 
able to: Chief, Photoduplication Service, Library of 
Congress. 

4Jn a separate experiment, 20 Ss performed the 
RT task alone, that is, without the simultaneous 
secondary task. Half of the Ss performed the simple 
version of the task while the other half performed 
the complex version. Performance on the complex 
task was significantly slower than on the simple 
task (708 vs. 587 msec.; t=2.45, p< .05). 


analysis of variance revealed that the main 
effect of noise was significant; F(1/40) = 
62.52, p < .001. The analysis also indicated a 
significant main effect of complexity; 
PCAC yi=18, cle pe) Ot maior interest, 
however, was a significant Noise x Complex- 
ity interaction; F(1/40) =9.19, p< .01. 
Table 2 summarizes this effect. It will be 
noted that the introduction of noise produced 
a greater increase in errors under the complex 
task condition than under the simple task 
condition. There was no sequence effect to 
complicate the interpretation. 

Separate analyses of variance were per- 
formed for omissive and for commissive er- 
rors on the secondary task. Results of the 
analysis of omissive errors were identical to 
those of the total errors; that is, the effects 
of both noise and task complexity were sig- 


TABLE 2 
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Mean errors Difference 
(noise 
Task minus 
Quiet Noise tas quiet) 
Simple 8.50 11.75 10.13 geno 
Complex} 11.25 18.54 14.90 7.29 





nificant, and there was a significant Noise X 
Complexity interaction. In the analysis of 
commissive errors, noise was the only signifi- 
cant factor. 


DISCUSSION 


Results of this study demonstrate that noise 
produces a significant decrement in perform- 
ance and that the magnitude of this decre- 
ment varies as a function of task complexity. 
Results also suggest that noise effects may 
have gone undetected in much previous re- 
search for lack of a sensitive criterion mea- 
sure, 

Many tasks demand less than the total per- 
ceptual capacity which an individual has 
available. In these situations, the person pos- 
sesses what may be termed a reserve capacity, 
which is free from the demands of the task. 
Should the perceptual demands of the task or 
of the situation increase, he may draw upon 
this reserve capacity and, by so doing, may be 
able to maintain task performance at a con- 
stant high level. In the present study, the 
fact that increased task complexity, and the 
introduction of noise, did not affect RT per- 
formance is attributed to this compensatory 
mechanism. 

The method of simultaneous tasks provides 
a means of detecting changes in reserve ca- 
pacity associated with changes in the task 
and/or environment. Secondary-task perform- 
ance was used in this study as a measure of 
reserve perceptual capacity. In this experi- 
ment, the complex primary task made greater 
perceptual demands than the simple primary 
task. Therefore, less reserve capacity was 
available for the secondary task when it was 
paired with the complex task than when it was 
paired with the simple task. This resulted in 
the greater number of errors on the secondary 
task when it was paired with the complex 
primary task. The introduction of noise “used 
up” some of S’s reserve capacity; that is, S 
had to draw from his reserve so that primary- 
task performance would not suffer as a conse- 
quence of the noise. There was, therefore, 
less perceptual capacity available for sec- 
ondary-task performance, and, consequently, 
errors increased. The fact that the noise pro- 
duced a greater increase in secondary-task 
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errors when the secondary task was paired 
with the complex primary task than when it 
was paired with the simple primary task may 
be attributed to the fact that the noise occu- 
pied proportionally more of the reserve ca- 
pacity in the former case than in the latter. 
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EXPECTATIONS AND DROPOUTS IN SCHOOLS OF NURSING 
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The expectations and experiences of stress and satisfaction of 1,852 1st-yr. stu- 
dents in 43 schools of nursing were assessed by questionnaire. Low but signifi- 
cant negative correlations were obtained between withdrawal and (a) experi- 
enced satisfactions and (b) confirmation of expectations, especially with respect 
to satisfactions. Withdrawal was unrelated to experienced stress, whether ex- 


pected or unexpected. 


Remarks about the nursing shortage az2 
by now commonplace. Since 20-25% of 
entrants to schools of nursing withdraw dur- 
ing the first year, reducing this attrition rate 
would alleviate the shortage. 

Fox, Diamond, and Walsh (1960) re- 
ported on the situations which student nurses 
had indicated as stressful and_ satisfying, 
including those elements in the school situa- 
tion to which students alleged their with- 
drawal was due. In this report and elsewhere, 
students who disliked nursing indicated that 
it was not what they had expected. 

In addition, Kibrick (1958) observed that 
the satisfactions expected by nursing students 
did not have much bearing on their remain- 
ing in school, but that those who were more 
realistic in their appraisal of nursing were 
more likely to remain. This finding is con- 
sistent with that of Weitz (1956) who dis- 
covered that life insurance agents were more 
likely to remain in their jobs if they had a 
clear picture of the job duties. Block, Yuker, 
Campbell, and Melvin (1964) and Brown 
and Landsberger (1960) found that dis- 
satisfaction is influential in determining 
whether an individual will withdraw during 
the early stages of employment. Block and 
his colleagues added that the longer the 
worker remains with the company, the less 
important satisfaction becomes in determin- 
ing his further continuation. This may also 
be true of student nurses, among whom the 
withdrawal rate is greatest during the first 
year, and declines markedly thereafter. 


1 This article is based upon data collected by the 
author in fulfilling the dissertation requirements for 
the PhD degree at Teachers College, Columbia 
University. 


Among the theories that have dealt with 
withdrawal from group participation are 
those of Ross and Zander (1957), March and 
Simon (1958), Herzberg, Mausner, and 
Snyderman (1959), Katzell (1964), and 
Vroom (1964). 

Degree of need satisfaction was hypoth- 
esized by Ross and Zander to be related to 
continued participation in the employment 
situation. Herzberg et al., on the other hand, 
classified the elements of the work situation 
into satisfiers or motivating factors, and dis- 
satisfiers or hygiene factors. Turnover, in 
this theory, was viewed as being related to a 
low level of hygiene factors, causing dis- 
satisfaction, whereas the motivating factors, 
or satisfiers, were perceived as being related 
primarily to productivity. 

In Katzell’s view, withdrawal is directly 
related to the experienced or expected dis- 
satisfactions associated with the situation 
and the satisfactions associated with an 
alternative situation. He also theorized that 
withdrawal is negatively related to the experi- 
enced or expected satisfactions associated 
with the situation and to the dissatisfactions 
associated with an alternative situation. 
Vroom postulated that the more satisfied the 
individual, the greater the force on him to 
remain in the situation and the less the 
probability of his voluntarily withdrawing 
from it. He also indicated that decisions to 
remain or withdraw are related to the positive 
and negative valences of outcomes and to 
the expectancies that the outcomes will be 
attained. 

March and Simon specified that continued 
participation depends upon the difference be- 
tween rewards and contributions, and sug- 


154 


EXPECTATIONS AND Dropouts IN ScHOOLS oF NURSING 


gested that the difference be estimated from 
measures of satisfaction, since the greater 
the excess of rewards over contributions, the 
greater should be satisfaction. Satisfaction 
was seen as being related to, among other 
factors, predictability in terms of the degree 
to which expectations are realized. 


OBJECTIVES 


This study was undertaken as a partial 
test of the March and Simon theory, as it 
applies to withdrawal from schools of nurs- 
ing. From the theory, it should follow that 
the greater the number of satisfactions ex- 
perienced, the more likely a student will be 
to continue in a school of nursing. Conversely, 
the greater the number of stresses experi- 
enced, the more likely a student will be to 
withdraw. In addition, a student will be 
more likely to withdraw if she experiences 
a large number of unexpected stresses than 
if she experiences the same number of stresses 
but they were expected. It should also be 
true that expected but unrealized satisfac- 
tions will tend to cause withdrawal, and that 
experiencing expected satisfactions will do 
more to prevent withdrawal than experiencing 
unexpected satisfactions. 


METHOD 
Sample 


The sample comprised 1,852 first-year students in 
43 schools of nursing. Among the 1,852 students, 
1,439 (77.7%) returned for the second year; 183 
(9.9%) withdrew for academic reasons; 230 (12.4%) 
withdrew for nonacademic reasons. 


Instruments 


A 139-item questionnaire was constructed to 
measure the satisfactions (56 items) and _ stresses 
(83 items) expected and experienced by the students. 
A single set of items was used, with one form of 
instructions being given when the students’ ex- 
pectations were collected, and another when ex- 
periences were collected. 

Tllustrative of the questionnaire items are the 
following 10, of which the first 5 were classified as 
stress items, and the remainder as satisfaction items. 


1. Being homesick. 

2. Being respected less as a student nurse, by 
people outside the school, than if one had 
pursued another career. 

3. Spending very little time working with pa- 
tients. 

4, Doubting the wisdom of the choice of nursing 
as a career. 
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. Being in the presence of death. 

. Having lots of fun. 

. Having someone to tell one’s troubles to. 

. Finding that a student nurse is an important 

part of the team in the hospital. 

9. Being so interested in nursing that one doesn’t 
find it hard to learn what one needs to know. 

10. Getting good grades. 


CONT ON UU 


Fourteen scores were derived for each student from 
the pair of questionnaires.? 


Procedure 


During the first week they were in school, the 
students completed the Expectations Questionnaire. 
A supply of Experiences Questionnaires was pro- 
vided to each school of nursing to be administered 
to any students who withdrew between the group 
sessions of the Expectations and Experiences 
Questionnaires. Eight months after the Expectations 
Questionnaire, the Experiences Questionnaire was 
administered to students still enrolled. One year 
after the students had entered the schools of nursing, 
a questionnaire was sent to each cooperating school 
to learn which students had withdrawn and the 
reason for withdrawal of each. 

The 14 scores derived from the questionnaires 
were correlated, using point-biserial coefficients, 
with (a) survival-withdrawal, (b) survival-academic 
withdrawal, and (c) survival-nonacademic with- 
drawal. 


RESULTS 


The scores of the 1,439 survivors and the 
three categories of dropouts are compared 
in Table 1. Differences significant at the 
.0O1 level were found between the mean scores 
of the survivors and each of the dropout 
groups in the total number of satisfactions 
experienced, the number of expected satis- 
factions experienced, and the number of ex- 
pected satisfactions which were not experi- 
enced. In each case, those who remained in 
school had experienced more satisfactions. 
Although survivors and dropouts did not differ 
in numbers of expected satisfactions and 
stresses, the survivors had significantly more 
of their expectations confirmed than did the 
dropouts. 

Survivors experienced more unexpected 
satisfactions than did dropouts, the difference 
being significant in the academic and total 
dropout groups. The academic dropouts ex- 
perienced significantly more unexpected 


2 Copies of the questionnaires and data relative to 
their construction may be obtained from the author 
on request. 
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TABLE 1 


Scores oF 1,439 Survivors, 413 Dropouts, 183 AcApEmIc Dropouts, AND 
230 NonAcADEMIC Dropouts 



































: D. Non- 
A. Survivors | B. Dropouts Rapeede tc academic t ratios 
Dou dropouts 
Score 
M SD M SD M SD M SD A-B A-C A-D C-D 

Satisfactions expected 37.68 | 6.64] 37.33 | 6.99 | 37.99] 6.89] 36.81] 7.01] .91 57 1.76 eA 
Stresses expected 36.18 | 12.09 | 36.53 | 12.46 | 35.07 | 12.59 | 37.68 | 12.24 A Ls 1.72 Dae 
Satisfactions experienced 31.94] 6.80] 28.62] 7.78] 28.67] 7.70| 28.58] 7.84] 7.85** | 5.47** | 6.13** | .12 
Stresses experienced 37.51 | 10,82 || 37.72 | 12.01 | 38.14 | 11.83 | 37.38) 12.14} .32 68 vim .64 
Expected satisfactions which were 26.49} 7.07] 23.77| 7.54] 24.10] 7.55 | 23.50] 7.52] 6.54** | 4.05** | 5.63** | .80 

experienced 
Expected satisfactions not 11.19] 4.92] 13.57] 6.07] 13.89] 6.26] 13.31] 5.90] 7.30** | 5.60** | 5.16** | .96 

experienced 
Unexpected satisfactions which were 5.45] 3.66} 4.85] 3.79| 4.57] 3.23] 5.08] 4.16 | 2.86** | 3.41** | 1.27 1.40 

experienced 
Unexpected satisfactions not 12,48") 5201 |/ 13428! 5.62) |) 112782) | 95.659113.64) | 95157) 2.608" |) 77 2.96** | 1.47 

experienced 
Expected stresses which were 24.13 | 9.66 | 24.04 | 10.40 | 23.57 | 10.27 | 24.42 | 10.48] .16 .70 39 .83 

experienced 
Expected stresses not experienced 12.04} 5.92]12.48| 6.45] 11.50] 6.19|13.27] 6.55] 1.24 dead 2.67** | 2.81** 
Unexpected stresses which were 13.37 | 6.69] 13.68 | 6.80] 14.57] 7.48|12.97/] 6.11] .82 2.06* 91 2.33* 

experienced 
Unexpected stresses not experienced | 33.04 | 11.09 | 32.08 | 11.79 | 32.52 | 12.04 | 31.73 | 11.59 | 1.48 Bees 1.60 -67 
Confirmed expectations 96.14 | 9.13 | 93.17 | 10.68 | 93.02 | 11.52 | 93.28] 9.96 | 5.13** | 3.51** | 4.07** .24 
Unconfirmed expectations 42.04] 9.02 | 44.58 | 10.19 | 44.52 | 10.82 | 44.63] 9.66] 4.53** | 2.94** | 3.77** | 10 
Items omitted 80] 2.75} 1.25] 3.94] 1.46] 4.44] 1.09] 3.49] 2.17* | 1.96* | 1.20 .92 

p< 205. 
#FD < 01. 


stresses than either the survivors or the 
ynonacademic dropouts, and expected sig- 
nificantly fewer stresses than the nonacademic 
dropouts. 

The nonacademic and total dropout groups 
exceeded survivors in numbers of satisfac- 
tions which were neither expected nor ex- 


perienced. Nonacademic dropouts also ex- 
ceeded academic dropouts and survivors in 
the number of stresses expected but not ex- 
perienced. The survivors omitted fewer items 
than the dropouts. 

The point-biserial correlations between 
withdrawal and each of the 14 questionnaire 


TABLE 2 


CORRELATIONS BETWEEN WITHDRAWAL AND SCORES DERIVED FROM THE TWO QUESTIONNAIRES, 
BASED ON ALL SURVIVORS AND Dropouts 








All survivors and 





All survivors and 


All survivors and : 
nonacademic 





Score all dropouts academic dropouts dronouts 
(NV = 1,852) (N = 1,622) (N = 1,669) 

Satisfactions expected +.022 +.015 +.045 
Stresses expected +.012 —.029 +.043 
Satisfactions experienced LOS — .148** —.164** 
Stresses experienced +.008 +.018 + .004 
Expected satisfactions which were experienced =i Oe —— Asn 
Expected satisfactions not experienced 1875 +.165** +.143** 
Unexpected satisfactions which were experienced —.067* Ou — .034 
Unexpected satisfactions not experienced +.063* +.021 -+.077* 
Expected stresses which were experienced — .004: —.018 +.010 
Expected stresses not experienced +.030 +.029 +.070* 
Unexpected stresses which were experienced +.019 +.056 —.021 
Unexpected stresses not experienced 030 —.014 —.040 
Confirmed expectations an LOA ae — One 
Unconfirmed expectations +.112** +.084** +.097** 

* 
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scores are presented in Table 2. In each 
group, 5 of the 14 coefficients, though small, 
are significant at the .01 level. The 5 all 
pertain to satisfactions which were or were 
not experienced and to confirmation of ex- 
pectations. 

No relationship was demonstrated in any 
of the groups between withdrawal and (a) 
expected satisfactions, (6) expected stresses, 
(c) experienced stresses, (d) expected stresses 
which were experienced, (e€) unexpected 
stresses which were experienced, and (f) 
unexpected stresses which were not experi- 
enced. 


DISCUSSION 


The results of this study supported March 
and Simon’s (1958) postulates relative to 
satisfaction, but gave no support in relation 
to stress. The predicted effect of expectations 
was observed in the area of satisfactions, but 
received uncertain support in regard to 
stresses. 

Contrary to Herzberg et al.’s (1959) view, 
it did not appear that student withdrawal 
was primarily due to dissatisfiers. It did, 
however, seem clear that there were dif- 
ferences between satisfiers and dissatisfiers, 
as Herzberg indicated, although no attempt 
was made here to validate MHerzberg’s 
measures. 

The measures used in this study also did 
not lend themselves to direct implications 
for Ross and Zander’s (1957) position. It 
would, however, be anticipated from their 
theory that dropouts, more than survivors, 
would report interference of school with their 
pursuit of extramural satisfactions, but this 
was not found to be the case. 

The negative relationship between satisfac- 
tion and withdrawal, posited by Katzell 
(1964) and by Vroom (1964), received sup- 
port from these findings. Their respective 
views concerning the relation of dissatisfac- 
tion, or negative valence, to withdrawal were 
not supported. The absence of stress did not 
have the favorable consequences associated 
with satisfaction, although the absence of 
satisfaction tended to have negative con- 
sequences, especially when the satisfaction 
was expected. 


Tod 


Despite the fact that significant correla- 
tions were obtained, so small a fraction of 
the total survival-withdrawal variance is pre- 
dictable that the contribution may seem 
negligible. On the other hand, because of the 
large numbers of students and schools in- 
volved, the saving of even a small percentage 
of those who would otherwise withdraw from 
schools of nursing could contribute sub- 
stantially in absolute numbers to the allevia- 
tion of the nursing shortage. 


CONCLUSIONS 


1. Withdrawal was inversely related to 
experienced satisfaction. 

2. Withdrawal was inversely related to the 
confirmation of expectations, especially in the 
area of satisfactions. 

3. Withdrawal was unrelated to experienced 
stresses, whether expected or unexpected. 

4. Stress and satisfaction did not operate 
as opposite ends of the same continuum. 
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RELATION BETWEEN ETHNIC ORIGIN AND GSR 
REACTIVITY IN PSYCHOPHYSIOLOGICAL 
DETECTION * 
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The psychophysiological data of 3 Jewish and 1 Bedouin samples were ana- 
lyzed in connection with the detectability of a previously chosen card. Ss of 
Near Eastern origin tended to show lower pulse rate, higher basic skin con- 


ductance, and lower relevant GSR reactivity. 


It is proposed that psycho- 


cultural differences are likely to be the explanation for the consistent differen- 
tial GSR reactivity to the relevant stimuli. 


Attention has recently been drawn to the 
significant differences in skin conductance 
found in different ethnic and racial groups. 
Johnson and Corah (1963) reported that 
Negro subjects (Ss) showed lower skin-con- 
ductance values than did Caucasian Ss. This 
finding was supported by further data from 
Bernstein (1965), Johnson and Landon 
(1965), and Malmo (1965). A study by 
Lazarus, Tomita, Opton, and Kodama (1966) 
compared the skin conductance of Japanese 
and American students as they watched a 
stressful film. These data suggest that the 
Japanese may have higher skin conductance. 
More limited data reported by Wenger in 
Johnson and Landon (1965) as well as by 
Malmo (1965) suggest that oriental Ss may 
have lower skin conductance than Caucasian 
Os. 

Less interest has been focused on analogous 
differences in GSR reactivity. Sternbach and 
Tursky (1964) reported skin-potential dif- 
ferences related to ethnic background during 
the use of electric shock stimulation in con- 
nection with pain-threshold determination. 
Johnson and Landon (1965) reported racial 
differences in GSR reactivity to the initial 
presentation of a tone stimulus, but this 
difference became insignificant upon correc- 
tion for prestimulus base-line conductance. 
Lazarus et al. (1966) did not analyze their 
data on a time scale comparable to the usual 
GSR analysis, but did report that “Unlike 


1The research reported in this paper has been 
sponsored by the Air Force Office of Scientific Re- 
search, through the European Office, Aerospace 
Research, United States Air Force, under Contract 
AF 61 (052)-839. 


American samples, the Japanese did not show 
marked stimulus-controlled variation in skin 
conductance during the stress film ... [p. 
631]. 

In this laboratory, differences in GSR re- 
activity were found to exist between Jewish 
Ss of different ethnic backgrounds (Kugel- 
mass, 1963; Kugelmass & Lieblich, 1966). 
Jewish Ss of Near Eastern origin (born in 
the Moslem countries of the Mediterranean 
basin, Morocco, Yemen, Iraq, or Iran) 
showed less GSR reactivity than Jewish Ss 
born in America, Europe, or Israel. These 
differences in GSR reactivity were observed 
in connection with the response to the name 
of the relevant chosen card during the sub- 
sequent interrogation part of a card-detection 
experiment. The purpose of the present report 
is to present supplementary analyses of the 
above-mentioned data as well as to report 
new data derived from independent samples. 


METHOD 
Original Sample 
This sample has been previously described in 
Kugelmass (1963) and Kugelmass and Lieblich 


(1966). The Middle Eastern Ss exhibited less GSR 
reactivity in the three different stress conditions, 
reaching statistical significance in one of them. In 
addition, 8 of the 36 Ss turned out to be non- 
reactors in all three experimental tests. All eight 
of these total nonreactors were of Near Eastern 
origin, and no Western S showed this pattern. It 
was of interest that no significant difference in 
GSR reactivity was observed in the responses to 
the other irrelevant cards. 

Base-line data. (a) Conductance: Values of skin 
resistance (ohms) were taken 30, 60, 90, and 120 
sec. after the beginning of the first 3-min. rest 
period which opened each experimental examination. 


158 


ETHNIC ORIGIN AND GSR REAcTIVITY 


These values were converted to conductance in 
micromhos, and averaged for each S, and then 
averaged over Ss. The mean conductance value for 
the Near Eastern Ss was 19.74 micromhos, and 
17.38 micromhos for the Western group. The dif- 
ference was not statistically significant at the .05 
level (t=1.26 for 13 Western Ss and 23 Near 
Eastern Ss). Examination of the data of the sub- 
group of total nonreactors showed a mean _ basic 
conductance of 22.40 micromhos which is signifi- 
cantly higher (f= 4.10, with a combined variance 
estimate of 7.40, a< 01). 

(b) Pulse rate: Five-second samples of pulse-rate 
data were taken at 30, 60, 90, and 120 sec. after 
the beginning of the first 3-min. rest period. The 
mean pulse rate of the Near Eastern Ss was 80.49 
beats/min, while the rate for the Western Ss was 
86.45 beats/min. With a combined variance esti- 
mate of 145.36, this difference between means is 
not significant at the .05 level (t=1.43). 

It should be added that no significant difference 
in years of education or intelligence stanine scores 
was found to exist between the two groups of 
Ss who had a mean number of 6.30 yr. of education 
(GSa= 75.) 


Blood-Pressure Cuff Sample 


This sample was also described in Kugelmass 
and Lieblich (1966). The 40 Ss of this additional 
sample participated in an experiment in which GSR 
readings during card detection were made both 
with and without an accompanying inflation of a 
blood-pressure measurement cuff. The data to be 
reported here are all from the GSR condition which 
did not include this additional variance and thus 
are more comparable to the rest of the data to be 
presented. 

GSR differential reactivity data. The mean re- 
actions to the “hot” relevant chosen card were 
analyzed as in the original sample, and Ss were 
categorized as reactors if their mean response was 
at least 1,000 ohms. The relation between ethnic 
background and this reactivity may be seen in 
Table 1. 

A chi-square analysis of the data indicates that 
a significantly higher proportion of Near Eastern 
Ss are nonreactors, X2=3.30 (a@ < .05, one-tailed). 
Once again no significant difference was found in 
GSR reactivity to the irrelevant cards. 


TABLE 1 


RELATION BETWEEN ETHNIC ORIGIN AND GSR 
REACTIVITY TO THE ‘HoT’? CARD 











Ethnic origin R NR Total 

Western 10 6 16 

Near Eastern 8 16 24 
Total 18 22 40 





Note.—R = reactor; NR = nonreactor. 
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Base-line data. (a) Conductance: Conductance 
values were obtained as in the original sample. The 
mean skin-conductance value of the 24 Near Eastern 
Ss was 17.92 with 18.36 micromhos as a mean for the 
16 Westerners. The difference between these two 
means with a combined variance estimate of 7.50 
was not significant at the .05 level (t =0.50). 

(b) No pulse-rate data had been collected in 
this experiment. 

There was no significant difference in education 
or intelligence stanine between the Near Eastern and 
Western parts of the sample which had a mean 
number of 9.30 yr. of education (S = 2.06). 


Criminal-Suspect Sample ? 


As a follow-up to the two experiments described 
above a sample of criminal suspects was examined 
at the central laboratory of the Israel Police Force. 
The standard card-test procedure used in the above 
experiments was incorporated into the regular 
police interrogation procedure of 62 criminal suspects 
connected with serious offences. The only other dif- 
ference in procedure derives from using the police 
apparatus which was a _ standard three-channel 
Stoelting polygraph. This limited the analysis in 
that no estimate of skin conductance was obtained 
and changes in millimeter deflections serve as an 
indication of GSR reactivity. 

GSR differential reactivity data. GSR responses 
were defined as the maximal change in millimeters 
from base line within 5 sec. of the presentation of 
the number. Mean responses to each of the cards 
were calculated for the Near Eastern Ss (N =39) 
and the Western Ss (NV =19). The records of four 
of the Ss were technically inadequate. The mean 
response of the Near Eastern Ss to the relevant card 
was 12.51 mm. while for the Western Ss it was 
21.47. With a combined variance estimate of 303.11 
the difference between means is significant at the 
.05 level (tf = 1.82, one-tailed). No significant differ- 
ence was found in GSR reactivity to the irrelevant 
cards (Near Eastern, X = 7.68, Western, X = 7.56, 
combined S = 28.91). 

Base-line data. (a) This Stoelting model does not 
provide for measuring basic conductance. 

(b) Pulse-rate data: The pulse-rate data were 
obtained by counting the spikes in the blood- 
pressure cuff recording which is a standard feature 
of this apparatus. Five-second samples of pulse- 
rate data were measured at 30, 60, 90, and 120 sec. 
after the beginning of the card-test interrogation 
during the use of the blood-pressure cuff. The mean 
pulse rate of the Near Eastern Ss was 95.99 beats/ 
min, while the rate for the Westerners was 101.90. 
With a combined variance estimate of 364.07, this 
difference between the means is not significant at 
a=.05 (t=1.13, 39 Near Easterners, 20 Western- 
ers). 


2The authors wish to thank the Israel Police 


Force and especially M. Kaplan, A. Ben-Ishai, and 
A. Opatowski for their help and cooperation. They 
also wish to thank G. Ben-Shakhar for expert 
assistance. 
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Bedouin Sample 


Following the previously mentioned experiment 
which showed the same general trends, it was 
decided to obtain a sample of Ss who approximate 
the phenotype of the Near Eastern natives as 
closely as possible. Such Ss were available in 
Bedouin Moslem tribes which dwell in the Israeli 
Negev desert. A detailed description of these 
Bedouin tribes may be found in Groen, Balogh, 
Levy, and Yaron (1964). It has been noted that 
there exists a rough gradient of decreasing modern 
influence on Bedouin culture in moving from the 
northern Negev to the southern part where few 
Jewish settlements exist as yet.2 As a first step it 
was decided to draw a sample which would be at 
least roughly comparable to the previous samples 
in having some minimum education in a formal 
setting. 

Subjects. Ten male Bedouin Ss of the Arab El 
Houzaiel tribe who live close to the Jewish Kibbutz 
settlement Shuval in the Northern Negev were 
tested. All of these tribesmen were engaged in 
agricultural work. The mean level of formal educa- 
tion was 5.8 yr. of school. 

Procedure. The Ss were tested with the Stoelting 
apparatus obtained from the Police Laboratory as 
well as with the constant-current (20 mwa.) battery- 
operated GSR apparatus used in the original and 
blood-pressure cuff experiments described above, 
which employs Beckman electrodes, on the volar 
surface of the first and fourth fingers of the left 
hand. Sanborn paste is used to ensure standard con- 
tact. The testing took place in the main room of 
Sheik Souleiman E] Houzaiel on a cool January day 
(17° C). The Ss were tested using the two apparatus 
setups in a rotated order. The standard card test 
(Kugelmass & Lieblich, 1966) was administered in 
Arabic through the use of an Israeli police interpre- 
ter. The research had been presented to the Ss as 
part of a research effort of the Hebrew University to 
investigate physiological responses of different ethnic 
groups in Israel. 

GSR differential reactivity data. All of the 10 
Bedouin Ss were found to be nonreactors by the 
criterion previously defined. The records tended to 
be generally “flat” and even the mean response to 
the hot card was only 187.50 ohms. Because of this, 
no significant card detection was possible. As an- 
other means of examining GSR reactivity it seemed 
worthwhile to check the “orienting response” of 
the Ss to the first buffer item that is used to open 
all interrogation sequences. The mean orienting re- 
sponse of the Bedouin Ss was 1000.00 ohms, that of 
the Near Eastern of the original sample (most 
comparable in level of formal education) was 
1843.48 ohms, and of the Western Ss was 1730.00 
ohms. Neither the difference between the Bedouin 
and the Westerners (combined variance estimate 
2,188,812, t= 1.04) nor the difference between the 


3 Lt. Col. Sasson Bar-Zvi, former military gover- 
nor of the Negev region, personal communication, 
1966. 


Sot KuGELMASS AND ISRAEL LIEBLICH 


Bedouin and the Near Easterners (3,974,465, t= 
1.04) was significant at the .05 level. 

Base-line data. (a) Conductance: The average 
basic skin conductance of the 10 Bedouin Ss was 
found to be 31.39 micromhos. With a combined 
variance estimate of 79.55, the difference between 
this mean and the mean of the Jewish Near Eastern 
Ss, 19.74, is significant at the .01 level (t =3.50). 
The Western mean was even lower, 17.38, and 
with a combined variance estimate of 109.47 is 
also significantly lower at the .01 level (f =3.11). 

(b) Pulse rate: Pulse-rate data were obtained 
from the Stoelting records as described in the 
criminal-suspect sample. The mean pulse rate of 
the 10 Bedouin Ss was 69.50 beats/min. This is 
significantly lower than the Near Eastern mean, 
80.49 (a<.05, t=2.42, with a combined variance 
estimate of 155.78), and the Western mean, 86.45 
(a<.01, t=3.08, with a combined variance esti- 
mate of 205.87). 


DISCUSSION 


The ethnic differences that caught the 
authors’ attention in the first two experiments 
were primarily concerned with a particular 
kind of GSR reactivity to the relevant card. 
Following the results described above it 
seemed important to consider the more sus- 
tained base-line state of the organism in ad- 
dition to the relatively transient reactions 
of the S to specific stimuli. 

The data indicate that there may be a 
tendency for the Near Eastern Ss to have 
higher skin conductance. This is clearly seen 
in the Bedouin sample. Johnson and Corah 
(1963) considered two aspects of the skin 
in connection with the racial differences they 
found. They did not consider the skin color 
per se as a likely explanation, and Johnson 
and Landon (1965) were not able to find 
significant differences in the number of active 
eccrine sweat glands. While Malmo’s (1965) 
more limited data do suggest a negative cor- 
relation between skin conductance and pig- 
ment concentration, the present data are not 
in accord with this suggestion since Near 
Eastern Jewish and Bedouin Ss are darker 
on the average than Western Jews. Another 
possibility suggested by Johnson and Corah 
(1963) related lower conductance to a 
thicker stratum corneum. Superficially, at 
least, the present data are not in accord with 
this position. The Bedouin group with by 
far higher skin conductance is the only group 
of manual workers who might be expected to 
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have a thick, calloused, stratum corneum. 
Of course a more precise test of this issue is 
necessary. Of possible greater relevance to 
the present data is a finding mentioned by 
Johnson and Landon (1965): K. M. Taylor 
found that “Oriental Ss had greater chloride 
concentration than Caucasian Ss [p. 328].” 
The higher conductance finding in the 
Bedouin may reflect adaptation to some 
aspect of the immediate environment, which 
involves the amount of sweat secreted, its 
chemical composition, or both. 

It is also possible to view these conductance 
differences as reflecting a difference in basic 
activation. An examination of the pulse-rate 
data which is another index of activation sug- 
gests caution in such an approach. The pulse- 
rate data would actually suggest the opposite 
since the Bedouin Ss had a significantly lower 
pulse rate and there is some tendency toward 
the same in the Near Eastern Jews. It is, of 
course, difficult to evaluate the comparative 
psychological states of the Ss during the 
examinations, but it does seem plausible to 
assume much less familiarity and possibly 
greater stress for the Bedouin Ss. On the 
other hand, medical research has focused on 
the lower risk of cardiovascular disease ob- 
served in Yemenite Jews and Israeli Bedouin 
(Groen et al., 1964). This might reflect a 
low level of cardiovascular response to stress 
states. Another way of describing these find- 
ings would be to extend the notion of “re- 
sponse stereotypy” used in connection with an 
individual’s psychophysiological responses 
over situations (Lacey & Lacey, 1958) to 
ethnic-group response stereotypy. The rela- 
tively low variance found in the Bedouin 
records might even suggest a genetic mechan- 
ism for such a notion. 

GSR reactivity should be considered in 
the light of the tendency to find higher base- 
line skin conductance in the Near Eastern 
Ss. All three studies dealing with different 
Jewish samples indicated significant ethnic 
differences in reactivity to the “relevant” 
card as opposed to little or no difference to 
the irrelevant card. In the case of the 
Bedouin, all GSR reactivity to the stimuli 
was relatively low although an appreciable 
“orienting” reaction was elicited. Previous 
analysis had suggested some relationship be- 
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TABLE 2 


CORRELATIONS BETWEEN BAsic CONDUCTANCE AND 
BOTH RELEVANT AND IRRELEVANT 
GSR REACTIVITY 











Base line 
(conductance) | Base line and 
and cold card “hot” card 
(ohms) 
Original sample 
(N = 36) —0.363* —0.370* 
Blood-pressure cuff 
(N = 40) —0.155 —0.245 
*a <.05. 


tween relevant-card reactivity and base-line 
skin conductance when considering a special 
nonreactive subgroup of the Near Easterners 
as compared with the Westerners (Kugelmass, 
1963). Correlations were therefore calculated 
for the relationship between base-line skin 
conductance and both relevant- and irrele- 
vant-card GSR reactivity for the original and 
the blood-pressure cuff samples, and pre- 
sented in Table 2. 

All of these Pearson correlations were 
negative and low, reaching significance only 
in the original sample. This direction is what 
may have been expected from a traditional 
conception of the Law of Initial Values. The 
recent findings of Hord, Johnson, and Lubin 
(1964) as well as Johnson and Landon (1965) 
obtained significant correlations in the op- 
posite direction, when using conductance 
changes in micromhos. In the light of their 
findings it is difficult to conceive of an index 


TABLE 3 


MEAN CONDUCTANCE CHANGES IN MICROMHOS AND 
VARIANCE ESTIMATES TO THE RELEVANT CARDS 
IN THE DIFFERENT SAMPLES 








S Middle Eastern Western 
ample 
sample sample 
Original 
xX 523 .630 
S? 057 024 
Blood-pressure cuff 
K .246 Sil 
Se 054 050 
Bedouins 
x 216 — 
5? 111 — 
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that completely frees the measure of re- 
activity from the tonic base-line conductance 
within the GSR channel. Nevertheless, con- 
ductance change in micromhos was calcu- 
lated for the relevant-card data of the origi- 
nal, blood-pressure, and Bedouin samples and 
is presented in Table 3. The same pattern of 
differences in reactivity appears when using 
this transformation. 

Given the above findings it is difficult to 
conceive of the base-line condition being an 
important factor in explaining the consistent 
differential GSR _ reactivity pattern that 
emerged in all three Jewish samples. There is 
some possibility that the influence of the 
base line comes into operation only after 
surpassing a minimal level of reactivity which 
is more likely in the greater response to the 
relevant card. This should produce a higher 
negative correlation between relevant-card 
reactivity and base-line conductance than for 
the irrelevant-card reactivity. The correla- 
tions are too low to supply a clear test of 
this notion, but this should be considered 
further. On the whole, the data would sug- 
gest that this would only be a minor source 
for the differential effect, which would ap- 
pear to be a result of psychological factors. 

It is possible to conceive of a range of 
psychological variables that might bring 
about such a differential reaction. As a con- 
servative position it seems best to consider 
all possible factors that produce GSR re- 
sponses such as shifts of focus of attention, 
concentration, effort, etc. rather than limit 
theorizing to the specific lie-detection aspects 
of the experimental situation. This might be 
differentially activated by any of the demand 
characteristics of the card-detection task 
which might vary in meaning for each in- 
dividual and probably systematically for 
groups of different cultural background. A 
study from this laboratory has recently man- 
ipulated this differential GSR reactivity by 
variations in relevance of stimuli to the 
person (Lieblich, 1966). 

In summary, there appear to be at least 
two issues to be considered in relating ethnic 
differences to psychophysiological reactivity 
which should be considered further. The 
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present data support the notion of consistent 
differences in basic activation as indexed by 
skin conductance and possibly pulse rate 
which do not go in the same theoretically 
expected direction. Possibly related to these 
base-line differences are consistent differences 
in short-term GSR reactivity which appear 
to be more closely related to psychological 
factors. 
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EFFECT OF PERCENTAGE AND SPECIFICITY OF FEEDBACK 
ON CHOICE BEHAVIOR IN A PROBABILISTIC 
INFORMATION-PROCESSING TASK * 
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Ohio State University 


Proficiency in choosing the better of 2 sources of information was studied as 
a function of 3 variables: percentage feedback, specificity of feedback, and 
difficulty of discrimination between sources. 210 Ss performed 200 trials under 
1 of 21 conditions. The information pool consisted of 2 distinct distributions 
of numbers from which random selections were made and displayed as read- 
ings on a pair of meters. Only 1 reading was shown per trial, and S judged 
whether it or the other meter reading (not shown) was higher. Choices could 
thus be based upon both the momentary state of 1 meter and familiarity with 
the distributions from which information was drawn. Contrary to typical 
findings, percentage feedback had no influence on acquisition or performance; 
paradoxically, a specificity effect appeared when feedback was provided. 


The term knowledge of results (KR) refers 
to information provided for an individual or 
group of individuals concerning the outcome 
of a response or group of responses. Research 
on a variety of tasks including the ergograph 
(Arps, 1917), graded handwriting (Gilliland, 
1925), and complex arm-leg coordination 
(Lindahl, 1945) has demonstrated that per- 
formance with KR is superior to performance 
without KR. 

In many realistic settings, however, such as 
complex military and industrial systems, it is 
difficult or impossible to provide complete or 
accurate KR after each response. An issue of 
some practical importance, therefore, is how 
often—or on what percentage of responses— 
feedback must be provided in order to main- 
tain its facilitating influence. Research ad- 
dressed to this problem has yielded ambiguous 
results. Kimble (1961), for example, sum- 
marizes the findings of several studies con- 
cerned directly with percentage of feedback 
by stating that performance “. . . is usually, 
but not always poorer... [p. 161]” for 
partial than for 100% conditions. On the 

1 The research reported in this paper was carried 
out at the Human Performance Center and was 
sponsored by the Aerospace Medical Research Lab- 
oratories, Aerospace Medical Division, Air Force 
Systems Command, Wright-Patterson Air Force 
Base, Ohio, under Contract No. AF 33(615)-2248 
with the Ohio State University Research Founda- 
tion. Further reproduction is authorized to satisfy 


the needs of the United States Government. 
2.Now at the University of Maryland. 


other hand, other investigators have reported 
that performance is not seriously impaired on 
simple conditioning and reaction-time tasks 
until reinforcement or feedback is withheld on 
70-80% of the trials (McCormack, Bending, 
& McElheran, 1963; McCormack & McE]l- 
heran, 1963). Goldstein, Southard, and Schum 
(1967) obtained a similar result using a com- 
plex decision-making task; again, no decre- 
ment appeared until KR was withheld on 
67% of the decision responses. 

Part of the confusion with respect to effi- 
cacy of KR may stem from the fact that it is 
likely to affect more than one organismic 
process. Bilodeau (1966), for example, dis- 
tinguishes three potential functions which 
might be involved—to one or another degree 
—in any specific instance of KR: a motivat- 
ing function, a reimforcing function, and a 
directive function. Each of these processes 
might be affected somewhat differently by 
withholding feedback on some proportion of 
trials. It could be, for example, that the per- 
centage required to maintain a high level of 
motivation is fixed and relatively small, while 
that necessary for proper direction varies with 
such factors as task difficulty, specificity of 
instructions, and availability of other sources 
of guidance. 

While it is difficult, if not impossible, to 
isolate one of these KR functions completely 
in any experimental design, the primary em- 
phasis of the present study was upon the 
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directive or informational facet. Thus, the 
importance of percentage of KR was examined 
in light of several other variables which might 
be expected to control the informational value 
of feedback provided. These variables were 
task difficulty and specificity of KR. 

The relevance of task difficulty to the per- 
centage of KR issue is rather obvious. If the 
information in KR is of any value for response 
guidance, it should become increasingly im- 
portant to the subject (S) as his course be- 
comes less clear. Consequently, percentage of 
feedback should be of more significance as 
difficulty is increased. 

On a purely logical basis, specificity of KR 
also seems to bear directly upon the informa- 
tional value of feedback; more specific KR 
should provide more precise guidance of be- 
havior. Actually, however, the relationship is 
not nearly so simple. Evidence to date sug- 
gests that people are limited in their capacity 
to use highly specific feedback information 
(Eriksen, 1958; Goldstein et al., 1967); while 
some specificity may be beneficial, further 
increases in detail may actually degrade per- 
formance (Annett, 1961; Crafts & Gilbert, 
1935). Since it is thus difficult to anticipate, 
for any particular set of conditions, how well 
specific KR will be utilized, it is virtually im- 
possible to predict the way in which speci- 
ficity should interact with percentage of KR. 
Nevertheless, if both factors appear orthog- 
onally in a design, it should be possible to 
determine whether the specific information is 
used and, if so, whether it increases in useful- 
ness as it occurs on a greater percentage of 
trials. This was the approach taken in the 
present study; two distinct levels of speci- 
ficity were combined factorially with four per- 
centages of KR and three levels of difficulty. 

A task originally developed by Howell and 
Funaro (1965) provided a useful vehicle for 
the study of KR variables. In this task, the 
display consisted of two meters (A and B) 
upon which could be presented sample num- 
bers (readings) drawn from two distinct 
number populations. On each trial a sample 
number was drawn from both A and B, but 
only one of these was displayed. The S was 
required to guess whether or not it was a 
higher number than that sampled from the 
other (not shown) population. These stimulus 
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readings were displayed alternately on the 
two meters. Thus, as more and more stimulus 
readings were presented, S could gain increa- 
sing familiarity with the distributions of num- 
bers from which both the A and B readings 
were drawn. He could gain even more in- 
formation about these distributions, however, 
to the extent that KR was provided and he 
was able to use it. Difficulty of discrimina- 
tion could easily be varied by manipulating 
the differences between the A and B popula- 
tions. 

The Howell and Funaro (1965) study in- 
dicated that S comes to establish certain 
numbers as cutoff points for each stimulus dis- 
tribution such that whenever a reading ap- 
pears which is below that cutoff point, his 
choice is the alternate meter. The present 
investigation examined the effects of percent- 
age of KR (0%, 33%, 67%, 100%) and 
specificity of KR on S’s choice of cutoff points 
for three levels of input distribution discrimin- 
ability. Specificity of KR was included at two 
levels by either permitting S to see both the 
A and B readings after his response to one 
of them (specific feedback) or by providing 
him simply with right-wrong information 
(nonspecific feedback). 


MetHop 


Subjects. The Ss were 210 male undergraduate 
students enrolled in an introductory psychology 
course. They were divided at random into 21 groups 
of 10 each. Each S served for 200 trials. 

Apparatus. The S was seated at a table in front 
of the display and response keys. The room was 
completely separated from the experimenter (E£). 
This display consisted of a pair of microammeters 
with scale markers reading from 0 to 200 microamps 
in units of 5. The meter on the left was labeled 
“A” and the one on the right “B.” The E was 
able to adjust and present the readings on either 
dial by adjusting potentiometers. A pair of switches 
located beneath each dial was used by S in making 
his choice of dials. The response activated an 
A or B light in £’s booth and the choice was then 
recorded manually. Another set of switches permitted 
E to present feedback by activating either a green 
light to signify a correct choice, a red light to 
signify an incorrect choice, or a combination of the 
appropriate light and the reading on the alternate 
meter (i.e., whichever one was not used to present 
the stimulus on that trial). 

Independent variables. The independent variables 
investigated in this study were: (a) specificity of 
feedback, (b) percentage of feedback, and (c) dif- 
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ficulty of discrimination, or the relationship of the 
number distributions underlying the two meters. 

Specificity of feedback was manipulated by indi- 
cating to S either the correctness of his choice alone 
or the correctness of his choice plus the reading on 
the alternate meter. Percentage of feedback was 
varied by presenting feedback either 33%, 67%, or 
100% of the time. In order to assess more fully 
the effects of percentage of feedback, a 0% feed- 
back condition was also included, but as a control 
condition rather than as part of the overall design. 

The difficulty of the discrimination between the 
two populations was manipulated by varying the 
means of the two populations. Each population 
represented a normal distribution of numbers with 
an SD of 5 units. The range of each distribution 
was 32 units. The mean of the first population 
(Meter A) was 16 and the means for the second 
population (Meter B) were either 17, 19, or 21. 
This separation of 1, 3, or 5 units resulted in the 
following theoretical differences between the two 
distributions: For the 1-unit separation the pop- 
ulation with the larger mean was higher on 56% of 
the draws; for the 3-unit separation the population 
with the larger mean was higher on 67% of the 
draws; and for the 5-unit separation it was higher 
on 78% of the draws. The readings (numbers) to 
be used on any trial were drawn randomly and 
independently from the two populations; one num- 
ber from each population (A and B) constituted a 
trial. One restriction placed on the sampling pro- 
cedure was that there be no tied readings; when- 
ever this occurred, that pair was discarded and a 
new pair was drawn. The sampling procedure was 
carried out by a computer program which also 
supplied data sheets with all pairs of readings 
listed for E’s use in manual presentation of the 
readings. 

Procedure. The S was seated in front of the dis- 
play and corresponding response keys. He was in- 
structed that the two meters represented two distinct 
normal distributions and that each trial consisted of 
a random draw of two numbers, one from each of 
the two distributions. He was told that one number 
would be presented to him and the other number 
would not, and that after each trial both numbers 
would be returned to the original populations. He 
was also informed of the type and percentage of 
feedback under which he would be operating, and 
the manner in which he was to respond. On each 
trial, E set the value of the potentiometers and 
presented the reading on one of the meters. After 
S responded, E recorded the response and then pre- 
sented the feedback required by the experimental 
condition. On successive trials the value presented 
appeared alternately on Meters A and B so that S 
would have equal opportunities to sample both 
meters. Each presentation and subsequent recording 
of the response took approximately 10 sec. Each S 
was required to make 200 choices in a single session 
which lasted approximately 1 hr. After every 50 
trials, summary feedback was provided and a 2-min. 
rest period was introduced. 
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For this task, S could optimize his predic- 
tion by choosing A when the presented value 
on A was higher than the average value for 
B or when the presented value on B was 
lower than the average for A. On all other 
occasions he should pick B. To determine how 
closely he approximated this strategy, it was 
necessary to calculate the empirical cutoff 
points at which S switched his prediction from 
the presented value on one dial to the un- 
known value on the opposite dial. The point 
chosen for analysis in the present study was, 
for the A-distribution cutoff, the point midway 
between the highest B value at which A was 
chosen and the lowest B value at which B was 
chosen; for the B distribution, it was midway 
between the highest A value at which B was 
chosen and the lowest A value at which A was 
chosen. 

In addition to cutoff points, the more usual 
dependent variable of correct response fre- 
quency was also tabulated. Since the latter 
scores merely confirmed everything shown by 
the former, they were omitted from the pres- 
ent report. 

Average cutoff points obtained under the 
three levels of input discriminability are 
shown as deviations from the true distribu- 
tion means in Figure 1. It is apparent that 
performance generally improves over blocks 
of trials; that is, the average cutoff points 
show progressively less deviation from the 
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Fic. 1, Average discrepancy between empirically 
determined and theoretically optimum cutoff points 
for three levels of input discriminability. 
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1134) = 14.00, p < .001. Also apparent from 
Figure 1 is the fact that as the two meter 
distributions become more discriminable, the 
amount of improvement in performance in- 
creases; that is, when the two distributions 
are widely separated, performance improves 
much more noticeably over blocks than when 
the two distributions are closely aligned. 
This significant interaction of trial blocks with 
discriminability, (14/1134) =5.85, p< 
.001, warrants further comment. The per- 
formance of the low-discriminability group 
is initially quite good and shows very little 
improvement after the early trials, appearing 
to approach an asymptotic value of 2.0 de- 
viation units during the third block of trials. 
At this discriminability level the two dis- 
tributions overlap enough to permit S to 
operate at a near-optimal level by adopting 
a single cutoff for the two distributions. He 
appears to establish such a cutoff point very 
early and, having little to gain by adopting 
a two-cutoff strategy, stabilizes on this value. 
At the higher levels of discriminability, on 
the other hand, adoption of a single cutoff 
value ‘must result in an appreciable reduction 
in score, and S wisely abandons this strategy 
in favor of the more rewarding double-cutoff 
approach. Obviously, however, a number of 
samples must be drawn before he is in a 
position to establish separate A and B cutoffs; 
thus, it is not surprising that he initially per- 
forms rather poorly but shows rapid and 
marked improvement as he gains information 
upon which to base a discrimination. This 
general improvement over blocks and the 
obvious deviation, especially in the higher dis- 
criminability level groups, from a single cut- 
off strategy serve to reaffirm the Howell and 
Funaro (1965) conclusions. 

Figure 2 summarizes the influence of per- 
centage of feedback on deviation scores. The 
obvious triviality of this effect is supported 
by the analysis of variance which indicated 
nonsignificance (fp > .05) for percentage of 
feedback (33%, 67%, and 100%) and all 
its interactions. The 0% feedback condition 
was not included in the analysis, but it is 
apparent from Figure 2 that the 0% condi- 
tion does not differ from any of the other 
conditions. It seems especially surprising that 
the 0% feedback group was able to approxi- 
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Fic. 2. Average discrepancy between empirically 
determined and theoretically optimum cutoff points 
for the four levels of percent feedback. 


mate an optimal cutoff strategy as well as the 
100% feedback group. It is important to 
note, however, that the 0% feedback group 
does receive information about each distribu- 
tion through the alternate presentation of 
readings on Dial A and Dial B on successive 
trials; this information, if used, is adequate to 
permit inference of an optimum strategy. 

The conclusion to which one is led by the 
percentage of KR results, then, is that, con- 
trary to the growing weight of evidence in 
support of a critical 20-33% KR level, KR 
can indeed be completely superfluous in 
maintaining performance. Ammons (1956) 
noted that in most tasks S can find some 
guideline to use in establishing a response 
criterion. If the stimulus situation actually 
contains enough information to enable him 
to perform optimally, the response criterion 
which he sets for himself may be equally as 
effective as one administered externally. 
Such, of course, was the nature of the present 
situation: information provided by the in- 
structions and the alternation procedure seems 
to have been utilized maximally without the 
aid of KR. 

Since the information available in KR 
seems generally to have been unnecessary for 
optimal task performance, it is not surprising 
that predicted interactions of specificity and 
difficulty with percentage of KR failed to 
materialize. This finding strengthens even 
further the conclusion that KR can be a 
very impotent directive factor; even when 
the information is highly specific and task 
difficulty is very high, it matters little how 
much feedback is provided. 


EFFECTS OF FEEDBACK 


A seemingly paradoxical finding arises, how- 
ever, in connection with KR specificity. The 
fact that KR is unnecessary, as shown by the 
percentage of KR data, suggests that S com- 
pletely ignores this source of information. 
Figure 3 indicates that this is definitely not 
the case. That he can and does use KR in- 
formation to improve performance is indicated 
by the fact that specific feedback conditions 
were more effective than nonspecific feedback 
conditions in guiding him toward the optimal 
cutoffs. Moreover, Figure 3 indicates that 
the cutoffs established on Meter A were much 
nearer the optimal cutoff than were the cut- 
offs established by the same feedback condi- 
tion groups on Meter B (this would not occur 
if S were ignoring the information com- 
pletely). The analysis of variance supported 
both of these apparent differences: both the 
feedback specificity effect, F(1/162) = 7.67, 
@—eot and the meter effect, #(1/162) = 
11.51, p < .001, were highly significant. The 
latter finding seems to indicate that Ss were 
focusing more of their attention on the left 
dial than they were on the right dial for 
reasons that are still unexplained. 

It must be concluded, therefore, that al- 
though S will rely to a degree upon KR 
information when it is available, he will bene- 
fit from it only to the extent that it represents 
some critical level of specificity. In the ab- 
sence of KR, he seeks other sources of guid- 
ance which, it would seem, may be equally 
as useful in directing behavior. This conclu- 
sion supports the notion of a critical region of 
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Fic. 3. Average discrepancy between empirically 
determined and theoretically optimum cutoff points 
for specific and nonspecific feedback conditions 
plotted separately for the two meters. 
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specificity for KR (e.g., Ammons, 1956; 
Annett, 1961), and also demonstrates man’s 
resourcefulness in gathering information upon 
which to base future responses. 

The latter point is of particular significance 
for real-life situations in which KR is either 
difficult or expensive to obtain; in such situa- 
tions it might be well to review the entire 
informational picture and to consider alterna- 
tive ways of distributing information-gather- 
ing resources. Such an analysis might well 
suggest a deemphasis of KR relative to other 
sources of information. 

A final point should be made regarding 
the implications of these data for real-life 
situations. While the main emphasis of this 
report is upon KR as information, it might 
be argued that KR is valuable from a motiva- 
tional standpoint as well. If true, one would 
expect that percentage of KR would have 
some influence upon level of performance; 
in the present task, at least, such was not 
the case. Perhaps if greater value were at- 
tached to correct responses, a percentage of 
KR effect might emerge, serving as evidence 
for a motivational influence. A preliminary 
test of this hypothesis, using monetary pay- 
offs as incentive, also failed to produce such 
an effect. 

In summary, it appears that some tradi- 
tional concepts regarding the role of feedback 
in establishing and maintaining performance 
may require revision. In the present investi- 
gation, Ss without extrinsic feedback main- 
tained a performance level similar to Ss re- 
ceiving continuous feedback. Exactly what the 
groups with 0% feedback learned which en- 
abled them to perform as well as a 100% 
feedback group remains unanswered despite 
the hypothesis advanced concerning the alter- 
nation procedure. Conceivably, the instruc- 
tions which Ss received supplied sufficient 
information to make the addition of extrinsic 
feedback completely unnecessary. 
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RELATIONSHIP BETWEEN PERCEPTUAL STYLE AND 


DRIVER REACTION TO AN EMERGENCY 
SITUATION * 


GERALD V. BARRETT axnp CARL L. THORNTON 2 


Goodyear Aerospace Corporation, Akron, Ohio 


Previous investigators have found low relationships between human char- 
acteristics and accident behavior. The possibility was raised that these findings 
might be due to a number of factors including indiscriminate grouping of 
accidents and choice of predictors which have no logical relationship to 
accident behavior. In the present study 20 drivers were subjected to a con- 
trolled emergency situation in an unprogrammed automobile simulator, where 
a pedestrian (dummy) emerged from a shed into the path of the vehicle. 
Subsequent measurements of individual perceptual style, using the Rod and 


Frame Test (RFT), were correlated with effectiveness of reaction to the 


emergency situation. Series 3 


of the RFT correlated .67 with reaction time, 


.75 with deceleration rate, and .50 with a hit-miss criterion. Individual differ- 


ences in perceptual 


style and simulator discomfort resulted in 


shrunken 


multiple correlations of .89-.95 depending on the data transformations. The 
extrapolation of the results to real-world accident behavior has implications 
especially with regard to age, sex, and use of alcohol by drivers. 


In general, the obtained relationships be- 
tween various measures of driver character- 
istics and accidents have been quite low. 
Goldstein (1961), in an extensive review 
of research on human variables assumed to 
be important in safe motor-vehicle operations, 
found low relationships between various pre- 
dictors and accident behavior. He noted, for 
example, in over 45 studies that the correla- 
tions between criterion measures and visual 
functioning (visual acuity, depth perception, 
eye-muscle balance, etc.) were never above 
.20. Similarly, correlations were no higher 
than .17 between reaction time (RT) and ac- 
cident criteria in four studies using large 
samples (331-3,000 drivers). The use of 
psychomotor apparatus, paper-and-pencil psy- 
chomotor tests, sensory-perceptual tests, and 
cognitive measures shows correspondingly low 
correlation with accident criteria (Goldstein, 
1962). 

Two points should be noted concerning 
studies attempting to validate predictors of 

1 The emergency-behavior data used in this study 
were collected under Public Health Service Con- 
tract No. PH-108-64-168 while the collection of 
perceptual style data was supported by Goodyear 
Aerospace Corporation. 

2The authors express appreciation to Margaret 


J. Smith for statistical advice and to Patrick A. Cabe 
for computer analysis of the data. 
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accident criteria. First, the investigators have 
used very heterogeneous measures of driver 
accident behavior. There appears to have been 
little attempt to categorize or logically group 
accidents. A tacit assumption seems to be 
that all accidents have a common cause and 
therefore can be grouped together. In op- 
position to this point of view, one could 
reason that accident behavior is quite diverse 
and complex, having many causes, and only 
by an appropriate categorization of accident 
behavior would it be possible to obtain a 
better understanding of the phenomenon. 
Second, the predictors chosen often fail to 
have any theoretical or logical relationship 
to the accident behavior which is to be pre- 
dicted. The failure to categorize accidents 
logically and/or to choose a predictor which 
has a theoretical relationship to the category 
of accidents may account, to a large extent, 
for the low correlations which have been re- 
ported. 

An attempt to eliminate the category prob- 
lem was made by Barrett and Nelson (1965b, 
1966b). They tested driver reaction to what 
could be considered one category of accidents, 
a situation where the driver was forced to 
make an emergency stop because a pedestrian 
emerged suddenly into the vehicle’s path. 
This emergency situation was staged while 
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the driver was operating an unprogrammed 
automobile simulator. 

The present study attempted to find a pre- 
dictor which would have a logical relationship 
to the emergency responses. An analysis of 
the situation indicated that the main tasks 
were to detect and identify the pedestrian. 
This appeared to be related to Witkin’s 
concept of perceptual style (Witkin, Dyk, 
Faterson, Goodenough, & Karp, 1962). Witkin 
classified individuals on a_ field-dependent— 
field-independent continuum, along which in- 
dividuals may be differentiated with respect 
to their ability to pull a figure from an 
embedding context, with field-independent 
people being better able to perform the task. 
Sudden pedestrian emergence into the field 
of view presented a figure in an embedded 
context. 

Since the behavior required of the subject 
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(S) in the simulated driving situation ap- 
peared to be related to perceptual style from 
this logical viewpoint, it was hypothesized 
that the field-independent individuals should 
be more effective in reacting to the emergency 
driving situation than would field-dependent 
individuals. 


METHOD 


Automobile Simulator 


An engineering test model of an unprogrammed 
automobile simulator was the basic research tool 
used to obtain drivers’ reactions to an emergency. 
Figure 1 shows the arrangement of the simulator in 
the test area with the horizon panels around the 
terrain model and the black-out partition between 
the terrain model and the screen removed. The 
terrain model was an 87:1 (HO gauge) scale rep- 
resentation of several flat roads including a wide 
road beside which a small shed was located. This 
shed contained a pedestrian dummy, scaled to 5.5 ft., 
which could be released onto the roadway when a 





Fic. 1. Automobile simulator, in projected image display mode, with horizon panels 
and black-out partition removed. 
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microswitch was tripped by the television pickup 
carriage. A television camera mounted on a trans- 
port system viewed the terrain model from above 
through a special lens assembly and moved in 
direct response to the movements of the auto- 
mobile’s control elements. 

A projected-image visual display was used with 
half the Ss and a virtual image display with the 
remainder. Both visual displays gave the driver 
approximately a 50° horizontal angle of view. For 
the purposes of this study the two displays were 
considered to be functionally equivalent. With each 
visual display the driver sat in the automobile and 
performed the usual control movements associated 
with driving. A more detailed description of the 
simulator and associated visual displays has been 
reported by Barrett and Nelson (1965c, 1966a). 

During the emergency trial a continuous measure- 
ment of speed, time, brake position, steering-wheel 
position, lateral position of automobile, longitudinal 
position of automobile, and pedestrian position 
was recorded. From these measures, effectiveness 
of driver response to the emergency was obtained. 


Rod and Frame Apparatus 


Perceptual style was measured with the standard 
Rod and Frame Test (RFT) described by Witkin, 
Lewis, Hertzman, Machover, Meissner, and Wapner 
(1954). The apparatus consisted of a luminous 
square frame (40 in.), pivoted at its center so 
that it could be tilted left or right. Pivoted at the 
same center, but moving independently of the frame, 
was a luminous rod (39 in.). 

The S was seated in a chair 8 ft. from the rod 
and frame that could be placed in three positions: 
erect, tilted 28° to the left, or tilted 28° to the 
right. The above apparatus was situated in a 
completely darkened room and S wore goggles with 
dark lenses so that only the rod and frame could 
be seen. 


Subjects 


A total of 50 male Ss between the ages of 30 
and 45 were selected at random from approximately 
1,200 employees in an engineering division of an 
aerospace corporation. Twenty-six Ss left the simu- 
lator because they experienced extreme discomfort 
(simulator sickness). The data for two Ss were 
not used because of an error in procedure, and the 
data for one S were not used because he became 
aware of the purpose of the study. The remaining 
21 Ss completed the emergency-behavior trials. 
One S was not available for subsequent testing 
with the RFT so that the results in this study 
were based upon the driving behavior and perceptual 
style responses of 20 Ss. 


Procedure for Emergency-Behavior Study 


The same procedure was followed for all 20 Ss 
consisting of three orientation drives around the 
terrain model and a pretrial run for a study of 
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driving at requested speed without knowledge of 
apparent speed. The complete procedure for the 
speed study has been described by Barrett and Nel- 
son (1965a). 

Upon making the tenth and final speed judgment, 
Ss followed the same procedure as in the previous 
nine trials. They were not told the number of speed 
trials to be made nor that the last speed trial had 
been completed. The Ss made left U turns at the 
end of the roadway and proceeded down the middle 
of the right 8-ft. lane in the emergency-behavior 
test trial. As in the preceding trials, the device 
operator monitored speed at the control panel and 
either instructed the driver to maintain, increase, or 
decrease his speed so that a speed of approximately 
25 mph was maintained. The speedometer in the 
vehicle had been covered by tape at the beginning 
of the speed trials and remained covered for the 
emergency trial so that the instrument panel was 
not a distraction. 

While the driver was approaching the dummy, 
the experimenter (E) stopped giving speed instruc- 
tions. When the driver reached the release position 
(76.5 ft. equivalent from front bumper to the shed) 
the pedestrian emerged at a controlled rate from 
the shed and the driver’s responses were recorded. 


Procedure for Measuring Perceptual Style 


Approximately 6 mo. after the data had been 
collected for the emergency-behavior study, the 
perceptual style of Ss was measured with the Rod 
and Frame Test (RFT). Measurement followed the 
standard procedure developed by Witkin et al. 
(1954) consisting of three series of eight trials 
each. Series 1 consisted of trials in which S was 
tilted 28° in the same direction as the tilt of the 
frame; Series 2 was trials in which S was tilted 28° 
in the direction opposite that of the tilt of the 
frame; Series 3 consisted of four trials with the 
frame tilted 28° to the right and four trials with 
the same tilt to the left while S remained erect. 
The S’s task was to position the rod to what he 
considered to be true vertical by asking E to move 
the top of the rod right or left. The S’s score was 
the mean degrees error on each series. In addition to 
mean scores on each of the three series, an Orienta- 
tion Index was computed from the standard scores 
of the three series as outlined by Witkin et al. 
(1954). 


Procedure for Measuring Discomfort in the 
Simulator 


Since a certain degree of discomfort was ex- 
perienced even by those who completed their trials 
in the driving simulator, two measures of discomfort 
were obtained after Ss completed the RFT. The 
first was a simple graphic rating scale on which 
each S rated his perceived discomfort, from no 
discomfort (0) to extreme discomfort (10). The 
second was S’s estimate of how long after he left 
the simulator he still felt discomfort. 
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TABLE 1 


INTERCORRELATIONS OF EMERGENCY BEHAVIOR WITH RFT MEASURES 
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The results of the perception test (RFT) 
were compared to (1) initial brake reaction 
time, that is, the time which elapsed from the 
emergence of the dummy until S began to 
depress the brake pedal; (2) deceleration 
rate, that is, the miles per hour per second 
which the apparent motion of the simulator 
was decreased by S as measured from the 
time of emergence of the dummy to either 
subsequent impact with the dummy, or the 
obtaining of zero velocity before collision; 
and (3) the hit-miss dichotomy found in 
most real-world studies. Series 1, 2, 3, and 
the Orientation Index from the RFT were 
compared with these three criteria. 

The results confirmed the hypothesis that 
perceptual style was significantly related to 
the ability to react to emergency situations. 
However, only one aspect of the RFT cor- 
related with the criteria. 

Table 1 shows that correlations with Series 
1, 2, and Orientation Index were not signifi- 
cantly related to emergency behavior. Differ- 
ences between these aspects of the test and 
Series 3 will be discussed later in connection 
with Witkin et al.’s (1954) findings. 

The correlation (Pearson 7) between Series 
3 of the perception test and reaction time in 
the simulator was .61 (p < .01). The decelera- 
tion rate (ry = —.55, p < .01) was also sig- 
nificantly related to Series 3. However, the 
relationship between the hit-miss dichotomy 
and Series 3 was only .34 (nonsignificant) 
using a point-biserial 7. This latter finding 


measurement used in this study. 

Inspection of the data revealed that the 
relationships were not linear. It became evident 
that the empirical data were more nearly 
approximated by a log function for reaction 
time (see Figure 2) and a hyperbolic function 
for deceleration rate (see Figure 3) and the 
hit-miss dichotomy. Correlations increased 
in magnitude from .61 to .67 for reaction 
time, from .55 to .75 (p < .001) for decelera- 
tion rate, and from .34 to .50 (p< .05) 
for the hit-miss dichotomy. A single per- 
ceptual measure, then, was able to account 
for 25-56% of the individual differences in 
ability to react to the emergency situation. 

It was apparent from the data analysis 
that discomfort, due to the experience of 
operating the simulator, affected the results. 
Multiple regressions were calculated using a 
Wherry Test Selection Program with F test 
(Wherry & Smith, 1966) for the IBM 1620 
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Fic. 2. Relationship between perceptual style 
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Fic. 3. Relationship between perceptual style 
and deceleration rate. 


computer. It was found that discomfort was 
not a significant factor in initial reaction time, 
but was quite important for deceleration rate. 
Multiple Rs obtained ranged from .89 to .95 
(p < .001) depending on the particular data 
transformations involved. Table 2 shows the 
1 
multiple Rs obtained using values of x, x”, x, 
and log x as units representing various curv- 
ilinear functions that could yield a data fit. 
Two factors, perceptual differences and simu- 
lator discomfort, accounted for 79-90% of 
the variance in deceleration rate of the 
simulator. 
As mentioned previously, high correlations 
were found between emergency behavior and 


TABLE 2 


VARIABLES APPEARING IN THE MULTIPLE-REGRESSION 
Equation Usinc Various Data TRANSFORMA- 
TIONS—DECELERATION RATE AS RELATED TO 
PERCEPTUAL STYLE AND DISCOMFORT 
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Note.—S1 = Series 1 of the Rod and Frame Test (RFT); 
Se = Series 2 of the RFT; .S3 = Series 3 of the RFT; D = rating 
of subjective discomfort during simulator operation; I4 = esti- 
mate of the length of time the discomfort lasted after leaving 
the simulator. 

*p <.001. 
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Series 3 only. Previous research (Witkin 
et al., 1954) showed that the RFT was not a 
unitary test. Series 1 and 2 were found to 
be similar, while Series 3 (no body tilt) 
tapped a different dimension of perception. 
Embedded Figures Test (pulling an 
item from an embedded context), often used 
as a measure of perceptual style, was found 
to be most highly related to Series 3 and 
not to Series 1 and 2. Witkin’s perception 
tests were found to be on a difficulty con- 
tinuum. The Embedded Figures Test and 
Series 3 were the easiest of the perceptually 
embedding tests. Since at the time of the 
experiment it was not known how the driving 
task would compare to the difficulty levels of 
perceptual style, all three series of the RFT 
were given. 


DiscuUssION 


The present results support the hypothesis 
that field-independent individuals are more 
effective in responding to emergency situa- 
tions. The finding of high relationships, ac- 
counting for approximately 80% of the vari- 
ance, was probably a function of a number of 
interrelated factors. 

Five such factors were considered to be 
important in obtaining significant results in 
this type of accident research: the studying 
of only one specific type of accident, using an 
improved research tool to study a specific 
behavioral sequence, unitary exposure to the 
emergency sequence, the continuous measure- 
ment of relevant criteria, and the utilization 
of a predictor with a logical relationship to 
the criterion. 

The importance of studying one type of 
accident is shown by noting that such ac- 
cidents as rear-end collisions, loss of control 
on turns, accidents during rain or snow, and 
accidents related to mechanical malfunction 
are usually grouped together. They may have 
completely unrelated human characteristics 
associated with incidence and severity of 
outcome. Also, the classification of real-world 
accidents often does not take into account 
the problem of human culpability. The tacit 
assumption is that an accident is the driver’s 
fault. The term “accident” is often a legal 
designation rather than a description of a 
behavioral act. It is quite possible that 
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having an accident by the legal definition 
is the most effective driving behavior (e.g., 
hitting a parked car to avoid a child). 

Significant findings have often been the 
result of improved research tools. Looking at 
a specific structure rather than just a global 
process has made it possible to confirm or 
refute many theoretical issues. By using an 
automobile simulator, control and continuous 
measurement of pertinent variables in an 
emergency situation were possible. Also, the 
simulated situation presented every driver 
with the same emergency, giving constant 
exposure across Ss. This is not to imply that 
the simulator used in the present investigation 
is not without flaws, especially as evidenced 
by the discomfort experienced by the majority 
of Ss. 

The predictor in this study had a logical 
relationship to the criterion behavior. Often 
simple brake reaction to a red light is cor- 
related with accident records (e.g., Cation, 
Mount, & Brenner, 1951) where there is 
little correspondence between the braking 
response elicited by the red light and the 
complex perceptual process in the real world. 


Related Real-World Research 


In a review of the literature, one real-world 
study was found which appeared to be re- 
lated to the behavioral sequence analyzed in 
this experiment. An attitude scale, picture- 
frustration test, Raven Progressive Matrices, 
and a visual perception test were administered 
to accident repeaters and accident-free drivers 
(Spicer, 1964). Only the visual perception 
test, motion picture films representing a wide 
variety of traffic conditions, was consistently 
related to driving and traffic mishaps. The 
Ss checked statements concerning each of 11 
film sequences to indicate what he felt was 
important to the driver of the car. This test 
might have some relationship to the concept 
of perceptual style since S was required to 
extract the salient information from the film 
strip in a short period of time (35 sec.). 

It was interesting to note that adolescents 
did not perform as well as older drivers on 
the visual perception test. This could have 
been a function of either inexperience and 
therefore lack of awareness of vital cues in 
driving or field dependence and consequent 
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lower ability to pull a figure from an em- 
bedding context. This point will be discussed 
later in more detail. 


Alternate Theoretical Explanations of Results 


A parsimonious explanation of the results 
would include a description of perceptual 
abilities, although perceptual style is also 
related to cognitive style (Witkin et al., 
1962). Field-independent individuals tend to 
think more analytically and critically. Pos- 
sibly, field-independent individuals reacted 
faster because they already had a preformed 
set to respond, thinking out in advance how 
they would react to an emergency situation. 
In contrast, the field-dependent person, being 
less analytical, may never have given any 
consideration to how he would react. 

Witkin et al. (1962) have reported that 
field-dependent individuals become confused, 
disorganized, and inadequate under stress. 
Since confusion could also be a factor in 
emergency situations, there apparently are a 
number of possible alternative explanations 
using other correlates of perceptual style 
which could account for the less effective be- 
havior of the field-dependent Ss. 


Extrapolation of Results to Real-World Ac- 
cident Situations 


Three complex real-world driving behaviors 
can be explained on the basis of the results of 
this investigation and other related perceptual 
style research. First, it has been well estab- 
lished that alcohol is a factor in many ac- 
cidents (Fox & Fox, 1963). Witkin et al. 
(1962) report that perceptual style, as meas- 
ured specifically by S’s ability to visually 
extract a figure from an embedding context, 
is affected by alcohol. It is conceivable there- 
fore that while under the influence of alcohol 
a driver would find it harder to extract an 
emerging pedestrian, vehicle, or other obstacle 
from the complex visual field. This could be 
one factor in explaining why drivers under 
the influence of alcohol are involved in a 
high percentage of accidents. 

Second, studies are consistent (Goldstein, 
1962) in showing that young drivers (below 
25) and older drivers (beyond 65) have a 
disproportionately high rate of accidents. It 
has also been shown that perceptual style is 
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a function of age. In a number of cross- 
sectional and longitudinal studies (Witkin 
et al., 1962) it has been determined that 
individuals become more field independent 
as they grow older, with an asymptote around 
21. Since young drivers (16-21) are relatively 
field dependent, this might be a factor in 
their higher accident involvement. While the 
younger drivers as a group have a faster 
simple reaction time, the real-world situation 
involves a complex perceptual element and 
the intruding object must first be recognized 
before an individual can respond to it. 
Similarly, recent evidence (Schwartz & Karp, 
1967) indicates that older individuals are 
field dependent. 

Third, previous research (Witkin et al., 1962) 
has shown that women as a group are more 
field dependent than men. On that basis one 
would predict that women drivers who are 
presented with an emergency driving situa- 
tion would be less effective in responding than 
men. One real-world study has a_ possible 
bearing on this point. Uhr (1959) found that 
women were more likely to cut across a motor 
scooter’s path from a stop street than were 
men. It is possible that the women were un- 
able to perceptually extract the small 
scooter from the background as quickly as 
the men. 

It appears feasible to extrapolate the 
results of the present study to several other 
real-world situations. Any occasion where an 
individual must quickly detect, identify, and 
properly react to a stimulus could be related 
to perceptual style. Many examples could 
be given such as sudden emergence of a child 
from between parked cars, the swerving of an 
automobile into the driver’s projected path, 
or the emergence of an automobile from an 
alley or hidden driveway. The generality of 
these results, however, must await future 
research. 

Past research with the Embedded Figures 
Test (EFT) and the RFT may also be 
brought to bear on practical traffic problems. 
It is known that various factors decrease 
one’s ability to pull an embedded item from its 
context, such as alcohol, fatigue, and brain 
damage. Besides these factors, which may 
cause severe decrement in performance for 
anyone, there are subtle genetic and environ- 
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mental factors which cause a wide range of 
individual differences even under optimal 
testing (or driving) conditions. For this reason 
the EFT and RFT when properly validated 
could be used to screen large populations of 
drivers who either are planning to enter or are 
presently employed in an occupation which 
requires a great deal of driving (such as 
truck, bus, and taxi drivers, or traveling 
salesmen). 

Since it is not now feasible to restrict 
field-dependent people from driving, training 
could be established for those who find 
driving to be an occupational necessity, but 
are extremely field dependent. Perception, 
per se, is resistant to training. It is, however, 
possible to “train around” certain deficits. 
That is, it is possible to identify perceptual 
deficits and teach people to compensate for 
them. One such possibility is to train field- 
dependent persons to respond completely 
(i.e., reach full brake depression), as soon 
as they detect the emerging image. In this 
manner, the lag time between detection and 
identification may be utilized in decelerating 
the vehicle. This amounts to telling the 
driver to “be careful,” which many slogans 
do, but in a very specific way. 

In conclusion, this discussion illustrates one 
of the main advantages of physical stimulation 
of the driving task. Besides factors such as 
exact experimental control and safety, con- 
firmation of hypotheses which could not be 
easily evaluated in the real world is possible. 
Further research, of course, is required to 
confirm the hypothesized functional relation- 
ships between perceptual style and human 
performance in specified emergency situa- 
tions. Also, a replication of this study in the 
real world should be conducted, especially in 
light of the restriction in sample imposed 
by the “simulator sickness.” There is now, 
however, some empirical evidence which 
gives a basis for a number of specific in- 
vestigations. 
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OF BASIC MAINTENANCE TASKS 


RICHARD J. SHAVELSON anp JOSEPH L. SEMINARA 1 
Lockheed Missiles and Space Company, Sunnyvale, California 


9 Ss were trained extensively on 3 maintenance tasks: bolt torquing, con- 
nector mating, and nut threading. They were randomly distributed into 1 of 
3 clothing conditions (shirt-sleeve, vented suit, and pressurized suit) and 
trained and tested on all 3 tasks in 3 gravity conditions (1 gravity, 1 gravity 
harnessed, and 4% gravity). This study demonstrated that lunar gravity 
imposed a 25% performance decrement over performance in 1 gravity (p < .01). 
The vented suit imposed a 60% performance decrement (p < .01), and the 
pressurized suit imposed a 150% performance decrement (p < .01) when 
compared to performance in the shirt-sleeve mode. On the basis of these 
findings and subjective reports, preliminary human-factors design criteria 
were suggested for lunar-gravity performance aids. The need for subsequent 
research in the areas of mission-specific maintenance tasks and candidate 
job aids to improve performance in the lunar environment was pointed out. 
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EFFECT OF LUNAR GRAVITY ON MAN’S PERFORMANCE 


Past research, employing a variety of simu- 
lation methods, demonstrated that man’s be- 
havior in lunar gravity differed considerably 
from his behavior in the earth’s gravity. 
Hewes? and Spady (1964a, 1964b) found 
that man could jump vertically as high as 14 
ft. in lunar gravity and perform other ordi- 
nary tasks in an extraordinary manner. 
Roberts (1963) investigated man’s walking 
behavior in lunar gravity which he described 
as a fast walk in slow motion. Prescott and 
Wortz* (1966) and Hazard (1965) showed 
that, during walking, man’s metabolic rate 
decreased in 4% gravity when compared 
to his 1-gravity rate. Although there has 
been a comprehensive examination of man’s 
physical and physiological behaviors in lunar 


1 The authors wish to acknowledge the assistance 
of G. F. O’Keefe, C. J. Baker, and A. L. Hall in 
providing life support, communications, and technical 
advice for pressure suit research, and R. E. Mahan, 
simulation engineer. 

2D. E. Hewes, A. A. Spady, Jr., & R. L. Harris. 
Comparative Measurements of Man’s Walking and 
Running Gaits in Earth and Simulated Lunar 
Gravity. NASA Tech. Note D-3363. Unavailable. 

3E. C. Wortz & E. J. Prescott. The Effects of 
Subgravity Traction Simulation of the Energy Costs 
of Walking. Personal communication, 1965. Pp. 2-26. 


gravity, a few studies have concerned them- 
selves with man’s performance of maintenance 
tasks. Holmes (1965) studied the effectiveness 
of a variety of tools on maintenance tasks in 
various gravity conditions including lunar 
gravity. Using two subjects (Ss) for his 
lunar-gravity simulation, he found that per- 
formance times increased by 8% over per- 
formance in the earth’s gravity. However, due 
to the paucity of Ss, the large number of 
tasks under examination, and the lack of 
statistical treatment, Holmes’ findings were 
considered tentative, at best. 

The purpose of the present research was 
to develop data regarding the effects of lunar 
gravity and candidate lunar clothing con- 
ditions (shirt-sleeves, vented suit, pressurized 
suit) on man’s performance of three basic 
maintenance tasks (bolt torquing, connector 
mating, and nut threading) to serve as guide- 
lines for task analyses, timelines, and per- 
formance-aid designs. It was hypothesized 
that performance times in 1% gravity would 
increase over those in 1 gravity, that per- 
formance times would increase with increas- 
ingly encumbering clothing requirements, and 
that the connector-mating task would be the 
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least affected by the gravity or clothing- 
condition variables. 


METHOD 
Subjects 


Nine male Ss participated in this study. The Ss 
were selected from a pool of “astronaut” test Ss 
maintained by Lockheed Missiles and Space Com- 
pany’s (LMSC) Biotechnology Organization. The 
Ss fell between the ages of 23 and 36, all were engi- 
neers or physical scientists, all were under 6 ft. in 
height, and all had performed as a pressure-suited 
test S in previous simulation research. 


Apparatus 


The effect of JIunar gravity was achieved by 
using LMSC’s 14-gravity simulator, LUNARG. This 
unit, which supports five-sixths of the S’s weight, 
consisted of nine negator spring motors attached to 
a cable reeling system. The S was attached to the 
aforementioned cable via a harness. Fine balancing 
of plus or minus 1 lb. accuracy was achieved by 
placing a skin diver’s weight belt around S’s waist. 

The space suit employed in this study was an 
Apollo A-4H training suit, on loan from NASA, 
Houston. The suit consisted of a Model C3 helmet, 
a torso-limb assembly, gloves, and astro-cap. 

Since the Apollo A-4H pressure suit had a helmet 
without a visor, the vented-suit condition was de- 
fined as helmet and gloves on with an air ventiliza- 
tion flow of 10 cu. ft. per minute. The pressurized- 
suit condition was defined as suit pressurization 
at 3.7 +2 psi differential with air. The clothing 
condition referred to as shirt-sleeve was defined 
as street clothing: shoes, socks, slacks, and shirt. 

The performance-fixture apparatus was a unistrut 
frame structure 94 ft. in height and 19 in. in width 
on which a uniform-sized panel, 54 in. in height 
X< 19 in. in width X % in. in depth, was placed. 
The panel contained three tasks: (1) bolt torquing, 
(2) connector mating, and (3) nut threading. On 
S’s left-hand side of the panel were nine threaded 
receptacles into which bolts 1 in. long with a %6 
of an inch hexagonal head were screwed and then 
torqued down to 70 in. lb. using a torque wrench. 
Nine identical Deutsch push-pull type connectors 
(% in. in diameter) were located in the middle of 
the panel. Twelve equally spaced bolts of three 
different sizes (14, %6, % in.) protruded from the 
right-hand side of the panel. Two of the nuts were 
hexagonal and fitted on the two larger bolts. The 
third was a small wing nut. Attached to the per- 
formance fixture at a height of 43 in., and extend- 
ing out to the right of the panel, was a contents 
tray (24 X 6% X 6% in.). During the experiment, this 
tray contained the torquing wrench and a predeter- 
mined number of each of the task elements. 


Design 


The experimental design employed in this research 
was a partial replication of an 81-celled 3X 3X3 
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factorial. Nine Ss were randomly assigned to one 
of three clothing conditions (shirt-sleeve, vented 
suit, or pressurized suit) with three Ss in each 
condition. All nine Ss, regardless of clothing con- 
dition, performed all three subtasks (torquing, con- 
nector mating, and threading) in all three gravity 
conditions (1 gravity, 1 gravity harnessed, and 
Yi, gravity). In order to minimize the effects of 
practice, Ss were tested in the principal condition, 
gravity, in a counterbalanced order. 

All Ss received task orientation and preliminary 
training in the shirt-sleeve, 1-gravity environment. 
They were then trained and tested in ‘their 
respective clothing conditions. 


Procedures 


The experiment was divided into two parts: (1) 
orientation and preliminary training, and (2) 
specific training and testing. The former part 
describes the instructions given to the Ss and the 
base-line training in the shirt-sleeve 1-gravity 
environment. The latter part describes training in 
specific clothing and gravity conditions and testing. 

Orientation and preliminary training. The Ss were 
instructed to perform each task separately, always 
working from panel top to bottom. They were told 
to work as quickly and as accurately as possible 
since both time and errors were being recorded. 
One exception to this procedure was necessary. 
During the course of the experiment, it was dis- 
covered that Ss participating in the pressurized-suit 
condition were unable to successfully complete all 
of the task elements on the performance panel. This 
was not due to the inability of Ss to reach all of 
the task elements, since for all Ss regardless of 
clothing condition the top of the panel was placed 
1 in. above their shirt-sleeve height, but rather 
to the amount of fatigue and the difficulty of manip- 
ulation in the extreme high and low positions. For 
example, although Ss were able to place the nine 
bolts into the panel, they were unable to torque the 
bolts in the highest and lowest positions with the 
torque wrench to the preset value of 70 in. lb. As 
a consequence, the work area of the panel was re- 
duced such that the pressure-suited Ss were required 
to complete the middle five torquing and connector- 
mating task elements, and the middle eight threading 
task elements. Average time to complete one task 
element was computed for all Ss on all tasks. 

At the completion of the instructions, Ss were 
given an opportunity to manipulate the three tasks 
until they were satisfied that they had fully under- 
stood the instructions. The Ss were then trained in 
the shirt-sleeve 1-gravity mode until an asymptote 
in time and errors was reached. This required about 
one-half to 1 hr.’s time. 

Specific training and testing. Specific training and 
testing proceeded first with the shirt-sleeve group, 
then with the vented-suit group, and lastly with 
the pressurized-suit group. In all three clothing con- 
ditions, Ss were trained to asymptote in time and 
errors in a particular gravity condition, and then 
tested immediately afterward. At the completion of 
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a test in a given gravity condition, Ss moved on to 
the next gravity condition, as described by the 
schedule of counterblanced orders, and were trained 
to asymptote and then tested. This procedure was 
carried out for all three gravity conditions. Time 
to perform each task, time between each task, and 
errors were recorded during each test. At the comple- 
tion of the testing, Ss were asked to fill out question- 
naires relating to performance in clothing and gravity 
conditions and performance of the various tasks. 


RESULTS 


The results reported in this section are 
divided into three parts: base-line training, 
Specific training and test, and questionnaire 
responses. 


Base-Line Training 


The mean number of shirt-sleeve, 1-gravity 
training trials to performance asymptote was 
8.89 with a standard deviation of 2.31 trials. 
The mean time to perform one torquing task 
element at asymptote was 11.37 sec. with a 
standard deviation of .81 sec. The mean time 
to perform one connector-mating task ele- 
ment at asymptote was 3.12 sec. with a 
standard deviation of .25 sec. The mean time 
to perform one threading task element at 
asymptote was 8.53 sec. with a standard 
deviation of .60 sec. It is important to remem- 
ber that the performance times reported refer 
to mean time to perform one task element, for 
example, the mean time to torque one bolt, 
instead of the total time to perform all nine 
bolt torques. A total of one error was com- 
mitted by the nine Ss at base-line training 
asymptote. 


Specific Training and Test 


All nine Ss received base-line training in 
the shirt-sleeve, 1-gravity mode. At the com- 
pletion of this training, Ss received specific 
training on all tasks in a particular clothing 
condition and in a given gravity condition. 
Since the gravity conditions were counter- 
balanced, individual Ss required different 
numbers of training trials for different cloth- 
ing conditions and different gravity sequences. 
The latter depended on the amount of transfer 
of training from base-line training. For ex- 
ample, training and test in 1 gravity had com- 
plete transfer to the 1-gravity-harnessed 
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condition while transfer from 1 gravity to 4% 
gravity showed much less transfer, depending 
on preceding specific training and _ test 
sequences. 

The results of this experiment are sum- 
marized in Table 1 and Figure 1 for per- 
formance times. In general, there was a sig- 
nificance in performance times between the 
three clothing conditions (p < .01). Further 
analysis of this finding revealed that perform- 
ance in the vented suit required more time 
than in the shirt-sleeve mode (p < .01). Per- 
formance in the pressurized suit required sig- 
nificantly more time than the vented suit 
(p< .01) and, therefore, required signifi- 
cantly more time than the shirt-sleeve con- 
dition. In terms of percentage of time increase, 
the vented suit showed a 63.61% increase 
over the shirt-sleeve mode. The pressure-suit 
condition showed a 127.70% increase over 
the shirt-sleeve condition and a 39.16% in- 
crease over the vented suit. 

The effect of the three gravity conditions 
was significant beyond the .01 level of con- 
fidence. An analysis of the specific com- 
ponents of this variable revealed no significant 
difference between the 1-gravity condition and 
the 1-gravity-harnessed condition. However, 
performance in the 14-gravity mode took sig- 
nificantly longer than in the 1-gravity- 
harnessed condition (p< .01), and signifi- 
cantly longer than in the 1-gravity condition. 
With respect to percentage of increase in time, 
the 1-gravity-harnessed condition showed a 
1.8% increase over 1 gravity. The %-gravity 
mode showed an increase in performance time 
of 23.52% over 1 gravity, and an increase of 
21.33% over the 1-gravity-harnessed con- 
dition. 

The difference in performance times for 
each of the three tasks was significant beyond 
the .01 level of confidence. Using a 1-degree- 
of-freedom test, it was demonstrated that the 
threading task took significantly longer than 
the connector-mating task (p < .01). Further, 
it was demonstrated that the torquing task 
took significantly more time to perform than 
the threading task (p< .01). In terms of 
percentage of time increase, the threading 
task showed a 192.24% increase over the 
connector-mating task. The torquing task 
showed a 267.18% increase over the con- 
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Fic. 1. Performance times by task for all gravity and clothing conditions. 


nector-mating and a 25.64% increase over 
the threading task. 

The interaction between clothing and 
gravity was not significant. This was inter- 
preted to mean that for every level of the 
gravity condition, performance in the clothing 
condition remained relatively constant.. In 
short, the %-gravity condition required 
greater time to perform each of the tasks 
regardless of clothing condition, and regard- 
less of clothing condition, 1 gravity and 1 
gravity harnessed required about the same 
time to perform. 

The interaction between clothing condi- 
tion and task was significant (p < .01). This 
was interpreted to mean that the difficulty 


of the tasks did not remain constant for every 
level of clothing condition. 

Specifically, the threading task when in the 
vented suit showed less of an increase in 
performance time over the shirt-sleeve condi- 
tion than did the torquing task in the same 
clothing mode (56.51% versus 76.68%). 
However, a comparison of performance times 
in the pressurized-suit mode revealed that 
the tasks were approximately equivalent for 
this condition (130.55% versus 138.48%). 
Further, the connector-mating task showed 
much less of a performance-time increase in 
the pressure-suit mode (87.53%) than did 
either the threading (130.55%) or the torqu- 
ing (138.48%) tasks. 
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TABLE 1 
STATISTICAL ANALYSIS OF PERFORMANCE TIMES 
Source OS af MS F 

Clothing (A) 1,349.31 | 2 674.65 | 26.92** 
Subjects within 

groups (B) 150.34) 6 25.06 
Gravity (C) 128.8350 2 64.42 | 26.40** 
Task (D) 2,630.99 | 2 | 1,315.50 | 539.14** 
DEX C 16.64] 4 4.16 1.70 
iN SE 1D) 349.69} 4 Saloon 
Cx) 30:21" 4 1S) 3.09* 
BEX iC 41.28 | 12 3.44 1.41 
BXD 100.39 | 12 8.36 S43re 
Nexus ID, LOG/ZaheeS 2.46 1.01 
Bee xX LD 58.57 | 24 2.44. 

*p <.05 

Ep <.01 


The interaction between gravity and task 
was significant (p< .05). Again, the diffi- 
culty of the tasks did not remain constant 
for all gravity conditions. The finding is sup- 
ported by the fact that the threading task 
in 14 gravity took 35% more time to perform 
than in 1 gravity. 

The interaction between Ss within groups 
and gravity condition was not significant. 
All Ss experienced a decreased efficiency in 4; 
gravity while performing approximately the 
same in either 1 gravity or 1 gravity har- 
nessed. However, the interaction between Ss 
and task was significant (p < .01). Different 
tasks were of varying difficulty for each S$ 
within a given clothing condition. Finally, a 
comparison of the intertask intervals re- 
vealed no differences between task, clothing, 
or gravity conditions. The mean interval 
was 15 sec. 

The term “error,” as employed in this re- 
search, referred to the frequency of occurrence 
of dropping any task element, or holding on 
to any part of the performance panel or 
structure framework while performing any of 
the tasks. There was no attempt made to 
assign relative weights according to the nature 
of the error. The Friedman two-way analysis 
of variance by ranks revealed that perform- 
ance as measured by errors did not differ 
significantly for any of the three main vari- 
ables, and there were no significant inter- 
actions. These findings were due, in part, to 
the loose definition of “error.” 
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Questionnaire Responses 


Subjective impressions of test Ss regarding 
the task under various gravity conditions 
were examined for conformity with the ob- 
jective data reported above. All nine Ss 
agreed that the harness, per se, made no dis- 
cernible difference. Therefore, the primary 
differences of concern were between 1 gravity 
and 1% gravity for the various suited con- 
ditions. 

In this comparison, whenever a number is 
reported in quotes, it refers to the scale 
number on a scale from ‘1” (easy) to “5” 
(very difficult), and is a mean representation 
of the three responses. 

The Ss were asked to rate the entire main- 
tenance task from easy to very difficult. The 
shirt-sleeve Ss rated the task as easy in both 
1 and 4% gravities, “1” and “1.67,” respec- 
tively. The vented-suit Ss found the task 
moderately difficult in both gravities, “3” 
and “3.67.” The pressure-suit Ss rated task 
difficulty as “3” for both gravity conditions. 

When asked to identify the most difficult 
single task, the connector-mating task was 
never indicated. All three shirt-sleeve Ss con- 
sidered the threading task most difficult in 1 
gravity, and considered the torquing task 
most difficult in 4% gravity. Two vented-suit 
Ss considered the threading task to be the 
most difficult in 1 gravity, and all three 
identified the torquing task as the most 
difficult when in 4% gravity. Torquing and 
threading were identified as equally difficult 
by the pressure-suited Ss in both gravities. 

The Ss did, in no instance, identify the 
middle portion of the performance panel as 
difficult to work on. Two shirt-sleeve Ss 
identified the lower portion as the most diffi- 
cult to work on in both gravities. The higher 
portion of the panel was identified as the 
most difficult by all three vented-suit Ss. 
However, two pressure-suit Ss described the 
lower portion of the panel as most difficult to 
work on. In short, it appears that extreme 
positions, high or low, caused the greatest 
performance difficulty relative to mainte- 
nance-task operations. 

The shirt-sleeve Ss did not consider the 
maintenance task fatiguing in either 1 or 4% 
gravity. Their ratings were “1” and “1.67,” 
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respectively. The vented-suit Ss found the 
task moderately fatiguing, “3,” for both 
gravities. The pressure-suited Ss rated the 
task as “3” in 1 gravity and “4” in &% 
gravity. The indication here is a trend in 
clothing condition going from the shirt-sleeve 
to pressure-suit mode. It is reasonable to 
suggest that, if the pressure-suit Ss were 
required to finish all task elements, they 
would have reported extreme fatigue. 

The Ss in all three clothing conditions 
unanimously agreed that they felt they had 
committed the most errors on the threading 
task, irrespective of gravity condition. They 
also unanimously agreed that they had to 
change their method of performance of the 
tasks when in 4% gravity as compared to per- 
formance in 1 gravity. 

The vented-suit Ss reported that perform- 
ance dexterity was moderately reduced, ‘3,” 
when compared to their shirt-sleeve base-line 
training. The pressure-suit Ss rated suit en- 
cumbrance as “5,” or extremely restrictive. 
Finally, no pressure-suit Ss reported anxiety 
in working in the pressurized mode. 


DiIscussIon 


The results of this study supported the 
hypothesis that a maintenance-task decre- 
ment would be manifested in lunar gravity 
when compared with performance in the 
earth’s gravity. The finding of no statistical 
difference between the 1-gravity and 1-grav- 
ity-harnessed conditions indicated that the 
performance decrement in lunar gravity was 
not due to the requisite harness. The Ss’ 
responses to the questionnaire also supported 
these findings: “In lunar gravity, I stood on 
tiptoes. More difficult to aim screws or nuts 
due to less stability. Torquing produced 
counter torques pushing me away from the 
panel.” 

Perhaps the most significant finding of 
this research was that Ss performing in the 
pressurized-suit condition were unable to 
complete all of the tasks. Further, the results 
of an analysis of the performance times sup- 
ported the hypothesis that the shirt-sleeve 
clothing condition would impose the least 
restriction on performance while the pres- 
surized suit would impose the greatest per- 
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formance restriction. These findings con- 
curred with Holmes’ (1965) description of 
the pressurized-suit mode as being the single 
most important variable hindering mainte- 
nance tool performance. 

The results of this research also supported 
the hypothesis that the connector-mating task 
would take less time to perform, while the 
torquing task would require the greatest 
amount of time due to counter-torques. The 
hypothesis that the threading task would 
cause the greatest number of errors was also 
supported. The interactions involving task 
indicated that maintenance tasks may be 
affected differentially by the person perform- 
ing them, the clothing condition, and the 
gravity condition in which they are performed. 


Task Analysis and Timeline Studies 


During successive stages of design and 
development of a complex system, information 
regarding the nature of the human interface 
with the system in terms of crew functions 
and tasks is necessary in order to insure that 
the system does not impose performance re- 
quirements beyond man’s capabilities. One 
purpose of this research was to derive extra- 
polation figures for maintenance-type tasks 
so that performance-time estimates could be 
made more accurately for lunar maintenance 
timeline studies. 

The performance data, summarized in 
Figure 1, provide an empirical basis for as- 
signing performance times to related lunar 
mission activities. For example, in estimating 
performance times for lunar intravehicular 
tasks in shirt-sleeves, 25.67% increment in 
performance time should be made over com- 
parable earth-based performance. Such gen- 
eralization from the data obtained in this 
study would provide a more valid basis for 
performance-time allocations than the “arm- 
chair analysis” technique so often resorted to. 


Performance Aids 


The findings of this research, that man’s 
behavior changes under the influence of lunar 
gravity, concur with the lunar-gravity re- 
search literature. The performance decrement 
observed in lunar gravity and in the various 
clothing conditions points to the necessity of 
considering human engineering design counter- 
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measures for coping with the peculiarities of 
the partial gravity lunar environment. While 
considerable attention is currently being given 
to performance aids for assisting the astronaut 
in the zero-gravity environment (Pierson & 
Geller, 1965), relatively little attention has 
been devoted to a comparable study of per- 
formance aids required to facilitate lunar 
maintenance and operational tasks (Holmes, 
1965). Based on the results of the present 
research, it may be just as necessary for the 
astronaut to anchor himself to his task in 
the lunar environment as it is for earth orbital 
operations. 

The following represents guidelines for 
unique design conceptualization of perform- 
ance aids. Handholds should be provided for 
the astronaut primarily as a method for 
obtaining a satisfactory position relative to 
the task. However, if possible, the astronaut 
should not be required to use his hands in 
steadying his position relative to his equip- 
ment task since this would limit his ability 
to use both hands in accomplishing tasks. It 
appears that a one-hand grip would not be 
sufficient to steady his position for many 
tasks. Properly spaced and distributed toe 
holds and tether anchor points appear to 
offer promise in helping to steady the astro- 
naut’s position for tasks involving force ap- 
plication such as the torquing task. Holmes 
(1965) investigated tether concepts in lunar 
gravity for maintenance tool work and con- 
cluded that a single tether located from the 


183 


navel area of the suit to the task was the 
best candidate. Further research is required. 
It has also been pointed out that the pres- 
surized suit prohibited performance in ex- 
treme positions. In addition to the tethers and 
handholds just cited, extravehicular work 
may require lunar ladders or toe holds on 
the side of structures where work is required 
above the astronaut’s shoulders. Also, work 
areas extremely close to the lunar surface 
should be avoided where possible. 
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MEMORY DYNAMICS AND WORK MOTIVATION* 
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Of 143 workers asked to report their pleasant and unpleasant experiences, only 
24 included work as 1 of their outstanding pleasant experiences. These 24 com- 
posed the work-motivated group. A group equated for age and job level was 
selected as a control group. The 2 groups were compared on measures of: 
(a) memory optimism, determined by predominance of pleasantness over un- 
pleasantness; (b) ratings of their adequacy as workers; (c) autonomy; (d) in- 
terpersonal competence. Significant differences were found in favor of the work- 
motivated group: they were more often memory optimists and superior in 
work adequacy, autonomy, and interpersonal competence. The relationship of 
these findings to the motivation-to-work controversy is considered. 


About work motivation and satisfaction al- 
most everything has been said, but not every- 
thing is known. Since Herzberg, Mausner, and 
Snyderman (1959) published their book, The 
Motivation to Work, in which they advocated 
a two-factor theory of job satisfaction, Herz- 
berg and his supporters have published many 
articles to prove this theory. However, other 
contributors studying the same problem are 
not at all convinced that Herzberg is right. 
Evidence negative to Herzberg’s theory has 
been reported in studies by Dunnette (1965), 
Ewen (1964), Friedlander (1963), and others. 
A review of the literature indicates that those 
studies which best support Herzberg’s theory 
are those which use the same or similar meth- 
ods. Quinn and Kahn (1967), in their review 
of organizational psychology, have this to 
say about the Herzberg controversy: 


Whereas Herzberg has rested a stimulating theory of 
job satisfaction on a rather shaky empirical founda- 
tion, the Cornell group (1963-65) have launched an 
intensive series of studies of the measurement of job 
satisfaction with only a minimum of theory to guide 
them [p. 456]. 


According to the Cornell researchers (Ewen, 
Smith, Hulin, & Locke, 1966), their conclu- 
sion is that neither the Herzberg theory nor 
the traditional theory are supported by the 
data they investigated. Instead, results indi- 
cate that intrinsic factors are more closely 


1 Based on a report presented at the Divison 14 
session on Organizational Behavior at the American 
Psychological Association, Washington, D. C., Sep- 
tember 9, 1967. 

2Now at Concordia Senior College, Fort Wayne, 
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related to overall satisfaction and overall dis- 
satisfaction than the extrinsic factor, pay, 
and suggest that functioning of the extrinsic 
variable may depend on the level of satisfac- 
tion with the intrinsic variables. In the litera- 
ture on work motivation by Herzberg and 
protagonists, as well as by his antagonists, the 
problem of personality differences between a 
motivated group and a nonmotivated group 
has not been studied. 

Vroom (1962) hypothesized and confirmed 
that if a person believes himself to possess an 
ability and believes that successful perform- 
ance of his task requires that ability, he will 
prefer performing the task effectively to per- 
forming it ineffectively. If he believes effective 
performance is irrelevant to his ability, he will 
be indifferent about performing. If effective 
performance is negatively relevant, he will 
prefer to perform ineffectively. 

Porter (1963a, 1963b), in a series of studies 
of managers, showed that their satisfactions 
were substantially related to such variables as 
level of job, line-staff type of position, and 
size of company. His studies are based on 
a modified form of Maslow’s hierarchy of 
needs. In that sense it can be considered a 
problem of motivation and satisfaction. How- 
ever, he, too, has neglected to study person- 
ality characteristics of work-motivated people. 

In the first study on the problem of memory 
dynamics of workers, Meltzer and Ludwig 
(1967) found that, though younger workers 
are significantly more often memory optimists 
than older workers, each age group signifi- 
cantly expressed pleasant memories more often 
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than unpleasant. These findings were inter- 
preted as suggesting that personality factors, 
rather than age factors as such, influence 
memory dynamics involved. In a study still 
unpublished, a relationship was found be- 
tween memory dynamics of workers and cer- 
tain personality variables. While working with 
the memories reported, it was apparent that 
only a few included work as one of their out- 
standing pleasant experiences of a lifetime. Of 
the 143 workers, only 24, or 16.78%, included 
work as one of their outstanding pleasant ex- 
periences. It dawned on the investigators that 
here, if ever, was a work-motivated group. 
Not only did work have intrinsic satisfaction 
to them, but work evidently was ego-involv- 
ing enough to be included in pleasant experi- 
ences of a lifetime. It would be interesting to 
compare workers of that kind with those to 
whom work did not have that kind of ego- 
involving meaning. Accordingly, it was ar- 
ranged to compare an equated group with 
them on four measures—memory optimism, 
work adequacy, autonomy, and interpersonal 
competence. 

The purpose of the present study was to 
fill, in some small measure, the gap in knowl- 
edge about the relationship of work satis- 
faction to personality characteristics by com- 
paring a work-motivated group with a non- 
work-motivated group in certain personality 
variables, including memory dynamics. The 
problems studied, expressed in the form of 
hypotheses, were: (1) More of the work- 
motivated group will be memory optimists; 
(2) the work-motivated group will be sig- 
nificantly higher in work adequacy than the 
non-work-motivated group; (3) the work- 
motivated group will be significantly superior 
in the two personality variables measured— 
autonomy and interpersonal competence. 


METHOD 
Subjects and Setting 


The Ss used in this study were 143 workers in a 
paper converting industry located in a stable college 
community in upstate New York. This paper mill is 
the only plant of any size in the community, and 
obtaining a job there is considered to be a sign of 
establishment to many people living in the com- 
munity. For the most part, Ss were semiskilled 
workers with large families and strong feelings about 
their jobs in the industry. The Ss ranged in age from 
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19 to 78 and represented about one-third of all the 
workers in the plant. The 143 were selected from a 
larger group because they were the only ones inter- 
viewed for data concerning pleasant and unpleasant 
experiences. All of the workers who mentioned their 
work experiences as a part of their pleasant memories 
were classified in the work-motivated group. A 
group matched to these 24 workers was selected from 
the remaining 119 workers by equating them for 
age and job level. 


Source Material 


In the interview as structured, the directed ques- 
tioning about work, family, personal story, and 
present conditions and problems was presented in 
open-end fashion. The more projective phases 
(Meltzer, 1950) of the interview, which in this study 
include the recall of outstanding pleasant and un- 
pleasant experiences of a lifetime, were given after 
the factual phase of the interview was completed. 


Procedure 


The interview was structured in the following 
sequence: 


1. Direct questioning about work, family, personal 
story, and present conditions or problems. 

2. Recall of the most outstanding pleasant experi- 
ence with the question, “And now tell me what comes 
to your mind when you think of the most outstand- 
ing pleasant experience of your life.” 

3. Recall of other outstanding pleasant experiences. 

4. Recall of the most outstanding unpleasant ex- 
perience. 

5. Recall of other outstanding unpleasant experi- 
ences. 


Memory optimism was determined by predominance 
of pleasantness over unpleasantness in the experi- 
ences given by the S$; memory pessimism was de- 
termined by the predominance of unpleasantness 
over pleasantness. In the original article, however, 
wishful and eternal optimists were differentiated from 
realistic optimists (Meltzer & Ludwig, 1967). 

Work competence was based on the rating by the 
interviewers at the end of the interview. The inter- 
viewers had available the ratings of the respective 
foremen of the Ss studied to consider as an aid in 
making their judgments. Scores for various ratings 
were as follows: 1 for D; 2, D+; 3, C; 4, C+; 
5, 1B @, Ware Wo Ave Bb Alas 

Autonomy was determined on the basis of a con- 
tent analysis of the personal and work histories given 
by the S, including projective interview material. 
The S was given a “+” in autonomy if the histories 
contained statements indicating adequate security 
and adjustment; a “—” indicated insecurity and mal- 
adjustment; and a “0” was assigned if no clear as- 
sessment could be made. A second judge made inde- 
pendent ratings of the same factors, with a correla- 
tion of .6 between the two. (A 3 X3 contingency 
table was used with the maximum correlation being 
82.) 
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Interpersonal competence was also determined by a 
content analysis of the same source material, namely 
the work and family histories, plus projective re- 
sponses to memories of pleasantness and unpleasant- 
ness. An S was given a “+” in interpersonal com- 
petence if the histories contained statements indi- 
cating that S got along well with others. If there 
were statements indicating a history of poor rela- 
tions with others a “—” was given. A “O” was given 
if no clear judgment could be made. The rating of 
the second judge correlated .64 with the first. (A 
3 X 3 contingency table was used with the maximum 
correlation being .82.) 


RESULTS 


Differences in Memory Optimism and Work 
Adequacy (Hypotheses 1 and 2) 


The two groups were compared on memory 
optimism and work adequacy, or performance. 
The means, standard deviations, and ¢ values 
for the two groups on memory optimism and 
work adequacy are reported in Table 1. 

These results support the first two hypoth- 
eses; namely, that the work-motivated group 
is composed of people who are more likely to 
be memory optimists, and that the work- 
motivated group rates significantly higher on 
work adequacy. 


Differences in Autonomy and Interpersonal 
Competence (Hypothesis 3) 


The Ss were then compared on the measures 
of personality variables—autonomy and inter- 
personal competence. A comparison was made 
in a 2 X 2 fashion with the chi-square values 
as a measure of significant difference. The 
relevant data for the comparison of the two 
groups are presented in Table 2. On the mea- 
sure of autonomy, the work-motivated group 
rated significantly higher than the matched 
group. On the measure of interpersonal com- 


TABLE 1 


DIFFERENCES BETWEEN WoORK-MOTIVATED GROUP AND 
ContROL Group on Memory Optimism 
AND Work ADEQUACY 











Factor Group M SD |tvalue 
Memory 
optimism Work-motivated ZVZ5 Lo 
3.47* 
Non-work-motivated S7on 1.20 
Adequacy Work-motivated 5.5025) 2532) 
2.89* 
Non-work-motivated 3.461 | 2.11 
*p <.01. 
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TABLE 2 


DIFFERENCES BETWEEN WoRK-MOTIVATED GROUP AND 
CONTROL Group ON AUTONOMY AND 
INTERPERSONAL COMPETENCE 

















Positive} Negative 
Factor Group factor factor 7 
Autonomy Work-motivated 11 
9.03** 
Non-work- 
motivated 5 16 
Interpersonal 
competence |Work-motivated 14 6 
4.04* 
Non-work- 
motivated 5 9 
* bp <.05 
Fd < .01 


petence, the work-motivated group was rated 
more favorably than the matched group. These 
two findings support the third hypothesis; 
namely, that the work-motivated group is 
significantly superior on the two personality 
variables—autonomy and interpersonal com- 
petence. 


DISCUSSION 


Significant characteristics which differenti- 
ate the work-motivated group from the con- 
trol group are revealed in this study. In every 
respect measured, the work-motivated group 
comes out more favorably or superior than 
the equated control group of fellow workers. 
The work-motivated group contains signifi- 
cantly more realistic memory optimists, who 
are rated more adequate as workers on the 
job, are more autonomous or secure and ad- 
justed, and are more competent in their in- 
terpersonal relations. The two groups are 
greatly different in memory dynamics and job 
adequacy as well as personality variables. In 
this plant, at least, people who are work 
motivated as evidenced by their ego involve- 
ment in the form of expressed feelings in their 
life memories are identified by better per- 
formance, better adjustment, and more op- 
timistic outlook on their past lives. The pro- 
cedure described here has been and can be 
used to advantage for management-evaluation 
purposes. For research purposes it would be 
worthwhile to compare the findings based on 
a similar study with the results obtained on 
the same workers with the use of the Critical 
Incident Approach of Herzberg. Such explora- 
tion opens up problems of significance, the 


Memory DyNAmIcS AND Work MOTIVATION 


study of which could help resolve some of the 
conflict about motivation to work found in 
the literature of today. 
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Incremental influence, a new concept for the study of organizational leadership, 
was empirically tested by relating 12 work-group performance measures with 
measures of supervisors’ social power. Referent power and expert power were 
conceptualized as incremental influence and considered qualitatively different 
from reward power, coercive power, and legitimate power. Referent power 
and expert power are considered to be idiosyncratic in character and depend 
upon an individual’s unique role behavior, while reward power, coercive power, 
and legitimate power are organizationally determined and designed to be 
equal for all supervisors on the same hierarchical level. As predicted, expert 
power was related to a low accident rate, referent power was somewhat 
related to measures of subordinate withdrawal, and both referent power and 
expert power were generally related to group measures of production per- 
formance. These results are regarded as supporting the concept of incremental 


influence. 


A recent book by Katz and Kahn (1966) 
has suggested a new concept for the study of 
organizational leadership. The concept, in- 
cremental influence, is stated as follows: 


When we think of leadership in contrast to routine 
role performance, however, we become particularly 
interested in those kinds of individual behavior 
which go beyond required performance and realize 
more fully the potential of a given position for 
organizational influence. In other words, we con- 
sider the essence of organizational leadership to be 
the influential increment over and above _ the 
mechanical compliance with routine directives of 
the organization [p. 302]. 


The present research is an empirical investiga- 
tion of the concept of incremental influence in 
an industrial organization. More specifically, 
this research will analyze some performance 
correlates of the first-line supervisors’ in- 
cremental influence. 

A five-fold typology of social power sug- 
gested by French and Raven (1959) was 
used as the framework for the author’s 
analysis of supervisory influence. Although 
other bases of power could be cited, French 
and Raven specify these five as the primary 
by which an agent, O, can exert influence 
over a person, P: (1) Reward power, based 
on P’s perception that O has the ability to 
mediate rewards for him. In order for O to 
use reward power to influence P, he must 
possess resources that P values, and P must 


believe that by conforming he will actually 
benefit by the resources in question. (2) 
Coercive power, based on P’s perception that 
O has the ability to mediate punishments for 
him. Analogous to the situation for reward 
power, in order for O to use coercive power 
to influence P, P must want to prevent some 
act of O and believe that through conforming 
he will do so. (3) Legitimate power, based on 
internalized values in P that O has a right 
to influence P and that P has an obligation to 
accept this influence. (4) Referent power, 
based on P’s identification with O. French 
and Raven define identification as “a feeling 
of oneness of P with O, or a desire for such 
identity.” If O is a person toward whom P 
is attracted, P will have a desire to be closely 
associated with O. This identification of P 
with O can be established or maintained if 
P behaves, believes, or perceives as O does. 
(5) Expert power, based on P’s perception 
that O has some special knowledge or expert- 
ness in a given area. French and Raven hold 
that the strength of expert power varies with 
the extent of knowledge which P attributes 
to O and that the range of expert power is 
restricted to those cognitive areas in which 
O is believed to have superior knowledge. 
Experience, training, reputation, demonstrated 
ability, etc. are among the many reasons 
why P attributes expertness to O. 
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This five-fold typology of the bases of 
supervisory power makes possible an im- 
portant conceptual distinction between the 
bases of power and leads to an operational- 
ization of the concept of incremental in- 
fluence. The areas in which reward power, 
coercive power, and legitimate power may 
be exercised are largely specified by the 
organization; on the other hand, areas in 
which the supervisor can exercise referent 
power and expert power are to a substantial 
degree uniquely determined by his own be- 
havior and his interactions with his sub- 
ordinates (expert power is personally de- 
termined, but the amount of information 
available may vary as a part of organiza- 
tional structure). The extent and range of a 
supervisor’s referent power and expert power 
cannot be specified by the organization. 
Expert power and referent power are idio- 
syncratic in character. On the other hand, 
reward power, coercive power, and _legiti- 
mate power are nomothetic in char- 
acter and result from the supervisor’s oc- 
cupancy of a position in the organization’s 
role system. The formal organizational struc- 
ture is designed to provide equal legitimate 
power to all supervisors on the same hier- 
archical level and to give them equal access 
to the use of organizational rewards and 
punishments. However, supervisors are not 
equal in their referent power or their ex- 
pert power; the supervisor’s referent power 
and his expert power constitute his particular 
utilization of his formal position. Thus the 
author concludes that referent power and 
expert power are conceptually and qualita- 
tively different from reward power, coercive 
power, and legitimate power. Influential acts 
based on referent power and expert power 
constitute an increment in organizational in- 
fluence beyond the influence inherent in the 
routine functioning of the organization’s role 
system. Consequently, referent power and 
expert power are the basis of incremental 
influence, and incremental influence has been 
operationalized as the additive combination 
of referent power and expert power. 

The present research relates incremental 
influence to work-group performance. Since 
recent research in a variety of organizational 
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settings casts doubt on the validity of job 
performance as a unidimensional construct, 
a multiple-criteria approach to work-group 
performance was used. Supervisory influence 
will be related separately to 12 performance 
measures. These measures were conceptual- 
ized by a framework suggested by March and 
Simon (1958) who state that organizational 
members face two different decisions—the 
decision to participate and the decision to 
produce. Consequently, the performance 
measures are considered as measures of with- 
drawal, or as measures of production; ac- 
cident rates are considered separately since 
this criteria does not fit into the withdrawal- 
production classification. An impressive body 
of research evidence (Kahn & Katz, 1960; 
Likert, 1961; Mann, 1965; Pfiffner, 1955) 
suggests that incremental influence (i.e., both 
referent power and expert power together) 
would relate to production measures. On the 
other hand, March and Simon (1958) hy- 
pothesized that withdrawal measures relate 
to satisfaction but not to performance, which 
in the author’s terms would suggest that 
withdrawal measures relate mainly to referent 
power. Two studies support this view. Fleish- 
man, Harris, and Burtt (1955) found a 
negative relationship between supervisory 
consideration and absenteeism; Fleishman 
and Harris (1962) found a negative relation- 
ship between supervisory consideration and 
turnover. Finally, in terms of the analytical 
model, the author considers that accidents 
are unintended consequences resulting from 
low supervisor “technical competence” 
(Mann, 1965, defines technical competence 
as the ability to use pertinent knowledge, 
methods, techniques, and equipment neces- 
sary for the performance of specific tasks 
and activities) and the inability of the super- 
visor to impart his competence to his sub- 
ordinates. Accordingly, one would expect 
that accidents relate mainly to supervisory 
expert power. Consequently, the operational 
hypotheses were that incremental influence 
(i.e., both referent power and expert power 
together) would relate to the production 
measures; referent power would relate to the 
withdrawal measures; expert power would 
relate to the accident rate. 


190 


Kurt R. STUDENT 


TABLE 1 


RELATIONSHIPS BETWEEN MEASURES OF SUPERVISORY INFLUENCE 











Measure 1 | 2 | 3 | 4 | 5 | 6 
1. Incremental influence 
2. Referent power coo nae 
3. Expert power oor 49** 
4, Reward power A4** aoa Ah 
5. Coercive power a me 09 A5** 
6. Legitimate power son” 30" 24 08 Ai** 








Note.—Group means, N = 40. 
* p <.05, two-tailed. 

** » < .01, two-tailed. 

eK << 001, two-tailed 
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Research Site and Sample 


This study employed data collected from a 
manufacturer of major home appliances. The data 
were collected for a larger study sponsored by the 
company and conducted by the Survey Research 
Center. The sample consisted of 486 hourly em- 
ployees and 39 first-line supervisors representing 40 
of the 52 work groups which comprised the com- 
pany’s two main plants, and the unit of analysis 
in this research was the work group. Measurements 
of attitudes, opinions, and perceptions of behavior 
were collected by paper-and-pencil questionnaires 
administered to nonsupervisory production em- 
ployees during March 1964. 


Questionnaire Measures of Supervisory In- 
fluence 


The measures of supervisory influence were as 
follows: 


Referent power. A single-item rating by sub- 
ordinates of the extent to which they comply with 
their supervisor’s directives because he is a “nice 
guy” and they don’t want to hurt him. 

Expert power. A single-item rating of the ex- 
tent to which subordinates comply with their 
supervisor’s directives because they respect his ex- 
perience and good judgment. 

Reward power. A single-item rating of the extent 
to which subordinates comply with their supervisor’s 
directives because he can give special help and 
benefits to those who cooperate with him. 

Coercive power. A single-item rating of the ex- 
tent to which subordinates comply with their 
supervisor’s directives because he can _ penalize 
or make things difficult for those who do not co- 
operate with him. 

Legitimate power. A single-item rating of the ex- 
tent to which subordinates comply with their super- 
visor’s directives because he has a right, consider- 
ing his position, to expect subordinates to do what 
he wants. 


Incremental influence. As noted earlier, this con- 
cept was operationalized by summing the group 
mean for referent power and the group mean for 
expert power. 


Table 1 presents the relationships between these 
measures of supervisory influence, while Table 2 
presents the means and standard deviations of in- 
cremental influence and the bases of social power. 


Measures from Company Records 


For purposes of the larger study from which 
the present research data are taken, a roster of 
work-group performance measures was developed 
so that a number of different measures of per- 
formance could be related to questionnaire data. 
Twelve of these work-group performance measures 
were used in the present research. These 12 measures 
were tabulated weekly by the company during a 
17-week period from immediately preceding and 
following the questionnaire administration. Six of 
the measures were direct measures of frequency or 
amount, and these measures were converted to a 
per-employee basis for each work group; the other 
six measures were based on ratings of each work 
group by the department head. The ratings were on 
4-point descriptive scales based on the firm’s ex- 
isting budgetary, allowance, or schedule standards. 
Actual group performance was compared against 


TABLE 2 


MEAN AND STANDARD DEVIATION OF INCREMENTAL 
INFLUENCE AND BASES OF POWER 











Measure M SD 
Incremental influence 6.12 2 
Referent power Did AT 
Expert power 3.39 9 
Reward power 2.78 AO 
Coercive power Dead 42 
Legitimate power 3.43 44 


Note.—Group means, N = 40. 


SUPERVISORY INFLUENCE AND WorkK-GrouP PERFORMANCE 


TABLE 3 


LISTING AND DESCRIPTION OF WoRK-GROUP 
PERFORMANCE MEASURES 











Rating measures of performance 





Indirect costs (+) 
Performance rating based on actual versus budgeted 
amounts of inspection, stockhandling, general labor, 
make-up allowances, and rework labor (4-point scale). 
Maintenance costs (+) 
Performance rating based on actual versus budgeted 
amounts for maintenance, considering both type and 
quality of maintenance orders (4-point scale). 


Supply costs (++) 
Performance rating based on actual versus budgeted 
allowance for supplies (4-point scale). 


Scrap costs (++) 


Performance rating based on actual versus budgeted 
amounts for scrap and rework (4-point scale). 


Performance against schedule (+) 
Performance rating based on the extent to which the 
work group: stayed on or ahead of schedule (Rat- 
ing = 4), required help to stay on schedule (Rating = 
3), was behind schedule (Rating = 2), or was behind 
to the extent of causing down-time (Rating = 1). 


Quality (+) 
Performance rating based on actual versus antici- 
pated numbers of rejects, taking into consideration 
the disposition, amount, and kind of reject (4-point 
scale). 


Average earnings (+) 
The average earnings per hour per man for each 
work group. 
Excused absences (—) 
The number of occurrences of excused absence, 
divided by the number of men in the work group. 
Unexcused absences (—) 
The number of occurrences of unexcused absence, 
divided by the number of men in the work group. 
Accidents (—) 
The number of reported injuries divided by the num- 
ber of men in the work group. 
Turnover (—) 
The number of quits, transfers, and formal bids for 
jobs in other work groups and not involving advance- 
ment, divided by the number of men in the work 
group. 
Suggestions submitted (++) 


The number of suggestions formally submitted, 
divided by the number of men in the work group. 


Note.—In each case, direction of scale is indicated by a plus 
or minus sign. ‘‘-+-’’ means that a high numerical rating repre- 
sents ‘‘good’’ performance. 
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these abstract standards. Table 3 lists the 12 work- 
group performance measures taken from the com- 
pany records and presents a description of each 
measure. 

Note that a high score on any 1 of the 12 work- 
group performance measures may indicate either 
excellent or poor performance depending upon the 
specific measure in question. 


RESULTS 


The first hypothesis predicted that expert 
power would be positively related to a low 
work-group accident rate. The results from 
an analysis of this prediction are presented in 
Table 4 indicating that the hypothesis is sup- 
ported. Expert power is the only one of the 
five bases of supervisors’ power to relate 
significantly to a low accident rate. 

The second hypothesis predicted that meas- 
ures of withdrawal—excused absences, un- 
excused absences, and turnover—would be 
related to referent power. Specifically, the 
author predicted that high referent power 
would be associated with low absences and 
low turnover. The results relating to this 
prediction are presented in Table 5. 

Table 5 indicates that this prediction is 
only partially supported. As expected, referent 
power is associated with low excused absences. 
However, it was found that unexcused ab- 
sences and turnover are not associated as 
expected with supervisory bases of power. 
Surprisingly, high referent power tends to be 
related to Aigh turnover. This latter finding 
is contrary to most previous research which 
would suggest that supervisory behavior 
which is characterized by high referent power 
should result in employees’ continuing in the 
system. In the present case, however, un- 
excused absences are not at all related either 
to referent power or to the other bases of 
supervisory power, while turnover is related 
in a direction opposite to that predicted. 

The third hypothesis predicted high in- 
cremental influence (both referent power and 
expert power) would be related to good 
performance on those measures which were 
considered to represent measures of produc- 
tion. The results from an analysis of this 
prediction are presented in Table 6. 

The predictions of this study are generally 
supported. For four production performance 
measures (indirect costs, supply costs, quality, 
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TABLE 4 


RELATIONSHIPS BETWEEN ACCIDENTS AND SUPERVISORY POWER 


Subordinate work-group’s mean perception of their supervisor’s 

















Incremental Referent Expert Reward Coercive Legitimate 
influence power power power power power 
Accidents — 24 — 12 — .28* — .03 | —.16 — .20 
Note.—Group means, N = 40. 
* p <.05, one-tailed. 
TABLE 5 


RELATIONSHIPS BETWEEN WITHDRAWAL PERFORMANCE MEASURES AND SUPERVISORY POWER 





Subordinate work-group’s mean perception of their supervisor’s 














Withdrawal 
performance 
measure Incremental Referent Expert Reward Coercive Legitimate 
influence power power power power power 
Excused absences Or =135* — 28" aS 16 —.12 
Unexcused absences .00 AO, 02 18 02 — .08 
Turnover a2 23 —.01 14 08 01 





Note.—Group means, N = 40. 
*» <.05, one-tailed. 


and suggestions submitted) the predicted 
zero-order relationships are significant. For 
two other production performance measures 
(scrap costs and maintenance costs) the 
predicted zero-order relationships tend toward 
significance. However, no relationships exist 
for two predicted relationships; incremental 
influence is not related to good performance 


on average earnings or performance against 
schedule. 


Discussion 


Although legitimate power ranked first 
among the reasons for compliance with super- 
visory directives, legitimate power is not 
related to differences in the performance of 


TABLE 6 


RELATIONSHIPS BETWEEN PRODUCTION PERFORMANCE MEASURES AND SUPERVISORY POWER 











Subordinate work-group’s mean perception of their supervisor’s 
p percep p 





Production performance 








Se ee Incremental] Referent Expert Reward Coercive | Legitimate 

influence power power power power power 
Indirect cost performance poe » 40** 10 IB) 22 .00 
Maintenance cost performance® ale .00 18 — 20 aos 10 
Supply cost performance pol al O28 soy ae 08 08 
Scrap cost performance 8 ron 13 .26 12 06 
Performance vs. schedule Emel .05 —,.21 — .06 04 —105 
Quality p30 Bo sgl 13 — .08 Al 
Average earnings 01 .00 01 — .40** aD 05 
Suggestions submitted es” 0m 14 .09 40** 10 














Note.—Group means, N = 40. 


8 Mean for 29 groups used as measure for mean of remaining 11 groups. 


* p <.05, one-tailed. 
¥t » <.01, one-tailed. 
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these subordinate work groups. The data 
suggest, therefore, that when the supervisor 
has a choice in the bases of power he em- 
phasizes, to achieve compliance with his di- 
rectives the supervisor should attempt to 
create and to use referent power or expert 
power rather than rely too strongly on his 
legitimate power. 

Reward power and coercive power, unlike 
legitimate power, do relate to a number of 
performance measures. However, the overall 
relationship between these two bases of power 
is two-sided. From the organization’s point 
of view the correlates of reward power and 
coercive power are problematic and have un- 
intended consequences for the organization. 
Organizationally speaking, reward power re- 
lates functionally to supply costs but dys- 
functionally to average earnings; coercive 
power relates functionally to suggestions sub- 
mitted but dysfunctionally to maintenance 
costs. Therefore the data suggest that the 
supervisor’s use of reward power and coercive 
power should be tempered where possible by 
the creation of referent power and expert 
power. Since there appear to be no dysfunc- 
tional or organizationally unintended cor- 
relates of incremental influence, the latter 
should be created and used where possible. 

As was noted above and as predicted, in- 
cremental influence has positive and organiza- 
tionally functional correlates. However, it is 
important to note that only 7 of the 12 per- 
formance measures relate to incremental in- 
fluence. Five measures, two withdrawal meas- 
ures (unexcused absences and turnover) and 
three production measures (maintenance 
costs, performance against schedule, and 
average earnings), showed no relationship to 
incremental influence. The reason for the 
difference in relationship between incremental 
influence and performance is not altogether 
clear. Perhaps the five performance measures 
which do not relate to performance represent 
areas or factors over which the first-line 
supervisor has little or no control. Unexcused 
absences and turnover might be related to 
extraorganizational considerations or to or- 
ganizational factors which are unrelated to 
compliance with the first-line supervisor. The 
three production measures which do not re- 
late to incremental influence may be said 
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to represent company-wide or system factors 
rather than to be related to factors at the 
first-line supervisory level of the company. 
Maintenance costs, performance against 
schedule, and average earnings may be related 
to company-wide policies regarding mainte- 
nance programs, production schedules, and 
pay rates, respectively. 

Finally, it should be noted that the correla- 
tions for referent power and expert power are 
not generally additive, and referent power 
is a better predictor of performance than is 
expert power. Referent power is significantly 
related to five performance measures (excused 
absences, indirect costs, quality, suggestions 
submitted, and scrap costs), while expert 
power relates significantly to four measures 
(accident rate, excused absences, supply 
costs, and quality). Only in the case of 
quality is the correlation for incremental 
influence greater than the correlations for 
referent power and expert power, which sug- 
gests that in this case the bases of power 
are additive. 

The present research has shown the con- 
cept of incremental influence to be a useful 
and viable one. The qualitative distinction 
between referent power and expert power 
on the one hand and reward power, coercive 
power, and legitimate power on the other 
was generally supported by the data. The 
performance correlates of referent power and 
expert power are surprisingly large in view of 
the fact that the research did not include 
extraorganizational factors and included only 
one level of management. Moreover, the first- 
line supervisor in the present research site is 
a supervisor who may be said to have a 
relatively small area of freedom; his role 
range is limited by industrial engineers, 
quality-control specialists, time and motion 
study men, and others, as well as by the 
standardized nature of high-volume, single- 
product production. In view of the limitations 
inherent in the role which was studied, the 
findings are encouraging and further research 
and validation of these findings are indicated. 
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INFLUENCE OF TRAINING, METHOD, AND RELATIONSHIP 
ON THE HALO EFFECT 


EVA METZGER BROWN 1 


Teachers College, Columbia University 


120 student nurses each rated 6 peers on 10-point scales for 6 sets of traits. 
The variance of a judge’s ratings for each object was found, and the variance 
scores of the 6 objects for each judge summed. This dispersion score was used 
as the measure of the halo effect. It was found that trained judges exhibited 
less halo than untrained judges, but that ratings made by rating 1 individual 
on 6 traits at a time yielded substantially the same amount of halo as did 
ratings made by rating all individuals on 1 trait at a time. The relationship 
between the judge and the object was a significant variable for the untrained 
judges. Here the ratings for the less intense relationship exhibited significantly 
more halo than the ratings for the intense relationship. Training of the judges 
improved all ratings to the point where the judge-object relationship did not 


play a significant role. 


The halo effect poses a problem for the 
investigator because of its pervasive yet 
elusive presence in the rating situation. It 
occurs when “a particular rater tends to rate 
a particular ratee similarly on all traits” 
(Guilford, 1959, p. 146), for any reason. The 
halo effect was noted briefly by Wells in 
1907 and labeled by Thorndike in 1920. 

Statistically, the halo effect has been de- 
fined in three ways: (a) as an inflated inter- 
correlation among traits (Gilinsky, 1947; 
Taylor & Hastman, 1956), (0) as a general 
bias factor arrived at through matrix and 
factor analyses (Grant, 1952; Meyers, 1965), 
and (c) as the judge-object (J—O) inter- 
action error evaluated according to Guilford’s 
(1954) analysis-of-variance model. The first 
definition, although responsible for the orig- 
inal concept of the halo effect, is now viewed 
as lacking in sophistication. The findings of 
factor-analytic studies, while of value in 
identifying halo, are limited to a specific ex- 
perimental situation. The third definition is 
the most sophisticated in that it differentiates 
between objective (valid) and relative (in- 
valid) halo, and the more useful in that it is 
applicable to all studies where a number of 


1 The research reported in this paper is based on 
a doctoral dissertation submitted to Teachers Col- 
lege, Columbia University. Special thanks are given 
to Laurance F. Shaffer, chairman of the dissertation 
committee, Elizabeth P. Hagen and Rosalea A. 
Schonbar, members of the committee, and to 
Rosedith Sitgreaves, consultant to the project. 


Js are rating common Os on a common scale. 
Although the data of the present study were 
not based on ratings of common Os, the 
conception of Guilford (1959) was used to 
define halo as the J—O interaction error and 
to apply the analysis-of-variance technique. 
In the study, the halo effect was defined as 
inversely proportional to the variance score 
for ratings given an O by a J over six sets 
of traits. 

The main purpose of this study was to 
identify the conditions which minimize the 
operation of the halo effect. The study ex- 
amined the influences of the training of the 
Js, of the method of rating, and of the 
quality of the J—O relationship. Previous re- 
search has indicated that Js benefit from 
training when it includes: (@) practice with 
the specific rating scale (Wakeley, 1961), 
(6) discussion of errors in rating by the Js 
(Levine & Butler, 1952), and (c) special 
emphasis on the importance of trait differ- 
entiation (Taylor & Hastman, 1956). As for 
the method of rating, Symonds (1925) sug- 
gested that more halo results when a J rates 
one individual at a time on all traits (Method 
1), than when a J rates all individuals on one 
trait at a time (Method 2). Method 2 has 
been widely accepted as superior to Method 
1, but recent studies (Blumberg, DeSoto, & 
Kuethe, 1966; Johnson, 1963; Taylor & 
Hastman, 1956) have reported substantially 
the same degree of halo effect for both 
methods. Characteristics of the J—O relation- 
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ship which influence the J’s ratings of the O 
have not been studied adequately. Greater 
attention has been focused on the level of 
acquaintance between the J and the O (Bare, 
1954; Koltuv, 1962), and the data support 
the hypothesis that increased acquaintance is 
related to decreased bias in judgment. 

The broad hypothesis of the study is that 
the operation of the halo effect is influenced 
by the training of the Js, by the method of 
rating used by the Js, and by the intensity 
of the relationship between the J and the O. 


MeEtHOopD 
Subjects 


One hundred and twenty student nurses in at- 
tendance at St. Luke’s Hospital School of Nursing, 
New York, served as subjects. They were divided 
into four groups: Al, A2, B1, and B2. Groups A 
were trained, whereas Groups B were not trained; 
Groups 1 made ratings according to Method 1, 
whereas Groups 2 made ratings according to 
Method 2. 


Instruments 


The instruments were designed and adapted to 
serve the needs of the study, and written instruc- 
tions preceded each questionnaire. 

The Information Questionnaire was used to 
gather some background information on each J. 
The Form for the Selection of Objects required 
each J to select three Os toward whom she felt 
as favorable as possible (intense relationship) and 
three Os toward whom she felt indifferent (less 
intense relationship). All Os were fellow students. 

The Acquaintance Checklist assessed how well 
the J felt she knew the O, while the Social Dis- 
tance Scale measured how well the J liked the O. 

The Rating Scale consisted of six bipolar traits 
(factors) on 10-point scales. The factors, selected 
from the Guilford-Zimmerman Temperament Survey 
(Guilford & Zimmerman, 1949), included lethargic— 
active, impulsive-self-controlled, withdrawn-sociable, 


emotionally unstable-emotionally stable, mannish— 
feminine, and hostile—friendly. 
The Postexperimental Meaction Questionnaire 


tapped the J’s knowledge of the aims of the study 
and her previous experience in rating others. 


Procedure 


The data were collected in four 1-hr. periods 
spaced 1 wk. apart. No more than 30 Js were seen 
per session and no more than 15 Js were trained 
at a time. 

In Session I a broad outline of the procedure 
relevant to the trained and untrained groups was 
given, and the confidentiality of ratings was stressed. 
The Js completed the Information Questionnaire and 
selected their Os. 
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In Session IJ, Group A _ was trained, while 
Group B did not meet. The experimenter, acting 
as an instructor, told Group A about different kinds 
of rating scales and rating procedures, problems in 
obtaining sound ratings, and constant errors. The 
Js were given some practice on the Rating Scale 
and their ratings were discussed. 

In Session III, the Js completed the Acquaintance 
Checklist, the Social Distance Scale, and the Rating 
Scale. In Session IV, these three scales were re- 
administered and their test-retest reliabilities found 
to be .95, .96, and .85, respectively. Finally, the 
Postexperimental Questionnaire was completed. 


RESULTS 
Characteristics of the J-O Relationship 


Student nurses were selected as subjects 
because there was a basic level of familiarity 
among them. In the sample used, however, 
91% of the “favorable” Os were known “ex- 
tremely well” or “very well” by the Js, 
whereas 62% of the “indifferent” Os were 
known only “a little” or “hardly.” The cor- 
relation between the Acquaintance Checklist 
and Social Distance Scale was .79. It was 
concluded that the Os who had an intense 
relationship with the J were well known to 
the J, and the Os who had a less intense 
relationship with the J were less well known 
to the J. 


Influence of Training and Method on the 
Halo Effect 


The means and standard deviations for 
the variance scores of the experimental groups 
appear in Table 1. Each mean value was 
based on 30 scores, except for the combined 
“Groups” and “Methods” where the mean 
value was based on 60 scores. Each score was 
the sum of one J’s variance scores for three 
favorable or three indifferent Os, or the sum 
of one J’s variance scores for six Os when 
the relationships were combined. The higher 
the score the smaller was the halo effect, and 
the lower the score the greater was the halo 
effect. 

The influence of training and method on 
the halo effect was evaluated by a two-way 
analysis of variance (Table 2). The signifi- 
cance of the variable of training at the .01 
level led to the conclusion that a brief but 
specific training period leads to more analytic 
and discriminating judgments. 

The variable of method was nonsignificant, 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF RATING-SCALE 
VARIANCE SCORES ACCORDING TO TRAINING, 
METHOD, AND RELATIONSHIP 











. Method| Method] Both 
Experimental groups 1 2 me tere 
Trained 
Intense relationship 
xX O95 Blan 9299 Oa 
s 6.12 eo 6.64 
Less intense relationship 
xX 13,08): | 78:33. | 10871 
s 7.30 3.44 6.1 
Both relationships 
X 23-030) 7eSOn eo 0:44 
s 11.24 9.58 | 10.68 
Untrained 
Intense relationship 
x Fel Ox) M923, |p B19 
S 4.50 4.75 4.71 
Less intense relationship 
Xx TOL 636 |) 6:70 
Ss 4.56 ors 4.92 
Both relationships 
xX 14.20 | 15.59 | 14.89 
s Tesh, 8.17 7.91 
Both groups 
x 18.61 LOS 1767 
Ss 1Oz5i7 8.90 9.77 














indicating that ratings made according to 
Method 1 and Method 2 did not differ sub- 
stantially from one another. A significant 
interaction between training and method at 
the .05 level was found to exist. A further 
evaluation of differences using Scheffé’s 
method indicated that the trained and un- 
trained groups differed significantly only 
when they made ratings according to Method 
1. Hence, the significant interaction reflected 
the effect of training on Method 1. 


TABLE 2 


Two-Way ANALYSIS OF VARIANCE: THE INFLUENCE OF 
‘TRAINING AND METHOD ON THE HALO EFFECT 














eee wl ay SS MS F 
variation . 
Training if 935.36 | 935.36 10.86 
Method it 103.50 | 103.50 1.20 
Interaction it SLs |) SAhlsw 3.968 
Within groups} 116 | 9985.31 86.08 
Total 119 | 11365.49 
a F95 = 3.92. 
b F 99 = 6.85. 
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Influence of the J-O Relationship on the 
Halo Effect 


The influence of the J—O relationship on 
the halo effect was tested by chi-square. The 
basic score was the ratio of the sum of the 
variance scores for the three high-intensity 
Os, over the sum of the variance scores for 
the three low-intensity Os. Each J was rep- 
resented by one value or ratio score. The null 
hypothesis of the chi-square test was that, 
with an F distribution of 15 and 15 degrees 
of freedom, the 30 values (30 per group) 
are randomly observed. The degrees of free- 
dom were based on the number of Os per 
relationship (3) multiplied by one less than 
the number of traits (5). The probability 
tables for such an F distribution are pre- 
sented by Guenther (1965, p. 298). As the 
smaller the variance score the greater the 
evidence for the halo effect, it followed that 
when the ratio value was less than one, the 
ratings of the favorable Os exhibited greater 
halo than those ratings of the indifferent Os. 
When the ratio value exceeded one, the 
ratings of the indifferent Os exhibited greater 
halo than those ratings of the favorable Os. 

The chi-square test for each of the four 
experimental groups indicated that there was 
no significant difference between Method 1 
and Method 2 for both the trained and un- 
trained groups, so that the data were com- 
bined (Table 3). The null hypothesis of the 
chi-square was accepted for the trained group 
and rejected for the untrained group, at the 
.05 level of significance. A difference between 


TABLE 3 


CHI-SQUARE: THE INFLUENCE OF INTENSITY OF RELA- 
TIONSHIP ON THE HALO EFFECT WITH 
TRAINING HELD CONSTANT 














Observed frequency 
F distribu- | Expected 
Quarters) tion of ratio fre- 
scores quency | Trained Je 
7 trained 
1 .000— .701 15 Dat 13 
eZ, .702-1.000 15 11 11 
3 1.001-1.430 15 14 10 
4 Over 1.430 15 14 26 
Chi-square 3.6 sls 











aX295 = 7.8. 
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the intense and less intense relationship was 
thus found only for the untrained group. 
An examination of the mean variance scores 
(see Table 1) indicated that for the un- 
trained group the ratings of the less intense 
relationship exhibited a greater amount of 
halo, while for the trained group the ratings 
became differentiated to the point where the 
relationship between the J and the O did not 
play a significant role. 


DISCUSSION 


The conception of the measure of halo as 
a variance score was a new one. The variance 
score was a direct statistical translation of 
the definition of halo which stated that “a 
particular rater tends to rate a particular 
ratee similarly on all traits” (Guilford, 1959, 
p. 146). With the variability of ratings 
built into the basic score, individual differ- 
ences in the relative amount of halo among 
Js could be identified. In addition, group dif- 
ferences could be found through the analysis- 
of-variance technique so that conditions 
which functioned to increase or decrease the 
amount of halo in ratings could be identified. 
This procedure is practical and can be applied 
to any setting where Js are rating Os on 
graphic rating scales. However, a shortcoming 
of the procedure is that the statistical signifi- 
cance of the objective and relative components 
of the halo effect cannot be determined in- 
dependently as in Guilford’s (1954) model. 
One has to assume that a large difference in 
variance scores indicates significant variation 
in the relative component of the halo effect. 
Such a difference was found between the 
trained and untrained groups. 

Training small groups of inexperienced Js 
was very effective in reducing the amount 
of halo even when such training was brief 
(1 hr.). The fact that the subjects were ac- 
customed to the role of the student and ac- 
cepted the experimenter as an _ instructor 
probably had a positive influence on the 
findings. 

The training process was affected by two 
uncontrolled features, the first of which led 
to a difference in the observed motivational 
level between the trained and untrained 
groups. It is generally assumed that Js who 
are more involved and more interested in 
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the task will make more careful and more 
accurate ratings (Conrad, 1932). In the 
study, the trained group had more contact 
with the experimenter and more general in- 
formation about the study. It was observed 
that the trained Js wrote more on the Re- 
action Questionnaire and exhibited greater 
interest in the results of the study than the 
untrained group. Therefore, it is proposed 
that the trained Js had more invested in the 
ratings they were making than the untrained 
Js and hence took more care in making 
discriminating ratings. Further exploration of 
other effects of the training period, apart 
from learning, is required. For example, to 
what degree is the amount of halo in ratings 
affected by increased experimenter-J contact 
or by a desire to please the experimenter? 

The second uncontrolled feature of the 
training process lay in the content of the 
training session and is proposed as an ex- 
planation for the significant interaction found 
between training and Method 1. During 
the training session, practice in rating and 
discussion of ratings were based solely on 
Method 1. It follows that Js who rated 
according to the same method by which they 
were trained could apply what they had 
learned more directly than could the Js who 
rated according to Method 2 and were trained 
according to Method 1. This shortcoming in 
the study points out the importance of having 
the training session equally relevant for all 
the variables in the experiment. 

As for the method of rating, a sizable 
sample of student nurses rated Os with 
whom they were familiar similarly whether 
they used Method 1 or Method 2. Added to 
the accumulated body of research (Blumberg 
et al., 1966; Johnson, 1963; Taylor & Hast- 
man, 1956), the findings affirm the irrelevance 
of this methodological variation for the halo 
effect. Therefore the often suggested recom- 
mendation that ratings should always be 
made by Method 2 (Guilford, 1954, p. 279) 
seems poorly founded in the case of graphic 
rating scales. 

The significance of the J—O relationship 
for the halo effect varied with the training 
or lack of training of the Js. For the untrained 
group, the ratings of indifferent Os, the less 
intense relationship, exhibited greater halo 


INFLUENCES ON THE HAto Errect 


than the ratings of favorable Os, the intense 
relationship. As the indifferent Os were less 
well known than the favorable Os, the find- 
ings of Bare (1954) and Koltuv (1962), 
that increased acquaintance is related to 
decreased bias in judgment, seem to be 
supported. However, in the present study 
none of the Os was as “unfamiliar” to the 
Js as was the case in Koltuv’s study, nor as 
briefly known as was the case in Bare’s study. 
The student nurses had been going to school 
and living together for at least a year and a 
half, so that it could be assumed that the 
Js had some basis on which to make differ- 
entiated ratings for the indifferent as well as 
the favorable Os. A possible explanation of 
the findings was offered by the Js, themselves. 
On the Reaction Questionnaire, they reported 
difficulty in selecting Os for the less intense 
relationship. Apparently, there was greater 
resistance to and anxiety in selecting in- 
different Os than favorable Os, for both the 
trained and untrained group. As the J—O 
relationship did not play a significant role 
in the ratings of the trained group, it is 
proposed that the training period not only 
functioned as a learning experience, but also 
tended to reduce the anxiety of the Js and 
hence the restrictiveness of their ratings. The 
latter function would be particularly pertinent 
for ratings of Os toward whom the Js felt 
“less favorable’ and possibly ambivalent. 
Certainly, the variable of the J—O relation- 
ship needs to be explored further. How is the 
amount of halo affected when the level of 
acquaintance and the social distance are 
varied with each other? Does the amount of 
halo increase with increased social distance 
or is there a U-shaped distribution where the 
amount of halo is greatest for ratings of Os 
toward whom the J is ambivalent? 

The questions posed by this study reveal 
the need for a further understanding of the 
halo effect and the conditions which affect it. 
Examined as unitary factors, the variables 
of training and relationship were found to 
have a significant influence on the amount of 
halo exhibited, whereas the variable of 
method was nonsignificant. It follows that the 
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conditions of training and J—O relationship 
need to be analyzed into their significant 
components, and the contribution of these 
characteristics to the halo error assessed. 
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DRIVER GAP ACCEPTANCE AT INTERSECTIONS 
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To determine the most probably correct theory of the several currently accepted 
theories of driver gap acceptance, the gap limen was investigated by the 
method of constant stimuli. In a field-test situation, drivers were exposed to 
an approaching vehicle for a 1-sec. interval at various times during its 
approach, Approach speeds of 16, 32, 48, and 64 mph were used. Analysis 
of the data indicates drivers accept a gap equal to 5.4 times the approach- 
vehicle velocity (expressed in ft/sec). The obtained function most nearly 
supports the time hypothesis of driver gap acceptance. 


Driver decisions at street intersections re- 
quire a complex series of judgments. The 
driver is required to detect any approaching 
vehicle, determine range and speed of ap- 
proach, then decide whether to cross, or 
merge with, the traffic stream. Although 
drivers accomplish the task with great fre- 
quency and a high degree of safety, com- 
paratively little is known about the actual 
process or processes utilized by the driver. 
Early experimenters (Hs) made gross mea- 
surements of vehicle movements, and devel- 
oped theories of gap acceptance based on 
these measurements, but comparatively little 
is known about the actual decision processes 
of the driver at the intersection. 

Development of several different theoretical 
approaches to the driver’s acceptance of gaps 
at intersections has followed the initial study 
by Greenshields (1947). Early Es measured 
and cataloged the gap acceptance of drivers 
in free traffic situations to develop the early 
expressions of the model of driver gap accept- 
ance. Following are the major hypotheses 
relating gap acceptance to various aspects of 
cue structure or cognitive organization of such 
cues. 

1. Time hypothesis: The Greenshields 
model considered gap acceptance judgments 
as if the driver made a precise estimate of 
arrival time of the approaching vehicle. Thus 
a driver would make the judgment that there 
was some time period before the oncoming 
car would arrive at the intersection and thus 
he could, or could not, cross, or merge with 
the mainstream traffic. 

2. Modified-time hypothesis: Evolving from 
the time hypothesis, a more complex expres- 


sion for the time-gap acceptance decision 
process was developed by R. L. Moore 
(1956). Working with pedestrians, he re- 
quired them to make an estimate of the last 
possible moment they would attempt to cross 
a roadway. (They did not actually cross.) 
The function he developed was of the order 
D=45+2.9V, where D = distance in feet 
from the pedestrian to the approaching 
vehicle, and V=approach speed of the 
vehicle in feet per second. His experiment 
covered the range of approaching vehicle 
speeds of 15-60 mph with 100 observations 
at each data point. One particular item of 
interest is that as speed of the approaching 
vehicle (V) increased, there were more po- 
tentially hazardous errors in that insufficient 
time was allowed to cross the street. In fact, 
at 60 mph there were 35 judgments where 
insufficient time was allowed for crossing 
of the 100 total judgments made at that 
speed. 

3. Distance hypothesis: A third model, that 
of constant distance, has been postulated. 
This theoretical approach says that the driver 
will accept a gap of a constant distance re- 
gardless of the speed of the approaching 
vehicle. 

Brain (1962) in a largely theoretical treat- 
ment of gap acceptance considered several 
additional models: 

4. Rate of change of angular position: The 
apparent movement of the approaching vehicle 
in relation to the coordinates of the head of 
the driver in space. Brain showed that these 
cues are of: limited value because the rate 
of change is approximately zero when the two 
cars are approaching each other at speeds 
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which would produce a collision, and also 
when the driver’s car is halted at the stop line. 

5. Rate of change of horizontal dimension 
due to apparent rotation: As the vehicle ap- 
proaches, the changing appearance of the car 
as it changes from head-on to more and more 
side view may provide the cues to cross or 
not to cross. Brain contends that the values 
derived from the condition of two cars 
on a collision course with the driver’s vehicle 
halted at the stop line will provide insufficient 
cues. This would, in addition, require the 
postulation of a mechanism to develop the 
length/breadth ratio required and then in- 
tegrate the length/breadth ratio over time to 
develop the decision. Brain considers that the 
complexity of such mechanism makes this 
model highly improbable. 

6. Rate of change of dimension due to 
distance: Brain worked with height of the 
vehicle to derive a theory of angular velocity 
similar to that of Michaels and Weingarten 
(1965). Although Brain’s (1962) theoretical 
derivation indicates this model is most de- 
scriptive of the process the driver follows, 
after experimental work he states, “It would 
appear . . . that one relies on visual image 
size based on past experience rather than 
calculus for estimating minimum time inter- 
vals [p. 791].” 

7. Experience: The method, according to 
Brain, which is apparently used by drivers. 
The crossing decision is based on known size 
of the visual image and the expected perform- 
ance of the approaching vehicle, and not on 
subtle cues derived from the change or rate 
of change of the image of the approaching 
vehicle. 

For the most part, all preceding studies 
(which used Ss) have used free observation 
wherein Ss could see the approaching vehicle 
until he made his decision. The decision of 
S was thus a summation of information over 
an extended period of time. 

In the present study, an information 
acquisition time (IAT) of 1.0 sec. was 
selected to permit a driver sufficient time to 
make a crossing decision. Use of an JAT 
of 1.0 sec. was considered to have two ad- 
vantages: (1) it appeared to be close to the 
actual decision times utilized by drivers, and 
(2) it permitted a precise determination of 
the threshold of crossing decisions due to the 
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experimental control over the cues used to 
make the gap acceptance decision. 

This experiment was for the purpose of 
determination of the threshold of a driver’s 
gap acceptance by the use of the psycho- 
physical method of constant stimuli in order 
to permit a comparison of the different 
models of crossing behavior which have been 
proposed by several different Es. 


METHOD 
Subjects 


Four male Bureau of Public Roads employees 
who had not had prior contact with the experiment 
acted as Ss. Vision was tested and found to be 
within normal tolerances for all Ss. Age ranged 
from 31 to 62 yr. with an average of 51 yr. Driving 
experience was commensurate with age. 


Apparatus 


Two automobiles were used. One was parked 
at an “intersection” while the other was the stimulus 
vehicle (SV). 

The S was seated in the parked vehicle with his 
vision restricted by an electrically operated shutter 
mounted in the driver’s side window of the auto- 
mobile. A chin rest associated with the shutter 
assured precise positioning of S’s head. A system of 
pneumatic tubes and electronic timers controlled 
the shutter opening time and duration of 1.0 sec. 
Shutter opening time was controlled for each run 
by the preset times which were manually inserted 
into the timer before each run. Total time for each 
run was also recorded in order to measure ac- 
curacy of SV approach velocity. 


Procedure 


The testing was conducted on an airport runway 
200 ft. wide and 3000 ft. long. 

With one vehicle “stopped” at a simulated inter- 
section with a through street, the SV approached 
from the left and passed in front of the S in the 
stopped vehicle. The centerline of the SV path was 
approximately 20 ft. from S and perpendicular to 
the centerline of the stopped vehicle. 

Four approach speeds were used for the SV: 16, 
32, 48, and 64 mph. Preset times were so deter- 
mined that the approach times, the time between 
the closing of the shutter and the SV arriving at a 
point directly in front of S$, were in the same 
increments for all speeds. Seven approach times 
were used ranging from 1.8 to 9.0 sec. in 1.2-sec. 
steps. 

With 4 speeds and 7 approach times there were 
thus 28 conditions (cells). Although the method of 
constant stimuli was used, a maximum of 10 trials 
was presented for any cell due to time and cost of 
each trial. To further reduce the number of trials 
the method of sequential analysis was used. Necessary 
data for the sequential analysis computations were 
obtained from pilot data. Using cumulative totals 
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for each cell, further trials were discontinued when 
the number of “yes” or “no” responses exceeded 
the limits of the sequential analysis table. 

The task of the S was to observe the approaching 
vehicle and make judgments. The following in- 
structions were read to S: 


We are conducting a controlled study of people’s 
ability to determine whether they have appropri- 
ate gaps in traffic so they can cross through 
intersections. Through our simulated intersection 
you are asked to observe the vehicle that is ap- 
proaching just as though you were sitting at a 
stop sign at an intersection. At a certain period 
of time during the approach of the vehicle a 
shutter in this viewing device will open. You 
are to judge during this period whether or not 
you would accept the gap and cross through the 
intersection. Base your decision on the position 
of the car just as the shutter closes. Try to 
make judgments as you normally do when driv- 
ing. Prior to the time when you are requested 
to make a judgment I will say “ready.” At that 
time you should put your chin in the chin rest. 
Shortly afterward you will see the shutter open. 
After the shutter closes you are to indicate by 
saying “yes” if you would have crossed through 
the intersection, or “no” if you would not. 
Between each of these trials you may take your 
chin away from the chin rest and relax. 


Intertrial interval was about 1.5 min. Each S 
received about 100 trials per day with rest periods 
after each 25 trials. During testing, Ss read books 
or magazines between each trial. 


RESULTS 


In order to determine whether there was 
a difference between Ss, a comparison was 
made of the number of acceptance judg- 
ments at each of the four speeds by use of 
the chi-square statistic. The results indicate 
the performance of Ss was different: at 16 
and 32 mph the difference was significant 
at the p < .02, while at 48 and 64 mph the 
difference was p< .01. Due to the signifi- 
cant differences obtained and the small num- 
ber of Ss, extreme caution should be used in 
interpretation of further analysis. Whenever 
possible, each S will be considered sepa- 
rately. 

Determination of the limen and standard 
deviation of gap acceptance was performed 
for each S by the normal graphic process 
(see Table 1). The limen in the present 
experiment is the point at which S would 
theoretically accept a given gap on 50% of 
the trials. The obtained limen was the same 
as the approach time each man would accept 
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TABLE 1 


Time LIMEN AS A FUNCTION OF APPROACH-— 
VEHICLE VELOCITY 








Velocity of approach vehicle* 











Subject 
16 oo, 48 64 

Si 

Limen 3.05 2.97 4.0 on 

SD noe oD Rif 54 
Se 

Limen 7.15 53d 6.30 5.28 

SD 63 EoD. 83 eo 
S3 

Limen 7.67 5.33 4.89 4.18 

SD 87 .33 rAS a0 
S4 

Limen 4.85 6.75 6.69 8.05 

SD 98 1.67 BS 91 
Mean limen 5.68 5.10 5.47 5:25 
SD .70 .66 70 52 











8 In miles per hour. 
bIn seconds. 


50% of the time. Although there was marked 
individual variation in threshold, individual 
slopes were similar, and the mean approach 
time accepted was remarkably constant over 
the speed range. 

Using the time gap obtained for each 
man, the gap distances which he accepted 
were calculated. These data, presented in 
Table 2, were corrected to reflect the precise 
velocity of the stimulus vehicle. 

To permit comparison of the four cur- 
rently accepted theories of gap acceptance, 
a plot was made of the curves derived from 
previously derived empirical and theoretical 


TABLE 2 


DISTANCE LIMEN AS A FUNCTION OF APPROACH- 
VEHICLE VELOCITY 








Velocity of approach vehicle 


Subject 
16 32 48 64 

Sy FSA?) | 141855 289.64 336.56 

So 171.38 254.03 456.18 507.72 

S3 183.85 254.03 354.08 401.95 

S4 116.25 321.71 484.42 774.09 
M 136.15 243.07 396.08 504,84 
SE 25.63 SEG0 45.19 96.36 

8 In miles per hour. 

> In feet. 
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Fic. 1. Gap accepted as a function of approach-vehicle velocity. 


values obtained from these theories (Fig- 
ure 1). 

The plot of the gap accepted versus the 
velocity of the approach vehicle permits 
graphic comparison of expected results 
which would be predicted by each theory. 
Data from the present experiment (Table 2) 
are included with the range of obtained 
values indicated by vertical lines and the 
crossbar indicating the standard error. These 
data produce a function which fit the equa- 
tion G = 5.4 V, where V = vehicle velocity 
in feet per second. 


DIscUSSION 


The approximately 5-sec. median gap size 
accepted by drivers in the present study 
agrees with Raff (1950) and Bissell (1960), 
and is only slightly shorter than that found 
by Robinson (1951) and Wagner (1966). 
In general, lower speed ranges were utilized 
in previous studies, but the congruence of 
the data, within similar speed ranges, in- 
dicates that the experimental controls im- 
posed in the present experiment did not 


bias the results. The congruence of the data 
with most median accepted gaps in the 5-6 
sec. range provides a strong indication that 
the time hypothesis is the most probable 
basis for driver gap acceptance decision. 

Graphical analysis of the data also tends 
to support the time hypothesis as the basis 
for driver gap acceptance. Analysis of the 
data indicates the time hypothesis most 
closely approximates the correct description 
of the actual gap acceptance process. The 
results obtained in the present experiment 
strongly disagree with the modified-time 
hypothesis. Although at the two lower speeds 
it would appear that any of the hypotheses 
could be supported, the overall trend of the 
obtained function refutes these three hy- 
potheses. 

The constant necessary to approximate the 
fit between the time hypothesis and the ob- 
tained data requires some discussion. The 
marked change in gap distance required 
by Ss is considered to be a function of (a) 
perceptual processes and (0) cognitive fac- 
tors. Perceptual factors include such factors 
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as: image size, vehicular image oscillations, 
and velocity estimation. The cognitive fac- 
tors include: S set, and risk-taking char- 
acteristics. 

Image size is considered to give very 
important cues to distance judgments in a 
static environment. In the present experi- 
ment the short (1.0 sec.) information 
acquisition time (IAT) emphasized any 
tendency of the Ss to make judgments of 
the approaching vehicle as if they were 
making judgments of a stationary object. 
A large proportion of the gap acceptance 
judgment is considered to be based on the 
correctness of judgment of image size. Brain 
(1962) considered that Ss rely on visual 
image size based on past experience rather 
than calculus for estimating minimum time 
intervals. Thus the factors of image size and 
prior knowledge which form the basis for 
image size judgments combine to produce 
one component of the gap acceptance judg- 
ment. 

Vehicular image oscillations contribute 
some cues to the judgment. Brain (1962) 
stated that, “there was a conscious effort to 
allow for the speed of faster vehicles from 
the rate of apparent image growth (over the 
long viewing time he used) and from the 
amount of yaw, roll, and general visual re- 
sponse to carriageway irregularities ‘ 
[p. 791].” Thus it is possible that Ss in 
the present experiment did perceive slight 
vehicular oscillations during the short IAT 
and hence could make some judgments of 
relative approach speed. Using the oscilla- 
tion cue as a modifier of the information 
from the image size/distance cue, S made 
a gap acceptance, or rejection, judgment. 

The perceptual factors are further modi- 
fied by cognitive factors, such as the set of 
the S. The present experiment required Ss 
to respond “normally,” whereas most other 
studies have requested S to judge “the last 
moment for crossing” (Brain, 1962; Moore, 
1956). If Ss were following the instructions, 
then the magnitude difference between the 
obtained data and previous data could be 
reconciled. However, the slopes obtained by 
previous investigators and the slope of the 
present data do not agree. Therefore some 
alternative, or additional, process may be 
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operating in the different gap acceptance re- 
search situations. The results obtained were 
considered to be influenced by Ss acting ac- 
cording to a “normal” set that resulted in 
a more accurate estimation of gap acceptance 
behavior than previously available. 

Risk-taking characteristics of Ss  ap- 
parently differ significantly as indicated by 
their performances in Table 1. Note that S, 
accepted gaps of very similar size at all 
speeds, while there was considerable differ- 
ence in the gap accepted by the other 
drivers. The S; was the youngest of the Ss, 
a bachelor, and inclined to take risks (per- 
sonal knowledge). The other Ss were older, 
married, and not inclined to take risks (from 
interview statements). With present data a 
specific statement cannot be made on risk 
taking. However, future research is planned 
to provide information on this aspect of gap 
acceptance. 
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INDIVIDUAL VERSUS GROUP PROBLEM SOLVING 


IN AN INDUSTRIAL SAMPLE 


JOHN P. CAMPBELL 


University of Minnesota 


A comparison of the quality of solutions to N. R. F. Maier’s Change of 
Work Procedure problem under 3 experimental conditions: (1) individual 
solutions, (2) individual solutions after hearing group discussion without 
criticism, and (3) group solution after discussion, using a sample of 80 2nd- 
and 3rd-level managers from a large midwestern public utility. Motivational 
(Bass’s Orientation Inventory) and biographical correlates of individual and 
group problem-solving behavior were also examined. Group discussion did not 
improve individual problem-solving efforts; group solutions tended to be of 
significantly lower quality. There also tended to be a negative correlation be- 
tween solution quality and orientation toward maintaining group interaction. 


When compared to the efforts of individuals 
working by themselves, do committee meet- 
ings and group discussions facilitate finding 
higher quality solutions to organizational 
problems? As pointed out by Gerard and 
Miller (1967) and Hoffman (1965), this has 
been a viable issue for a long time but con- 
siderable previous research on the topic has 
yielded conflicting results. Much of the earlier 
work tended to show that groups were better 
problem solvers than individuals (Lorge, Fox, 
Davitz, & Brenner, 1958), while more recent 
studies, such as those by Tuckman and Lorge 
(1962), Taylor, Berry, and Block (1958), 
and Dunnette, Campbell, and Jaastad (1963), 
have shown that under certain conditions a 
group may actually inhibit the best effort of 
its members. However, another recent study 
(Hall, Mouton, & Blake, 1963) tends to 
favor the superiority of the group solution. 

The difficulties involved in interpreting 
much of the past research have been cited by 
Lorge et al. who point out that such investiga- 
tions have frequently been carried out on 
ad hoc groups (in which the group members 
have not previously met and probably never 
will again) of students, and the experimental 
problems are often of the puzzle or anagram 
type. Another difficulty lies in the definition 
of what is an individual. Should the average 
group score be compared to the average in- 
dividual score or to some index derived from 
the “nominal” group (Faust, 1959)? In a 
nominal group, individual efforts are com- 
bined in some fashion by an independent 


party, for example, by choosing the best in- 
dividual solution in a group or obtaining a 
composite of the best elements of all the 
individual solutions. Comparing real groups 
to nominal groups seems to be the more mean- 
ingful comparison, and, in general, the more 
recent studies have used this approach. 

In an attempt to fill in some of the gaps in 
previous research, the present study makes 
use of an industrial sample, a complex 
realistic problem, and compares the solution 
quality of real groups versus nominal groups 
as well as real groups versus individuals. In 
addition, an effort is made to relate the differ- 
ential performance of an individual (isolation 
versus group discussion) to certain measures 
of individual differences. Attempts to find out 
what kinds of individuals are benefited or in- 
hibited most by the group experience are 
totally lacking in the literature. 


MeEtTHOD 
Subjects 


The Ss were 80 second- and third-line managers 
from a large midwestern public utility employing 
about 5,000 people. Most of the participants knew 
one another fairly well and many had previously 
been in meetings together. Thus, the groups used 
in the present study at least approach what Lorge 
et al. call a traditional group. 


The Experimental Problem 


The problem used was a slight modification of the 
Change of Work Problem developed by Maier and 
described in Maier, Solem, and Maier (1957). In 
its present form it reads as follows: 
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In a large firm which manufactures communica- 
tion equipment, a four man department has the 
responsibility for the entire production of a par- 
ticular component. Three of the men perform the 
assembly work and the fourth acts as a foreman. 
The assembly operation is divided into three posi- 
tions and the workers have adopted a system of 
hourly rotation among the three jobs. The three 
jobs are not entirely routine and require some- 
what different skills. However, all three of the 
men have been able to master each of the jobs 
at least minimally well but still perform some 
jobs better than others. 

The foreman called a meeting to discuss the 
possibility of their changing their work method to 
one where each man works on only one position. 
This would be his best position according to the 
time study data given to the foreman. Although 
the new method should increase the productivity 
of the workers and thus their piece rate wages, 
the foreman’s suggestion of a change to the new 
method was met with some resistance. The em- 
ployees were concerned about the amount of 
boredom resulting from working at the same job 
all the time and believed that this might actually 
lower production. They were also a bit suspicious 
of management’s motives in suggesting the new 
work method. 

Can you suggest any solution to the problem 
faced by the foreman and his crew? 


The problem has previously been scored by 
classifying solutions as old, new, and _ integrative. 
An old solution is simply to leave things as they 
are, and the new solution consists of adopting the 
work method proposed by management with maxi- 
mum effort directed at communicating the mutual 
benefits of the work change to the employees. 
An integrative solution is a suggestion that the three 
men still rotate on the three jobs but that each 
man spend the most time on the job he does best, 
the second largest block of time on the job he does 
second best, and the least amount of time on the 
job he does most poorly. Thus, the integrative 
solution goes beyond selecting either the manage- 
ment or employee solution and suggests a unique 
alternative that combines the best elements of both. 
For the present study an attempt was made to de- 
velop a scoring system with a larger number of 
categories. The problem was first pretested on 43 
individuals attending an extension course in super- 
vision at the University of Minnesota and 38 in- 
dividuals in a senior level course in personnel psy- 
chology. Most of the people in the extension class 
had supervisory responsibilities of some kind. These 
data were used to develop a wide range of pos- 
sible solutions to the problem. Points were assigned 
to the alternatives roughly corresponding to Maier’s 
progression from old to new to integrative. Scores 
could range from 0 to 60. Doing nothing re- 
ceived a score of 0. The alternative with the highest 
possible score entailed: (a) ranking jobs for each 
person in terms of how well he can perform each 
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one; (b) having each person rotate among the 
three jobs such that he spends the most time on 
his best job, etc.; (c) carefully discussing the ad- 
vantages and disadvantages of the new method in 
a group meeting with full opportunity for the em- 
ployees to air their fears; (d) providing for the 
experimental tryout of various rotation schedules 
so that the employees and foreman will be better 
equipped to choose the best one; and (e) discussing 
the possibility of setting some production goals to 
shoot for during the transition period. Some rep- 
resentative alternatives between these two ex- 
tremes are: rotating between the two most suitable 
jobs and spending more time on the more suitable 
(35 points); equal time rotation between the two 
best jobs but for periods longer than an hour (20 
points); accepting management’s new method with- 
out benefit of experimental tryout or group dis- 
cussion of the pros and cons (5 points). At all the 
steps along the scale, bonus points are given for 
adopting an experimental outlook with regard to 
the various elements in the solution, providing for 
maximum two-way communication between manage- 
ment and the employees, and for attempting to set 
transitional production goals in consultation with 
the employees. 


Other Measures 


All Ss in the study filled out an extensive 60-item 
personal-history questionnaire dealing with family 
and educational background, leisure-time activities, 
work history, participation in community affairs, 
etc., and also the Orientation Inventory (Bass, 
1962) which yields three subscores: (1) self-orienta- 
tion—which reflects the extent to which a person 
describes himself as expecting direct rewards to 
himself regardless of the job he is doing or the way 
he affects others; (2) interaction orientation— 
which reflects an individual’s degree of concern with 
maintaining harmonious relationships in a _ super- 
ficial sort of way at the expense of concern for the 
task; (3) task orientation—which reflects the de- 
gree of concern with completing the job, solving 
problems, working persistently, and doing the best 
job one can. The test consists of 27 items all of 
which are keyed for each of the three scales. 


Procedure 


Forty-eight of the Ss were randomly assigned to 
the experimental group and 32 to the control group. 
Initially all Ss were mailed the biographical question- 
naire and the personality measure. In the same 
mailing the experimental group was given the 
Change of Work Problem and asked to write what 
they considered to be the best solution, in as much 
detail as possible. The control group worked on 
the Circular Assembly Problem (Maier et al., 1957) 
which is also a combination production-process— 
human-relations type problem. Materials were origi- 
nally mailed to 82 Ss, and 81 were returned. 

The Ss were then assembled into 4-man groups, 
12 experimental groups and 8 controls. All groups 
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were asked to work on the Change of Work Problem 
in a two-stage sequence. First the members of each 
group were asked to discuss and offer solutions to 
the problem using a brainstorming format; that is, 
no criticism or evaluation was allowed. They were 
just to explore as many solutions as possible. No 
time limit was imposed, but most groups seemed to 
be satisfied that they had covered everything after 
30-60 min. of discussion. At this point Ss were 
asked to individually write down what they then 
considered to be the best solution to the problem. 
After these “individual-after-group-discussion” effects 
were collected, each group was asked to arrive at 
a consensus; one member of the group was asked 
to record the agreed-upon solution. This usually 
entailed another 20-40 min. of discussion. 

Thus, for the experimental group three different 
solutions were offered to the problem: (1) an in- 
dividual solution while working alone, (2) an 
individual solution after hearing and participating 
in a group discussion, and (3) a group solution. 
The control group offered solutions to the Change 
of Work Problem only under the last two conditions. 

Assuming that the comparison of controls and 
experimentals does not show a significant practice 
effect for the experimental group, a number of 
different analyses are possible. Using individuals 
as degrees of freedom, comparisons can be made to 
see if group discussion increases or decreases the 
quality of individual solutions to the problem. 
Using real and nominal groups as degrees of free- 
dom, comparisons can be made across all three 
experimental conditions. For the nominal groups 
(independent combinations of individual solutions) 
two different group scores were used, the average 
individual score in the group and a composite score 
obtained by putting together the best elements for 
each of the four solutions. The individuals in 
each nominal group were the same individuals who 
later made up a real group. 


Scoring 


The reliability of the scoring system was deter- 
mined first on the pretest data by having three 
individuals (the writer and two graduate students)1 
independently score the 81 solutions and by com- 
puting the intercorrelations among the three scorers. 
It was checked again for two of the scorers on 
data from the experimental group. These two sets 
of scores were correlated for the 48 individual 
solutions to the Change of Work Problem and 
for the 48 individual-after-group-discussion solu- 
tions. Both scorers also derived and scored com- 
posite solutions from the 12 nominal groups and 
scored the consensus solutions from the 12 real 
groups. 

The scores actually used for the experimental 
analysis were obtained by reconciling any differ- 
ences between Scorer 1 and Scorer 2 to their mutual 
satisfaction. 


1 The writer wishes to thank Phillip Pettman and 
Martin Rock for their assistance as raters. 
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RESULTS 


In general, the interscorer correlations 
were relatively high, ranging from .73 to .96 
with an average of .82. Since the scores 
actually used in the analysis were the con- 
sensus of Scorers 1 and 2, the reliability 
of the composite ratings is probably higher, 
but it is impossible to estimate how much. 
The Spearman-Brown, for example, is inap- 
propriate in this case. 

For all types of scores, none of the differ- 
ences between experimental and _ control 
groups was significant at p= .10 using a 
t test. Thus, a practice effect does not seem 
to be a crucial concern. Also, none of the 
treatment effects was significant when in- 
dividuals were used as degrees of freedom; 
that is, there was no significant change in the 
average individual solution score after Ss 
participated in a brainstorming type group 
discussion. 

For the group scores, differences across ex- 
perimental treatments and the interactions of 
experimental treatments with the variables 
measured by the Orientation Inventory were 
examined by a series of two-way analyses of 
variance. Two levels of the orientation dimen- 
sions were obtained by dichotomizing scores 
at the median. The analyses for nominal and 
real groups were carried out for two different 
types of nominal group scores, the composite 


TABLE 1 


ANALYSES OF VARIANCE FOR Groups USING 
INTERACTION ORIENTATION SCORES 























AS LEVELS 
Composite score eae 
used as nominal Romace ed 
as nominal 
Source group score ploutiscan 
df | MS PB MS F 
Between 
Levels 1 | 393.4} 2.40 | 50.1) 1.99 
Error 10 | 164.0 25.2 
Within 
Treatment 2 | 756.8 | 16.12* | 62.6 | 10.12* 
Treatment X 
Levels Dalen De2 SHS || PRESS PAO 
Error 20 | 46.9 6.2 





*p =.01. 
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TABLE 2 


COMPARISONS OF MEAN Group SCORES CLASSIFIED BY EXPERIMENTAL CONDITION 


AND BY INTERACTION ORIENTATION SCORES 


























Orientation test scores ee : iver Group Combined 
responses discussion consensus 
Composite score used as nominal group score 
High interaction orientation score 15.2 10.2 De) 9.3 
Low interaction orientation score 23.5 19.0 5.2 15.9 
Combined 19.4 14.6 3.9 
Average score used as nominal group score 
High interaction orientation score 8.0 ee Das OE 
Low interaction orientation score 8.2 Oy S22. hes) 
Combined 8.1 el 3.9 








score and the average of individual scores 
within the group. 

The superiority of the nominal group was 
significant and large, regardless of which type 
of score was used to reflect the nominal 
group’s performance. However, none of the 
main effects pertaining to the orientation 
variables and none of the interactions of these 
variables with the experimental treatments 
was statistically significant. Since the inter- 
action orientation score was the only dimen- 
sion to approach significance, only the analy- 
sis of variance for Experimental Treatment 
x Interaction is shown in Table 1. The cor- 
responding means are shown in Table 2. 
The means and sums of squares for the treat- 
ments would of course be the same no matter 
which orientation score was used. 

Correlations between the solution scores 
obtained under particular experimental treat- 
ments and the orientation data were also ex- 
amined. For the individual scores the rela- 
tionships were computed using the Pearson r, 
and for the analysis of group scores Spear- 
man’s rank-order correlation was used. In- 
dividual orientation scores were correlated 
with individual solutions obtained before dis- 
cussion and after discussion, and with the 
difference. The rankings of average group 
orientation scores were correlated with the 
ranking for nominal group composite scores 
obtained before discussion, the ranking for 
the group consensus scores, and the difference 


between these two rankings. For example, 
a positive correlation between an orientation 
score and a difference score means that the 
higher an individual or group stands on the 
orientation variable the greater the decre- 
ment in solution quality under the group 
discussion or group consensus condition. 

The analyses using individuals as observa- 
tions yielded no correlations significant at 
p = .05; however, two of the correlations 
using group rankings were statistically sig- 
nificant and several others approached sig- 
nificance. The rank-order correlations for the 
group data are shown in Table 3. Composite 


TABLE 3 


RANK-ORDER CORRELATIONS OF AVERAGE ORIENTATION 
SCORES WITH NOMINAL Group ComposiTE SCORES, 
THE REAL Group SOLUTION, AND THE DIFFER- 
ENCE BETWEEN THE COMPOSITE SCORE 
AND THE REAL Group SCORE 














Solution quality scores 
Score Nominal Real 
group group Difference 
composite | consensus 

Self-orientation Obs 45 STi 
Interaction 

orientation — .68** —.37 —.46 
Task orientation ei} .07 25 

Note.—N = 12. 

= — 05, 

> = 01, 
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scores for nominal groups tended to be higher 
for those groups with high-average self- 
orientation scores and low interaction orienta- 
tion scores. However, groups with high- 
average interaction orientation scores tended 
to show less of a decrement in problem- 
solution quality when they were asked to 
reach a group consensus. Somewhat sur- 
prisingly, the task orientation variable 
yielded the lowest relationships. 

No consistent relationships were found 
with any of the biographical items. 


DIscUSSION AND CONCLUSIONS 


The results of the present study tend to 
support the more recent research on individ- 
ual versus group problem solving cited 
earlier. Group participation and discussion 
did not generate higher scores but rather 
tended to be inhibitory. The quality of the 
group solution was inferior to the nominal 
eroup’s composite score and was even inferior 
to the average individual solution. It should 
be pointed out that such a finding is de- 
pendent on the type of problem used and the 
manner in which it was scored. Also, nothing 
has been said about the properties of the 
group situation which might have led to the 
reduction in quality scores. Perhaps new and 
better alternatives were stimulated but were 
just never expressed. 

Groups seemed to settle very early on a 
particular line of attack, and alternative work 
procedures were seldom explored in the group 
discussion. However, this did not seem to 
occur in an atmosphere of obvious inhibition 
and threat. Most Ss knew one another on a 
first name basis and all seemed highly mo- 
tivated and enthusiastic. Also, it is doubtful 
that time limitations could explain the in- 
feriority of the group solution. A 24-hr. period 
was set aside for the experimental session 
and no time limitations were imposed. The 
absence of such a limitation was reinforced 
by the experimenter. The experimental ses- 
sions were relaxed but task oriented, and none 
of the Ss expressed any discomfort because 
of being away from his job too long. The 
degree of S cooperation and involvement was 
ideal. 

The results obtained with the Orientation 
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Inventory are very tenuous but carry the sug- 
gestion that high self- and task orientation 
and low interaction orientation groups per- 
form better on problems such as this, both 
when their efforts are combined by an in- 
dependent party and by consensus. However, 
high interaction orientation groups seem to 
suffer less than low interaction orientation 
groups when asked to reach a consensus. 

In sum, the present study casts further 
doubt on the notion of group facilitation of 
problem solving and this time with a sample 
from an appropriate population and with a 
more realistic experimental problem. How- 
ever, it should be remembered that the group 
meeting has a number of other uses besides 
problem solving. Even when problem solving 
is the aim, there are criteria other than 
solution quality on which to judge the effec- 
tiveness of the group experience. Maier and 
Hoffman (1964) have attempted to categorize 
some common management problems into two 
classes, those for which finding a solution is 
the crucial aspect and those for which 
a number of solutions are possible, but the 
important consideration is the motivation 
of the individuals implementing the solution. 
In the latter case, the effects on solution 
quality may not be as important as the mo- 
tivational aspects of group participation. 

In addition, the present study and those 
previously cited have said nothing about the 
utility of individual versus group solutions. 
Perhaps it takes considerably more time in 
an organizational setting to ask the group 
members to problem solve individually and 
then evaluate and combine the individual 
solutions than it does to hold a group meet- 
ing. If the cost of this additional time out- 
weighs the benefits of the superior solution, 
it may not be worth the effort. Conversely, 
Hoffman (1965) has pointed out that a 
number of methods exist for enhancing the 
problem-solving effectiveness of groups. How- 
ever, the question still remains whether the 
added effort required to implement such tech- 
niques yields solutions which result in the real 
group being superior to the nominal group. 
The relative utility of various problem-solving 
strategies for different types of problems is an 
area which remains unexplored. 
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ENVIRONMENTAL FRUSTRATION AND CREATIVE 
PROBLEM SOLVING 


BERNARD L. HINTON 1 
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Report of an empirical study of the effect of environmental frustration on 
creative-problem-solving performance. The research design is based upon a 
linear model of creative problem solving. Alternate forms of analysis are 
applied, testing the relationship between environmental frustration and crea- 
tivity under various sets of assumptions about the experimental conditions. 
It is concluded that environmental frustration does significantly reduce crea- 


tive-problem-solving performance. 
alternative assumptions about the “ 
and ability factors. 


Recent research on creative problem solv- 
ing has tended to focus primarily on the 
factors of the creative person (e.g., Barron, 
1953, 1954, 1955, 1957; Dentler & Mackler, 
1964; Drevdahl, 1956; Garwood, 1964; Guil- 
ford, 1959; Hitt & Stock, 1965), the creative 
process (e.g., Crutchfield, 1961; Ghiselin, 
1956; Harmon, 1956; Taylor, 1959), or the 
measurement of creativity through the evalua- 
tion of the creative product (e.g., Ghiselin, 
1958; McPherson, 1956; Taylor, Smith, & 
Ghiselin, 1959). The research which has 
been reported on investigation of the creative 
environment has been almost exclusively re- 
stricted to studies of organizational climate 
or group climate (e.g., Crutchfield, 1962; 
Fiedler, 1962; Gibb, 1951; Ziller, Behringer, 
& Goodchilds, 1962), with considerably less 
interest being shown in the micro- and psy- 
chological climate as it pertains to the in- 
dividual problem solver. 

Maddi (1964), on the basis of his experi- 
ence, has labeled some generally held assump- 
tions about the effects of psychological stress 
as being ‘“‘old wives’ tales.” One of these as- 
sumptions concerns the reduction of creativity 
in frustrating environments. Maddi argues, 
in part, that a creative person can ignore, or 
at least overcome, such restriction and offers 


1 Now at Indiana University, School of Business. 
This article is derived, in part, from a doctoral 
dissertation completed at Stanford University. The 
author wishes to express appreciation to Charles 
W. Haley, now at University of Washington, for 
his assistance on the model development and to 
Thomas W. Harrell for his many contributions to 
the research effort. 


This 
rest of the world” variables of personality 


is supported under a number of 


case examples to support his position. The 
research reported herein was designed to 
test the validity of that position; the null 
hypothesis is: Creative-problem-solving per- 
formance will not be influenced by a state 
of frustration experienced by the problem 
solver. 


PROCEDURE 


A battery of criterion measurement instruments 
was administered, comprised of an Alternate Uses 
test, a Match Problems test, and a Consequences 
test (scored for both “obvious” and “remote” re- 
sponses). Certain modifications to the published 
form of these tests were found to be necessary due 
to the pre-post-test research design. Primarily these 
required the partitioning of the instruments into two 
comparable forms. In the case of the Match Problems 
and the Consequences tests this merely required a 
physical division of the test booklets. The reliability 
of the Match Problems test was judged to suffer 
a negligible amount by such action. The reliability 
of the Consequences test had already been demon- 
strated to be unaffected, as the manual reported as 
few as eight items could be used without affecting 
the reliability, and after generation of alternate 
forms there remained 10 items in each form. 

In the case of the Alternate Uses test, however, 
a split into halves was judged to have a serious 
effect on reliability and thus a parallel set of test 
questions was deemed necessary. This generated set 
was balanced, pair-wise, against the original set 
according to several dimensions, including the nature 
of the objects listed in the test, the relative size of 
the objects, and the estimated cost of the objects. 
A sample pretesting of the response frequency of 
this set against the original set indicated com- 
parability. To further reduce the introduction of 
bias or error, the two sets were then merged, with 
each set having half of its items from the original 
set and half from the generated set. Additionally, 
the half from the generated set were the paired items 
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to those now placed in the alternate form. A random- 
sample item analysis on response frequency after 
the original collection of data indicated that the 
original and generated sets of items were com- 
parable in that no significant differences occurred 
in the mean number of responses given to the 
generated items and those given to the original items. 

The initial testing was carried out at the beginning 
of the academic year. The research Ss were entering 
Master of Business Administration (MBA) candi- 
dates in the Graduate School of Business at Stan- 
ford University. The data were collected prior to 
registration and in conjunction with the collec- 
tion of data for other on-going research. The struc- 
ture of this situation was such as to tap a situation 
where the level of frustration was probably as low 
and as randomly distributed as it is possible to 
achieve in students in the real world. At the least, 
the environment contained no identifiable sources 
of frustration. What frustration may have been 
present, from whatever source, does of course serve 
to increase the random-error term in the analysis 
presented later. 

Parallel test forms were used for the second 
administration, which was carried out about 6 wk. 
later, well into the fall quarter. Prior observation of 
MBA students had indicated that this point in 
time was for many of them a peak in their level 
of frustration as they underwent an adjustment to 
the high ljevel of output required of them by the 
academic program. Additionally, the tests were ad- 
ministered immediately after the return of a major 
paper, a task on which many students had failed to 
achieve their expected performance level. 

Also administered were two sheets of graphic 
rating scales which were designed to measure the 
emotional state and attitudes of the testees. In- 
cluded were distractor questions intended to help 
mask the prime interest area. The second of these 
sheets presented eight scale measures on which the 
S indicated what his emotional state had been 
while taking the tests. The S’s response on the 
frustration scale was of primary interest. 

Scoring of the test battery was done after all 
of the data had been collected, but was independent 
of S’s responses on the graphic scales. (It was 
“blind” in that the quantification of the scale 
responses was the last step in the scoring procedure.) 
This was deemed necessary in that a fair degree 
of scoring subjectivity was implicit on both the 
Alternate Uses and Consequences tests. The Match 
Problems test was scored objectively. The scores 
were recorded separately on each factor, as a 
joint score on the Consequences test, and as a 
total (additive) score. Analysis was carried out on 
each of these six sets of dependent variable measures. 

The administration, in both instances, was carried 
out in large groups and in a classroom situation. 
The intent was to hold as constant as possible the 
physical setting (thus avoiding the introduction of 
extraneous stimuli or cues) and the sociological 
setting. The Ss were not informed of the nature 
of the criterion (creativity) until after the testing 
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had been completed, although the nature of the 
tests may have given some indication of this. 

After the collection of the data had been ac- 
complished, as described above, the raw scores on 
the various creativity measures were transformed 
into standard scores (TJ scores) with a mean of 50 
and a standard deviation of 10. This procedure 
rendered the scores on the alternate forms of the 
criterion instruments used directly comparable. The 
initial sample size was 226, but after the elimination 
of those Ss for whom incomplete data were received, 
and all of those whose first language was other 
than English, the final sample size was 147. 
Data were incomplete on the criterion test battery 
in only three cases. The remaining Ss eliminated 
because of incomplete data either failed to complete 
the graphic scales or gave identical responses to each 
scale. 


THe Moper 


This research project was designed through the 
use of a model of creative problem solving (Hinton, 
1966) which has been suggested as offering a basis 
for the comparison and design of creativity re- 
search. The proposed model hypothesized a simple 
linear relationship between the variables of person, 
environment, process, and product. Process is con- 
sidered, however, to be an intervening variable, and 
is not directly expressed in the model. The model’s 
initial expression is thus: 


Cis =i Aigx se B Paina 
Y Bige t+ 6 S¥e3n +X S**i3n +e [1] 


where C represents an index of creative-problem- 
solving performance, A an index of given ability or 
intellect factors, P an index of personality factors, 
E the psychological or emotional state, S* the 
sociological setting, and S** the physical setting. 
In each case the subscript z identifies an individual, 
the subscript j the specific variable, and k the point 
in time. 

Expressed verbally, creativity is a function of the 
person and the environment where the person is 
represented by measures of his general level of 
ability and measures of his personality makeup 
(although these are not necessarily independent), 
and the environment is comprised of psychological, 
sociological, and physical elements. 

For the current research effort Cij; is defined as 
the performance or score on the research instruments 
or tests of creative-problem-solving performance. 
Exjx is defined as the measured value taken from the 
eight graphic rating scales on which S indicated 
his emotional state while completing the test battery. 
There are, therefore, a total of six j’s for Cijx, and 
eight j’s for Ei;,. Within the time span and structure 
of this research effort the values of Atjx, Pijx, S*ijx, 
and S**;,;, are unknown but assumed constant. That 
is, the change A Ai, A Pes, A S*iy, and A S**4; 
from the first administration to the second ad- 
ministration is assumed to be essentially zero. 
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TABLE 1 


Model statement 


Car = Cr72 Simple correlation: Ci;1/Ci;2 


Covariance analysis: 
Emotional State Measure 
ACs = y Eijn2 +e 


factor) 





Caj2= 5 Cai + 
¥ Hijo + € 


Multiple correlation : 
Ciz2/Cigr, Lage 


HYPOTHESIS AND SCHEME OF ANALYSIS 


Data analysis to be applied* 


Variate = C,;»; Covariate = C,;1; 
Treatment Groups allocated by 


Simple correlation: C,;/Li;2 for Ly; = 
(Frustration) and for #;; = (3-scale 


Assumptions required for 
unbiased estimate 


Ass eae C# ijk 


= Ajiz03 Ea a 


= Pi30; Liz, random 
Hie 


A a ADs Teel 


with respect to 


A ii = A i323 [eee es 





Piss 


a Analysis applied is not necessarily direct test of model statement. The analysis tests strength of relationship rather than 
yielding coefficient for factor in model statement. The results must be interpreted accordingly. 


Table 1 shows the model statements which may 
thus be derived? and the analysis to be applied. 

It should perhaps be noted that the tests applied 
were not specifically testing the model statements. 
That is, the author was interested in the strength 
or predictive power of the relationship rather than 
the value of the coefficient. While the model does 
imply that it is possible to accurately quantify 
the relationship, it was in fact not realistic to do 
so, in view of the nature and design of the research 
and in spite of the fact that such coefficients could 
readily be calculated. 


RESULTS 


The emotional state scale measures (quanti- 
fied values) were analyzed, and the inter- 
corrélations shown in Table 2 determined. 
Primary interest is upon the frustration 


2 For the derivation of these statements the reader 
is referred to Hinton (1966). 


measure, but analysis was also carried out 
on a factor composed of three scales (frustra- 
tion, discomfort, and dissatisfaction) which 
show fairly high intercorrelations and predic- 
tive power in terms of the criterion measures. 
Such a factor was presumed to have greater 
stability than the single measure factor, but 
the results did not show many significant 
differences between the two factors. 

Following the analysis scheme, as presented 
in Table 1, correlation coefficients (shown 
in Table 3) were obtained between the first 
and second administration scores on the 
creativity criterion measures. 

The correlation between these scores would 
have an expected value of 1.00 only if the 
reliability of the measurement instruments 
were perfect and if all other influencing fac- 
tors were constant in their effect from Time 














TABLE 2 
CORRELATION MatTrrx—EMOTIONAL STATE SCALE MEASURES 
am == ——- i =" 
Feeling Frustra- Discon- Discour- Inade- nee | aoe Discom- | Dissatis- 

popne tion tentment | agement quacy a | ee ee fort faction 
Frustration 1.000 alice .660 Lo52 434 054 .962 625 
Discontentment 1.000 2S .035 041 038 310 192 
Discouragement 1.000 695 eS .005 469 617 
Inadequacy 1.000 459 004 492 474 
Anxiety 1.000 187 .442 411 
Aggression 1.000 142 054 
Discomfort 1.000 .530 


Dissatisfaction 


1.000 
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TABLE 3 


CORRELATION COEFFICIENTS FOR FIRST AND SECOND 
ADMINISTRATION SCORES ON CREATIVITY 
CRITERION MEASURES 


a ; Correlation coefficient 
Criterion measure (C;;x) = IC 
Cijrat/Cizkm2 


(7 = 1) Alternate uses .6466 
(7 = 2) Match problems 4899 
(7 = 3) Consequences—obvious .6104 
(i = 4) Consequences-remote 4133 
Gj = 5) Consequences-total .6885 

7147 


(7 = 6) Total score 


1 to Time 2. In view of the reliability esti- 
mates (from manual, with range from .70 
to .86) and the assumptions concerning 
variable changes over the time period con- 
cerned, the correlations shown do indicate 
that a change in the environment (the only 
variable in the model assumed to have 
changed) had some effect on creativity. In 
order to test the significance of this, and to 
relate it directly to frustration, a more direct 
test of the null hypothesis was carried out 
using covariance analysis. 

The Ss were divided into three groups 
according to their response to the scale 
Frustration on the Emotional State Scale 
Measure. This assignment was done accord- 
ing to two schemes, the first assigning cases 
to groups according to an equal-interval sub- 
division of the frustration-scale responses, 
the second according to a ranking of the 
frustration-scale responses, and then a split 
into three groups of approximately equal 
size. Ranking ties were not split, thus a 
minor inequality in group size. These assign- 
ments are shown in Table 4. 


TABLE 4 


Casr ASSIGNMENT FOR COVARIANCE ANALYSIS 





F scale value Group assigned} No. cases 





E32 = 0.00 to 5.00 1 
Hj = 5.01 to 9.99 py 75 
F332 = 10.00 to 15.00 3 





Lixj2 = 0.00 to 6.20 4 51 
Eizo = 6.21 to 9,99 5 44 
Ej;2 = 10.00 to 15.00 6 52 
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An analysis of covariance was then carried 
out on these “treatment groups” using the 
Cijo or creativity criterion test score on the 
second administration as the variate and the 
Cij, or creativity test score on the first ad- 
ministration as the covariate. This was done 
for each of the criterion measures. The results 
of this analysis are shown in Table 5. 

Accordingly, since the analysis shows the 
differences to be significant, and the only 
variable presumed to have changed is the 
emotional state, the null hypothesis is re- 
jected for each of the criterion measures ex- 
cept the Consequences—Total score. That is, 
the conclusion is drawn that the emotional 
state, or more specifically the state of frustra- 
tion, did have an effect on creative-problem- 
solving performance. 

The next test applied evaluates the strength 
of the relationship between the change in 
the creativity criterion test score (A Ci) and 
Eij2, the emotional state at the time of the 
second administration, for Ejj2 = Frustration 
and for ij. = Frustration + Discomfort + 
Dissatisfaction. 

Summary statistics for this analysis are 
shown in Table 6. This table gives both the 
correlation coefficient and the rv? (coefficient 
of determination) for each of the measures. 
These results are significant at the .05 level 
or better for all criteria except the Con- 
sequences—Obvious. 

The next test to be carried out was that of 
the multiple relationship of Cij2 with Cj, and 
Exjo, that is, the creative-problem-solving 
score on the second administration as ex- 


TABLE 5 


COVARIANCE ANALYSIS F RATIOS FOR 
CREATIVITY CRITERIA 








5 we F ratio for |F ratio for 
Criterion 





Groups Groups 

(Cis2) 1,2, 3) \aeee 

(7 = 1) Alternate uses 9.099** 9.996** 

(7 = 2) Match problems 6.193") | io 34cm 
(7 = 3) Consequences-obvious | 3.083* 3.683* 

(7 = 4) Consequences-remote | 37.414** | 43.049** 
(j = 5) Consequences-total 2.986 2.860 

(j = 6) Total score 11.2477) eS 


* » <.05; F = 3.07 or more. 
kD < 01; F = 4.7 


a 
4.79 or more. 
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TABLE 6 


CORRELATION COEFFICIENTS FOR EMOTIONAL STATE MEASURES AND CHANGE IN 


CREATIVITY CRITERION MEASURE SCORES 














a Ei;o = Frustration Eis;2 = 3-scale variable 
Criterion haf Poa S i = a. Pe ae 
(AC;;) 9 | 5 
; ‘i | y2 
(7 = 1) Alternate uses — .3019** 0911 — .3028** .0917 
(7 = 2) Match problems —.2599** .0676 —_2124* 0451 
(7 = 3) Consequences—obvious .1748* .0306 .0367 .0013 
(7 = 4) Consequences-remote — .5694** 3242 — .3563** | 1269 
(7 = 5) Consequences-tota] — ,1933** 0374 = 2008" | .0670 
(j = 6) Total score — ,3690** 1362 | — .3631** | 1318 
p< 05. 4 : tid 7 a Fo 
BH < 01, 


plained by the parallel score on the first 
administration and the effects of the state of 
frustration indicated by the emotional state 
measure. This should theoretically give com- 
parable results to that presented in Table 6 
except that, whereas the analysis presented 
in Table 6 assumed the effect of Fi, the 
emotional state of the S at the first ad- 
ministration, to be randomly distributed with 
respect to the emotional state at the second 
administration (jj2), no such assumption 
is required for this analysis. Any effect the 
emotional state had upon creativity for the 
first administration is now adjusted or ac- 
counted for in the term C;;;. The summary 
statistics for this analysis are shown in 
Table 7. 

To the degree that the results presented 
in Tables 6 and 7 differ, such differences may 
be ascribed to the presence of some degree 
of frustration at the time of the first ad- 
ministration, some unknown measurement 


error, and the predictive contribution of Cij1, 
which has already been seen to be from .41 to 
./1 (Table 3). For direct comparison, the data 
in Tables 6 and 7 need to be corrected for 
this. The results in both analyses do indicate 
that the state of frustration evidenced by S 
contributes a significant effect to that S’s 
creative-problem-solving performance. 

In summary, then, the data analysis has 
indicated that the emotional state of an in- 
dividual, and more particularly the state of 
frustration, does significantly influence his 
creative-problem-solving performance. This 
finding is supported by both the covariance 
analysis presented and by the multiple-regres- 
sion analysis presented. 


DIscussION 


The objective of this research effort has 
been an investigation of the effects of an 
individual’s emotional state, specifically his 


TABLE 7 


Mou trieLe CorRELATION COEFFICIENTS—CREATIVITY CRITERION MEASURE SCORES FOR FIRST AND SECOND 








ADMINISTRATIONS AND EMOTIONAL STATE MEASURES 














Criterion (C,;1), Eiz2 = Frustration Criterion (C;;1), 2:32 = 3-scale variable 
Criterion = = i = 
R R R rR 

(j = 1) Alternate uses 7104 5047 .6817 4647 
(j = 2) Match problems Pople .3038 5644 .3186 
(7 = 3) Consequences—obvious .6303 | 3977 .6114 | 3738 
(j = 4) Consequences—remote 7017 4924 5618 | 3156 
(j = 5) Consequences-total 1077 | 5008 7236 | .5236 
(j = 6) Total score 7725 | 5968 7707 5939 
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state of frustration, on creative-problem-solv- 
ing performance. 

Used as criterion measures have been three 
tests which tap four separate factors of intel- 
lectual ability believed to be basic to crea- 
tivity. Two additional criterion measures have 
also been applied, one being the total sum 
of the scores on the four factors, the other 
being the sum of two separate scores derived 
from the Consequences test. Thus a total of 
six criterion measures have been used. These 
measures, while only approximations to gen- 
eral creative abilities and not universally ac- 
cepted, do possess construct validity and have 
been fairly widely utilized. 

Throughout the Results section many of 
the analyses presented show a very similar 
pattern of level of significance across these 
six criterion measures. This pattern is not 
due to the reliability of the various measures, 
however, since it does not correspond to the 
reliability pattern as derived from the test 
manuals. At least part of the pattern ap- 
pears to be explained by the relationship be- 
tween scores. The three measures affected are 
the Consequences—Obvious, Consequences—Re- 
mote, and Consequences—Total. 

The Consequences test, it should be remem- 
bered, yields two separate factor scores. Also 
utilized as a criterion score was the sum of 
these two scores, the Consequences—Total 
score, reflecting the total number of re- 
sponses given to the test. As shown in Table 
6, the correlation coefficients between the 
creativity criterion measures and the frustra- 
tion measure are all negative except for the 
coefficient for the Consequences—Obvious 
measure. This indicates that, in general, the 
presence of frustration resulted in a de- 
crease in creative-problem-solving perform- 
ance. In the case of the Consequences—Obvi- 
ous measure, however, the correlation coeffi- 
cient is seen to be positive. The Conse— 
quences—Total score, being the arithmetic sum 
of the Consequences-Obvious and Conse- 
quences—Remote measures, can thus be seen 
to have suffered from a canceling effect. 

That is, performance under a state of 
frustration relative to performance under the 
initial and presumed nonfrustrated state is 
being measured by these nonindependent 
scores. The total number of responses given 
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to the Consequences test tended to go down 
under frustration. At the same time, how- 
ever, the ratio of obvious to remote responses 
shifted even more drastically, so that the end 
result was a major decrease in remote re- 
sponses, a moderate decrease in total re- 
sponses, but an increase in obvious responses. 
In a sense the quality of the responses, as 
well as the quantity, was affected, thus the 
shift in the sign of the correlation coefficient, 
and the canceling effect in terms of the tests 
of significance on the summed variables. It 
should be noted that this applies to all of the 
analyses based on these scores, although the 
magnitude of the effect varies. 

The Consequences—Remote criterion meas- 
ure appears especially strong. This is en- 
couraging in view of the fact that this score 
measures the factor DMT (Divergent Se- 
mantic Transformations), defined as Origi- 
nality (Guilford & Hoepfner, 1963). This is 
perhaps the one single factor which most 
closely approximates the usual, but larger, 
concept of creativity. 

In view of the findings, then, strong sup- 
port is believed present for the rejection of 
the null hypothesis, and the conclusion that 
frustration does reduce problem-solving per- 
formance. The correlation between the change 
in creativity criterion scores under frustra- 
tion and the emotional state measures evi- 
dences r?’s up to .324 in the case of the 
Consequences—Remote score. While the results 
in terms of the other criterion measures do 
not approach this level, it should perhaps be 
pointed out that this particular analysis is 
not, by its nature, extremely sensitive since 
it is not corrected in any way for the pos- 
sibility of frustration having existed at the 
time of the first administration, or the in- 
fluence of the personality variables, or any 
of the many other factors which may exert 
an influence. 

Analysis of the second administration 
scores as explained by the first administra- 
tion scores and the emotional state yielded 
R°s (multiple coefficients of determination) 
ranging from .304 to .597, with the Total 
score criterion giving the highest coefficient. 
This indicates some stability to the effect of 
frustration—that is, it implies that the effect 
is of similar nature for each of the criterion 
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measures even though the size of the effect 
may be different. 

In hindsight it is obvious that a more 
optimal procedure than that utilized would 
have been to collect data on the level of 
frustration present in Ss at this initial point; 
this was not in fact done. Such collection 
would have freed one from the necessity of 
making any assumptions about the level and 
distribution of the emotional factors, and 
statistical corrections could have been made 
to the posttest data to adjust for any vari- 
ance found. 
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LEADERSHIP IN NEGOTIATIONS AND THE COMPLEXITY 


OF CONCEPTUAL STRUCTURE * 
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Dyad negotiation teams (homogeneous in the complexity of the individuals’ 
conceptual structure) spent several hours attempting to solve a simulated in- 
ternational conflict situation. Every participant repeatedly rated himself and 
the other participants on Stogdill’s 10 leadership characteristics. The dyad 
member with the overall highest ratings on all scales was considered the dyad’s 
leader. The effect of simplicity and complexity of conceptual structure on the 
degree to which leaders display Stogdill’s leadership characteristics was analyzed 
in an ANOVA design. The interaction effect for complexity and leadership 
characteristics was highly significant. Complex leaders were rated higher on 
tolerance for uncertainty, assumption of leadership role, consideration, and 
predictive accuracy. Simple leaders were rated higher on initiation of structure, 


production emphasis, and demands reconciliation. 


In a series of statements, Driver and Streu- 
fert (1965, 1966a, 1966b) and Schroder, 
Driver, and Streufert (1967) have been con- 
cerned with complexity theory. According to 
these authors, individuals differ in the com- 
plexity of their conceptual structure, that is, 
in the degree to which they use differentiation 
and integration in information processing. The 
theory suggests that ‘‘simple” persons will re- 
late incoming information (input) to decisions 
(behavioral output) in a 1:1 fashion. They 
should rarely generate alternative interpreta- 
tions of the same information (no differentia- 
tion) and they should not consider the con- 
sequences of the varying interpretations for 
the potential decisions they might make (no 
integration). “Complex” persons on the other 
hand should tend to generate many alternate 
interpretations of informational input and 
should relate these alternative interpretations 
in various ways before reaching a (often 
tentative) decision (output). Generally, then, 
complex persons are more open to changes in 
the environment, adapt more easily, but are 
less constant and more liable to try pragmatic 
trial and error behavior (even when this is 
unnecessary). Usually, complex persons may 
be described as more “strategic.” 


1This research was supported by the Office of 
Naval Research, Group Psychology Branch. 


Differences in complexity of conceptual 
structure are not related to differences in in- 
telligence. Correlations of complexity mea- 
sures and measures of intelligence have gen- 
erally been near zero. 

The theory has generated a considerable 
amount of research. This research has served 
to support a number of the theoretical prop- 
ositions. As predicted by Driver and Streufert 
(1965, 1966a, 1966b) and Schroder et al. 
(1967), individual or group differences in the 
complexity of conceptual structure account 
for some of the differences in attitude change 
(Streufert, 1965), perception of others’ in- 
tentions and strategies (Streufert & Driver, 
1965), decision-making characteristics (Streu- 
fert & Schroder, 1965), information orienta- 
tion, information search, and information 
utilization (Karlins & Lamm, 1967; Stager, 
1967; Streufert, Suedfeld, & Driver, 1965; 
Suedfeld & Streufert, 1966), and other areas 
concerned with social interaction (Crano & 
Schroder, 1967; Sieber & Lanzetta, 1964; 
Tuckman, 1964). 

Little interest has been devoted so far to 
an analysis of the effect of complexity of con- 
ceptual structure on leadership characteristics. 
According to findings of Schroder, Streufert, 
and Weeden (1964), some relationships be- 
tween leadership and complexity do exist. In 
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a preliminary investigation of leadership, these 
authors found that groups of complex indi- 
viduals change their leaders more often than 
groups of simple individuals. 

According to the Driver and Streufert and 
the Schroder, Driver, and Streufert theory, 
one would not predict that either simple or 
complex persons would make more effective 
leaders. However, one would predict that cer- 
tain forms of leadership would be more likely 
displayed by complex persons, and other forms 
of leadership would be more likely displayed 
by simple persons. For instance, one would 
predict that complex leaders would be more 
willing to work with unknown factors, be 
more empathic, and predict group success 
better (when the determinants of group suc- 
cess are complex). On the other hand, one 
would predict that simple leaders would ex- 
ceed in emphasis on work success, giving di- 
rection (rather than empathy), and calling for 
unity of purpose. Theoretically, one might 
postulate that the degree to which complexity 
or simplicity in leaders may be an asset would 
depend on the particular requirements of the 
situation with which leader and follower 
group must cope. 

Concerned with the analysis of leadership 
characteristics, Stogdill (1948, 1957, 1962) 
proposed 10 characteristics which may be 
used to describe leaders: (1) tolerance of un- 
certainty, (2) persuasiveness, (3) initiation 
of structure, (4) tolerance of freedom of ac- 
tion, (5) role assumption, (6) production 
emphasis, (7) consideration, (8) predictive 
accuracy, (9) demands reconciliation, and 
(10) representation (Stogdill, 1962). These 
10 characteristics include some for which dif- 
ferences between complex and simple leaders 
might be expected (see above). Such differ- 
ences, however, would occur only if the en- 
vironment in which group and leader operate 
is sufficiently complex, so that differences can 
emerge (cf. Schroder et al., 1967). 

The present experiment is concerned with 
an analysis of the relationship of leadership 
characteristics as described by Stogdill’s 10 
leadership scales to the complexity of con- 
ceptual structure in leaders, as measured by 
the sentence completion test (SCT) of 
Schroder and Streufert (1963) and Schroder 
et al. (1967). 
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In its present form, the SCT employs the 
following stems: “When I am not certain 
which decision to make,” “When someone 
disagrees with me it usually means,” “When 
I am with two good friends and they express 
different points of view,” “When a situation 
can be interpreted in several ways.” The Ss 
are given 3 min. to complete each stem and 
write at least three more sentences on the 
subject. Responses are scored in terms of the 
multidimensionality and interrelationships of 
the response. For instance, a response scored 
as “simple” might be: 

When someone disagrees with me it usually means 
that he is wrong. I do not take issue when I am not 
sure of my ground. I try very hard to present the 


logic of my argument to the other person. If he will 
not listen I just give up. Some people never learn. 


Other simple responses state just as categori- 
cally that, “When someone disagrees with me 
it means that I have made another mis- 
fakéeon 5? 

A complex response to the same stem might 
be: 


When someone disagrees with me it usually means 
that we have had different experiences. We see the 
situation in a different way, and a long discussion is 
likely to ensue. If we both listen carefully, we will 
probably find the meaning in the other’s point of 
view, and where it can be applied. Often we both 
change our views somewhat. 


A complex simulated environment, the 
Tactical and Negotiation Game (TNG; 
Streufert, Kliger, Castore, & Driver, 1967) 
was chosen for this research. Leadership is 
measured in a simulated government (inter- 
nation) negotiation setting, which, because of 
the constellation of demands and offers, holds 
considerable similarity to both governmental 
and industrial negotiation situations. In addi- 
tion, this simulation situation provides suf- 
ficient environmental complexity to permit in- 
dividual differences in complexity of con- 
ceptual structure to take effect. 


METHOD 
Subjects 


Three hundred and fifty male undergraduate stu- 
dents in an introductory psychology course at an 
eastern state university were tested for complexity 
of conceptual structure with the SCT (Schroder et 
al., 1967; Schroder & Streufert, 1963). The potential 
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score range of Ss on this test varies from score 1 
(very simple) to score 19 (very complex). The high- 
est score obtained in this sample was 17. The Ss 
were selected into simple (score 1 and 2) and com- 
plex (score 10 or above) categories. Forty Ss, 20 of 
simple conceptual structure and 20 of complex con- 
ceptual structure, were selected for this experiment. 
The Ss within each of these groups were randomly 
assigned to dyad negotiation teams, so that two per- 
sons on one team would negotiate with two mem- 
bers of the opposing team. Teams were homogeneous 
in conceptual structure and negotiated with teams 
of the same level of conceptual structure. Differences 
in intelligence (College Board scores) between groups 
were not obtained. 


Setting 


The environment used for the present experiment 
was the negotiation component of the TNG de- 
scribed by Streufert et al. (1967). The topic of 
negotiations was the attempt at a settlement of an 
international conflict situation (based on the work 
of Streufert, Clardy, Driver, Karlins, Schroder, & 
Suedfeld, 1965). The negotiations, in effect, are not 
unlike potential negotiations for a settlement of a 
Vietnam or Cuba type conflict. 

The Ss were given 13 hr. to study the participants’ 
manual for the TNG.2 The manual contains con- 
siderable background information on the negotiations, 
the demands and offers to be employed by the team 
members, and potential demands and offers expected 
from the opposing dyad team (see below). All back- 
ground information in the manual was phrased in 
terms intended to persuade Ss that they were ne- 
gotiating for the “good” side. The biased presentation 
of this material was found to be very effective. 

The Ss were presented with 8 offers and 10 de- 
mands, both phrased in maximum and minimum 
terms. They were told that they were expected (by 
their government) to get the opponent team to ac- 
cept as many demands as possible, and to make as 
few offers as possible. The Ss were informed that 
their pay for participation in the game would de- 
pend directly on the number of offers and demands 
made. For instance, Ss would receive $2.00 for each 
of their maximum demands and $1.50 for each of 
their minimum demands that was accepted by the 
enemy team. In turn, they would lose $1.00 for each 
maximum offer and $.75 for each minimum offer 
made to and accepted by the enemy team. In addi- 
tion, they would lose $1.00 to $1.25 (depending on 
whether maximum or minimum) for each enemy de- 
mand they accepted, and gain $1.25 to $1.50 (depend- 


2 The manual has been deposited with the Ameri- 
can Documentation Institute. Order Document No. 
9244 from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washington, 
D. C. 20540. Remit in advance $2.25 for microfilm 
or $5.00 for photocopies and make checks payable 
to: Chief, Photoduplication Service, Library of 
Congress. The manual may also be obtained from 
the senior author. 


S. STREUFERT, S. C. STREUFERT, AND C. H. CASTORE 


ing on whether maximum or minimum) for each 
enemy offer they accepted. 

The Ss were only given general information about 
the content of enemy demands and offers. They were 
consequently unable to distinguish between maximum 
and minimum proposals put forth by the other team. 

The demands and offers to be made by the two 
teams were written so that some of them would be 
directly contradictory, in other cases the minimum 
demand of one team would match the maximum offer 
of the opposing team, and in yet other cases there 
was no relationship between the demands and offers 
of the two teams. Without the direct knowledge of 
the Ss, demands and offers were organized so that an 
even trade of demands and offers would yield each 
person on both teams a take home pay of $8.00. 
The Ss were told that they would be able to make 
anywhere between zero and $20.00. All agreements 
had to be in writing and signed by all four nego- 
tiators. 


Measurement and Leadership Characteristics 


Ten series of verbal negotiations between sets of 
two dyad teams (involving four persons at a time) 
were carried on for three successive periods of 1 hr. 
each. The Ss did not know when the final period 
would be. None of the teams had resolved their dif- 
ferences entirely after the third period, so that an 
end effect was avoided. After each of the (three) 
negotiation periods, Ss were asked to take a recess. 
The recess period was used to collect a variety of 
data. Among the measures which Ss had to complete 
individually were the 10 scales for leadership char- 
acteristics proposed by Stogdill (1962). (See above.) 
The Ss were asked to rate their teammate, them- 
selves, and the two members of the opposing dyad 
team separately on all 10 scales. Seven-point scales 
were used, marked “much like” and “much unlike” 
at the scale ends. 

Each of the four Ss in the negotiation sessions 
was, in addition, rated (at the end of each period) 
by a trained observer. Mean ratings (across periods 
and Ss) for each S on each scale were correlated 
with the mean ratings (across periods) obtained from 
the trained observer. The obtained correlation of 
y = .87 was taken as an indicator of interrater re- 
liability for the leadership complex. Because of the 
high correlation, only ratings by Ss were used for 
further analysis. 

The high correlation between ratings by Ss and 
ratings by the observer serves additionally as a 
check on the meaning of the leadership scales. Scor- 
ing instructions defining the meaning of the cate- 
gories were given as in Table 1. 

By specifically scoring for the characteristics as 
outlined in Table 1, it was attempted to avoid much 
similarity or reciprocality in scoring categories. The 
high correlation of S ratings with observer ratings 
was taken as an indicant that Ss had followed in- 
structions sufficiently well to avoid confounding of 
the scoring categories. 

To select the “leader” of each dyad of Ss (ne- 
gotiators), the members of each dyad were com- 
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TABLE i 


LEADERSHIP CHARACTERISTICS AND 
ScoRING INSTRUCTIONS 








Leadership 
characteristic 


Tolerates un- 
certainty 


Persuasiveness 


Initiation of 
structure 


Tolerates free- 
dom of action 


Assumes leader- 
ship role 


Production 
emphasis 


Consideration 


Predictive ac- 
curacy 


Demands recon- 
ciliation 


Represents 
group 











Scoring instruction 


The degree to which the subject was 


willing to work on any topic at 
hand although there was doubt 
that the other side would accept 
the suggestions made. 


The degree to which the subject 
attempted and succeeded in getting 
his view accepted. 


The degree to which the subject at- 
tempted to impose rules or organ- 
izational procedures upon the 
negotiations. 


The degree to which the subject per- 
mitted in himself and others be- 
havior which was extraneous to 
the task at hand. (Score here 
based on interpersonal behavior 
to avoid reciprocality to “‘initia- 
tion of structure.”’) 


The degree to which the subject 
attempted to guide the discussion 
in task areas, no matter whether 
the discussions were held under 
structured or unstructured con- 
ditions. 


The degree to which the subject 
attempted to get signed agree- 
ments, even if these agreements 
did not touch on areas where the 
negotiation teams were far apart. 


The degree to which a subject ex- 
pressed an understanding of why 
the other team was demanding 
(or offering) particular points, 
even if these points were declared 
not acceptable by the subject. 


The degree to which a subject was 
able to foretell in advance which 
agreement would be made, and 
what terms would be agreed upon. 


The degree to which a subject would 
not tolerate interpersonal (rather 
than task-based) conflict. (Em- 
phasis here placed on interpersonal 
side to avoid reciprocality with 
“tolerates uncertainty.’’) 


The degree to which the subject was 
verbally active and argued pri- 
marily for the demands and offers 
of his side. (Emphasis on task 
orientation to avoid similarity to 
“assumes leadership role.”’) 
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pared on a leadership score obtained by calculating 
the mean of four leadership ratings (by themselves 
and the three other negotiators) across scales and 
periods. The person with the highest score on each 
team was designated the leader. In other words, a 
leader in this sense is the dyad member with the 
highest average rating across all of Stogdill’s leader- 
ship characteristics over an extended period of time. 

Since one member of each dyad is designated 
leader, the 40 Ss yielded 20 leaders. Ten of these 
leaders were of complex conceptual structure, the 
other 10 were of simple conceptual structure. Mean 
leadership scores for each one of Stogdill’s 10 scales 
(across periods and across ratings by all members of 
both negotiating dyads) were calculated to use as 
entries in a mixed-design two-way analysis of 
variance. 


RESULTS AND DISCUSSION 


The data were analyzed with a mixed two- 
way analysis of variance. The main effect for 
complexity was not significant. The main ef- 
fect for leadership scales (F = 62.01) and 
the interaction effect of leadership scales with 
complexity of conceptual structure (F = 
37.74) were significant beyond the .001 level. 

In view of the highly significant interaction, 
differences between simple and complex lead- 
ers on each of the 10 leadership scales were 
tested by means of the F statistic with 1/168 
degrees of freedom. The analysis-of-variance 
within-error term was used as the estimate of 
mean square error within (cf. Winer, 1962). 
Because of repeated application, and because 
of the small error mean square (below the 
“theoretical minimum” of 1.0), a limit of p 
< .001 was set to accept significance. Sig- 
nificant F ratios were obtained for tolerance 
of uncertainty (fF = 128.0), initiation of 
structure (F = 34.03), assumption of leader- 
ship role (F = 18.0), production emphasis 
(F = 63.28), consideration (F = 52.53), pre- 
dictive accuracy (F = 69.03), and demands 
reconciliation (F = 24.75). The F ratios for 
persuasiveness, tolerance for freedom of ac- 
tion, and represents group were not signifi- 
cant. The results are shown in graphic form 
in Figure 1. 

As expected, the main effect for complexity 
was not significant. It appears that leader- 
ship characteristics are displayed by both 
simple and complex leaders. Although it is 
difficult to interpret the main effect for leader- 
ship scales since the interaction effect is 
highly significant, one might suggest that this 
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Fic. 1. Mean leadership ratings on Stogdill’s char- 
acteristics for leaders of simple and complex con- 
ceptual structure. 


result may be an effect of the negotiation 
situation. For instance, secrecy and united 
action are of vital importance in negotiations, 
and great freedom of action for the follower 
cannot be permitted by the leader if negotia- 
tions are to be successful. Consequently, one 
finds that both complex and simple leaders 
are rated near the scale midpoint on this 
characteristic. Similarly, both are rated high 
in assuming the leadership role and repre- 
senting their group, in other words, permit- 
ting little (divergent) activity by their fol- 
lower. 

The highly significant interaction effect is 
of special interest. As expected, there are some 
communalities and some divergencies in the 
characteristics of simple and complex leaders. 
Complex leaders are high on tolerance of un- 
certainty, while simple leaders are rated near 
the scale midpoint. Since complex persons 
have more facilities for integrating incon- 
gruent or unstructured information, such a 
result may be expected. Similarly, it is not 
surprising that simple leaders are more con- 
cerned with initiation of structure. Stogdill 
describes this characteristic as “clarifies own 
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role and lets follower group know what is 
expected.” This again would result in less un- 
certainty, establish clear-cut roles, and permit 
greater emphasis on accomplishing the task 
as defined. In line with this argument, sim- 
ple leaders also scored higher than complex 
leaders on production emphasis. On the other 
hand, complex leaders show more considera- 
tion and demand less reconciliation than sim- 
ple leaders. Again, this would be due to their 
greater ability to integrate alternate views, 
possibly even a preference for alternate views 
that might serve to develop alternate or more 
complex strategies. In addition, complex lead- 
ers were rated as having higher predictive ac- 
curacy. This result corroborates the finding 
of Streufert and Driver (1965) that com- 
plexity is of great advantage in the perception 
of others’ strategy. The finding that complex 
Ss are rated higher in assumption of leader- 
ship role is difficult to interpret since the 
terminology is, in this case, somewhat unclear. 
Further research on the meaning and implica- 
tions of the leadership “role” seems advisable. 

One should note that there was no differ- 
ence (ry = .85, .89) in the correlation of the 
trained observer’s ratings with the mean 
group ratings of complex and simple leaders. 
In other words, the differences between com- 
plex and simple leaders on these scales are 
not likely due to different uses of the rating 
scales by complex and simple persons, but 
are reflections of actual differences in leader 
characteristics. 

To check whether the ratings obtained by 
the “leader” Ss were based on actual leader- 
ship (rather than other characteristics, e.g., 
possibly those inherent in the situation), 
mean ratings were also calculated for “fol- 
lower” Ss. Mean ratings on all categories 
were below the scale midpoint (4.0), and the 
peaks and valleys of the curves were less pro- 
nounced. On the basis of this information it 
was concluded that validity of leadership 
ratings was given. 

The results suggest that, although in some 
situations leaders who are complex and lead- 
ers who are simple will do equally well, there 
are other situations requiring specific leader- 
ship characteristics in which the simple lead- 
ers may outperform complex leaders and vice 
versa. It may be of value to study the type 
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of leadership required before a particular 
leader is selected. In this fashion, leadership 
and consequently group performance may be 
optimized. In negotiations, for instance, it 
may be of value to use a complex leader when 
the situation is somewhat unstructured and 
frequent modification of strategies as well as 
empathy with the opposing team are re- 
quired. On the other hand, a simple leader 
might be more advantageous if more uni- 
lateral negotiations with emphasis on un- 
modified results are required. However, a 
warning should be added: Negotiations be- 
tween teams whose members are of simple 
conceptual structure may well end in a break- 
down of the negotiations as long as both sides 
are of similar strength and if there is little 
cr no meeting ground. Under such conditions, 
one would expect better results by using more 
complex leaders and negotiations teams. 
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REACTION TIME AS A FUNCTION OF THE CUE 
PROPERTIES OF AN AUDITORY DISPLAY 


J. RICHARD SIMON 


University of Iowa 


An attempt was made to increase the speed of processing verbal directional 
commands by using the ear stimulated to provide a relevant directional cue. 
In a choice reaction-time task, Ss pressed the right- or left-hand key in 
response to: (1) binaural “right” or “left” commands, (2) monaural “right” 
or “left”? commands in the ear corresponding to the content of the command, 
and (3) monaural pure tones. Responses to the binaural commands were 
significantly slower than to the other 2 conditions, and responses to the 
monaural commands were significantly slower than to the monaural tones. 


A basic and continuing problem in engi- 
neering psychology is that of designing dis- 
plays which effectively communicate spatial 
relationships such as direction (Fitts, 1951). 
In order to maximize the speed of informa- 
tion processing, displays should be designed 
so that they provide cues for the direct per- 
ception of relevant directional relationships. 
This occurs, for example, when the spatial 
arrangement of a visual display is made to 
correspond with that of its associated control 
(Fitts & Seeger, 1953; Garvey & Mitnick, 
1955). 

Simon and Rudell (1967) demonstrated 
that directional cues also play a role in the 
interpretation of auditory displays. Their Ss 
performed a choice reaction-time (RT) task 
which involved pressing the right- or left- 
hand key in response to the commands “right” 
or “left” which were presented to the right 
or left ear. The Ss responded significantly 
faster when the content of the command cor- 
responded to the ear stimulated (i.e., “right” 
in right or “left” in left ear) than when it did 
not (i.e., “right” in left ear or “left” in right 
ear). In other words, a delay in responding 
occurred when the irrelevant directional cue 
(ear stimulated) conflicted with the relevant 
symbolic cue (content of the command). 

The present study represents an attempt 
to increase the speed of processing verbal 
directional commands by utilizing the direc- 
tional cue provided by the ear stimulated. 
Will Ss respond faster to a display which 
combines directional and symbolic cues, that 
is, when the “right” command is always pre- 


sented to the right ear and the “left” com- 
mand to the left ear, than they will to a 
display which utilizes either directional or 
symbolic cues alone? The expectation that 
such a result might occur is based upon evi- 
dence that two simultaneous stimuli may 
summate to elicit a faster response than any 
one of the stimuli alone (Woodworth & 
Schlosberg, 1954). Todd (1912), for example, 
has shown that RT to a combination of sound 
and shock is faster than to either one alone. 


MeEtTHOopD 


Apparatus. The apparatus provided a measure of 
choice RT to a series of auditory stimuli which 
were presented to S through Grason-Stadler TDH-39 
matched earphones. Two telegraph keys were 
positioned 10 in. apart on a table in front of S. He 
was instructed to depress the correct one of the 
two keys as quickly as possible after hearing the 
signal. The instructions and stimulus materials were 
recorded on tape and presented at approximately 
80 db. by a Sony TC 500 Stereocorder. Onset of the 
signal activated a Hunter Klockounter which stopped 
when S pressed the key. Depressing the key also 
signaled EZ as to which key (or keys) had been 
pressed. 

Subjects. The Ss were 18 male and 18 female 
University of Iowa undergraduate volunteers between 
the ages of 18 and 29 (average age 19.8). All Ss 
passed a standard audiometric screening test in which 
tones of 500, 1000, and 2000 cps were presented to 
each ear separately. No S had a hearing loss greater 
than 10 db. at any one of the three frequencies 
tested. 

Procedure and experimental design. Each S per- 
formed on three blocks of trials corresponding to 
the three experimental conditions. A 3X3 Latin 
square was used to balance the order of. presentation 
of the conditions. Eight males and eight females 
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were randomly assigned to each of the three 
sequences. 


In one block of trials, the stimulus was a 1000-cps 
monaural tone. The S was told that he would hear 
the tone in either his right or left ear and that he 
was to respond by depressing the corresponding 
(right or left) key. In a second block, S was told 
that he would hear either the word “right” or the 
word “left” in both ears simultaneously and that he 
should respond by depressing the appropriate key. 
In a third block, S was told that he would hear 
the word “right” in his right ear or the word 
“left” in his left ear and that he was to respond by 
depressing the appropriate key. Each block of trials 
consisted of a predetermined random sequence of 
42 tones or commands half of which called for a 
right response and half a left response. The first 
four responses in each block, two right and two 
left responses, were practice trials. A ready signal 
was presented 2 sec. prior to each stimulus and 
consisted of the simultaneous onset of a warning 
light and a 2000-cps, 80-db. binaural tone of 500- 
msec. duration. There was a 7-sec. interval between 
commands, 


RESULTS 


Median RTs were computed for each S$ 
for each treatment condition. The means of 
these median RTs were as follows: monaural 
tone (directional cue)= 307 msec.; monaural 
verbal command (directional plus symbolic 
cues) = 322 msec.; and binaural verbal com- 
mand (symbolic cue)= 420 msec. An overall 
analysis of variance revealed that the differ- 
ence between these treatment conditions was 
significant; (2/60) = 155.10, p< .001. 
None of the other main effects (i.e., order, 
sequence, sex) or interactions was significant. 
A critical difference test (p= .05) be- 
tween individual pairs of means revealed that 
RT to the binaural verbal command was 
significantly slower than to either of the 
other two treatment conditions. The RT to 
the monaural verbal command was signifi- 
cantly slower than to the monaural tone. 


DISCUSSION 


Results indicated that Ss were slower in 
responding to the binaural verbal command 
than to either the monaural tone or the 
monaural verbal command. The explanation 
for this finding seems obvious. With the 
binaural command, S$ could not begin to re- 
spond until an intelligible portion of the 
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word had been articulated, whereas, with the 
other two treatment conditions, the ear stimu- 
lated provided a relevant cue and S could 
begin to respond immediately upon onset of 
the stimulus. It is therefore not surprising 
that the monaural verbal command (com- 
bined directional and symbolic cues) re- 
sulted in a faster RT than the binaural verbal 
command (symbolic cue). What is surprising 
is that the monaural verbal command actually 
resulted in a significantly slower RT than 
the monaural tone (directional cue). Results 
indicated clearly that Ss were utilizing the 
directional cue contained in the monaural 
verbal command. Apparently, however, they 
were not utilizing the cue as effectively as 
they might have. It was as if the availability 
of the redundant symbolic cue (content of 
command) slowed down Ss’ reactions to the 
directional cue (ear stimulated). 

Results of the present study show that RT 
to verbal commands of “right” or “left”? may 
be reduced by presenting these commands to 
the appropriate ear, that is, adding a relevant 
directional cue. However, results also indicate 
that this method of conveying directional in- 
formation is not superior and, in fact, is in- 
ferior to presenting pure tones to one ear or 
the other. These results must not be used as 
a basis for generalizations concerning (a) 
the relative effectiveness of directional versus 
symbolic cues in auditory RT, or (0) the 
general effect of combining directional and 
symbolic cues. Such generalizations are un- 
warranted because of the qualitative differ- 
ence between the two kinds of stimuli (tone 
and verbal command) used in this study. 
Future research should hold this variable 
constant, for example, train Ss to associate 
different directional responses to tones of 
differing frequency and then compare RT to 
binaural and monaural presentations of these 
tones. 
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COMPARISON OF JUDGMENTAL METHODS WITH 
ENDORSEMENTS IN THE ASSESSMENT OF 
PERSONALITY TRAITS * 


IGOR KUSYSZYN 


York University, Downsview, Ontario 


The study focuses on the degree to which judgment methods are free from 
dissimulation effects as compared with endorsements. 8 structured personality 
scales from the Personality Research Form were administered under 4 in- 
structional sets—Endorsement, Desirable in Others, Frequency of Endorsement, 
and Desirable in Selfi—to 94 members of 7 local college fraternities. The 
validity of both judgments and endorsements was examined by relating them 
to noninventory behavioral criteria in the form of behavior ratings. Although 
judgments were found to be relatively free from the kinds of dissimulation 
effects found with endorsements, judgments yielded significantly lower validities 


than endorsements. 


It has long been known that personality 
tests of the paper-and-pencil variety are sub- 
ject to faking. Although some tests have 
employed validity scales to detect distortion, 
an invalid profile is of little use to the in- 
vestigator who is interested in the individual’s 
personality traits. One approach to the prob- 
lem of faking, implemented in the present 
study, is to change the basic task of the S 
from that of describing himself to that of 
making judgments about the test items. For 
example, instead of asking S to evaluate an 
item as “true” or “false” of himself, one 
might ask him to judge how desirable he 
thinks others feel it is to respond “true” to 
the item. Thus, through the use of judg- 
ments, the purpose of the task may be 
disguised so as to minimize the operation of 
dissimulation tendencies. At the same time 
judgments may yield enough systematic re- 
sponse variance to provide an indication of 


1 This study is based on a thesis submitted by the 
author in 1967 to the Department of Psychology, 
University of Western Ontario, in partial fulfillment 
of the requirements for the degree of Doctor of 
Philosophy. The study has profited greatly from the 
author’s many discussions with Douglas N. Jackson, 
as well as from his careful reading of the manu- 
script. The able assistance of J. Edwin Boyd in 
programming the data for computer analysis is also 
gratefully acknowledged. This research was sup- 
ported in part by the University of Western Ontario 
Research Fund and in part by Grant No. 151 from 
the Ontario Mental Health Foundation to Douglas 
N. Jackson. 


the individual’s own personality (cf. Camp- 
bell, 1950). 

There have been very few studies published 
in which an attempt was made to use judg- 
ments for the purpose of assessing personality 
traits. In an early study, Rosen (1956) ad- 
ministered the Minnesota Multiphasic Per- 
sonality Inventory under two judgment sets, 
“personal desirability” and “perceived social 
desirability,” and obtained positive but low 
correlations between these sets and the stand- 
ard true-false set. Rosen concluded that the 
correlations were sufficiently low as to sug- 
gest the relative independence of each set. 
More recently, Messick (1960) factor ana- 
lyzed social desirability ratings of the Ed- 
wards Personal Preference Schedule (Edwards, 
1954) and obtained nine dimensions of social 
desirability. Jackson (1961, 1964a) suggested 
that the presence of these different “points of 
view” about desirability raises the possibility 
that the judgments of items might be useful 
in drawing inferences about the personality of 
the judge. Jackson (1961, 1964a) tested this 
hypothesis by using desirability judgments of 
a conformity scale to predict an attitude-shift 
criterion of conformity. The judgments were 
found to have a slightly higher relationship 
with the criterion (.29) than the endorsements 
(.22). In a replication of the Jackson study 
by Goldberg and Rorer (1966), in which Ss 
predicted the responses of their group mem- 
bers instead of judging the desirability of the 
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items, a correlation of .30 between predictions 
and the criterion was obtained while the 
validity of the endorsements was only .06. 
However, in an earlier replication of the Jack- 
son study by Loomis and Spilka (1963), the 
validity of endorsements (.26) was superior 
to the validity of the judgments (.15). In a 
broader study of conformity, Stricker, Mes- 
sick, and Jackson (1966) related judgments 
and endorsements of an Independence-Yield- 
ing scale to four kinds of conformity criteria 
and found judgments to be better predictors 
than endorsements in the case of Ss who were 
suspicious as to the nature of the study. 

The positive results in favor of judgments 
in three of the last four studies cited above 
suggest the potential importance of judgment 
instructions in trait assessment, and the 
limited scope of these studies (i.e., only one 
kind of criterion—conformity—was predicted) 
clearly indicates the need for a broader and 
more thorough investigation of judgment 
methods. The present study will attempt to 
partially fulfill this need by using criteria 
which differ in method from questionnaires. 
These are in the form of behavior ratings by 
peers. The validities of eight personality scales 
administered under three kinds of judgment 
sets and under an endorsement set will be 
evaluated. The susceptibility of judgments 
and endorsements to dissimulation effects will 
be studied by comparing the validities of de- 
fensive and social desirability respondents 
with the validities obtained by low defensive 
and low social desirability respondents, re- 
spectively. 


METHOD 


Subjects 


The Ss were 94 males ranging from freshmen to 
graduate students from seven college fraternities in 
the city of London, Ontario. Thirty-six lived inside 
their fraternity houses (live-ins) and 58 lived out- 
side (live-outs). They had been acquainted with one 
another for a mean period of 18 mo. and had a 
mean age of 21.9 yr., with a standard deviation of 
ie 7eyits 


Personality Inventory 


The inventory consisted of eight 20-item person- 
ality scales from the Personality Research Form 
(PRF) (Jackson, 1964b) and two 20-item validity 
scales, Social Desirability (Jackson, 1964b) and De- 
fensiveness (Jackson & Messick, 1964). The per- 


Icor Kusyszyn 


sonality scales are as follows: Nurturance, Exhibi- 
tion, Autonomy, Play, Aggression, Achievement, Im- 
pulsivity, and Order. The scales have a median test- 
retest reliability of .82 and a median validity, based 
on behavior ratings, of .54. The social desirability 
factor was suppressed in all scales during item anal- 
ysis. Acquiescence is suppressed by balanced true- 
false item keying. The Social Desirability scale mea- 
sures the degree to which an individual responds in 
a socially desirable manner; the Defensiveness scale 
measures the degree to which a bias in self-report is 
operating. 


Trait Adjectives 


For the behavior rating task, eight matched pairs 
of trait adjectives were selected so as to represent 
the eight personality scales of the inventory (matched 
pairs instead of single adjectives were selected so 
that the reliability of the ratings could be deter- 
mined). A brief definition of the trait accompanied 
each adjective. The adjectives and the scales they 
represent are as follows: supporting, helpful (Nur- 
turance); attention seeking, entertaining (Exhibi- 
tion); independent, nonconforming (Autonomy) ; 
playful, fun loving (Play) ; aggressive, argumentative 
(Aggression) ; achieving, ambitious (Achievement) ; 
impulsive, impatient (Impulsivity); orderly, meth- 
odical (Order). 

The rating instructions asked Ss to rate how char- 
acteristic each trait was of each of the members of 
their fraternity group. The ratings were made on 
a 9-point scale ranging from “extremely character- 
istic” through “neutral” to “extremely uncharacter- 
istic” of the individual. 


Instructional Sets 


The personality inventory was administered under 
four instructional sets—one endorsement set and three 
judgment sets. The sets, briefly defined, are as fol- 
lows: Endorsement—‘Rate how characteristic each 
statement is of you”; Desirable in Others—“Rate 
how desirable each statement is assuming it is true 
of others”; Frequency of Endorsement—“Rate how 
frequently people in general would answer ‘true’ to 
the statement”; Desirable in Self—“Rate how desir- 
able the statement is when it is applied to you.” 
All ratings were made on a 9-point scale similar to 
that used with the trait adjectives. 


Administrative Procedure 

Each fraternity group was run separately in a 
classroom setting. The groups ranged in size from 9 
to 19 Ss. The Ss performed the adjective trait rating 
task first, then the personality inventory was ad- 
ministered successively under each of the four in- 
structional sets in the order in which they are de- 
scribed above. 


RESULTS 
Defensiveness and Social Desirability Groups 


To investigate the effects of biased re- 
sponding, the total sample was split at the 
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median of the Ss’ Defensiveness scale scores 
obtained under the Endorsement set. This 
yielded a mean Defensiveness scale score of 
99 (the possible range of scores being 20- 
180) for the High Defensive group (V = 47) 
and a score of 76 for the Low Defensive 
group (NV = 47). To investigate the effects of 
socially desirable responding, the total sample 
was similarly split at the median of the Social 
Desirability scale scores. This split yielded a 
mean Social Desirability scale score of 140 for 
the High Social Desirability group (NV = 47) 
and a mean score of 118 for the Low Social 
Desirability group (NV = 47). 


Reliability of Behavior Ratings 


The median matched-pair reliability of the 
eight pairs of trait adjectives was .88 for the 
total sample (V = 94), .83 for the live-ins 
(N = 36), .79 for the live-outs (NV = 58), 
.88 and .86 for the Low Defensive (N = 47) 
and High Defensive (N = 47) groups, re- 
spectively, and .88 and .88 for the Low Social 
Desirability (N = 47) and High Social De- 
sirability (VV = 47) groups, respectively (co- 
efficients are Spearman-Brown corrected). 

The median between-rater reliability (agree- 
ment between subgroups of raters rather than 
the correspondence between matched pairs of 
adjectives) for the 16 single adjectives was 
.83 (Spearman-Brown corrected) for the total 
sample (V = 94). 


Reliability of Inventory Scales 


The reliabilities of the judgment and en- 
dorsement sets did not differ significantly. 
For all seven groups and for all four instruc- 
tional sets, the median reliabilities of the 10 
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inventory scales ranged between .67 and .86 
(split-half and Spearman-Brown corrected). 


Comparison of Judgment and Endorsement 
Validity Coefficients 


The criterion behavior rating scores were 
calculated as follows. Each S$ was rated by 
his peers on each of the 16 trait adjectives. 
The mean rated score for each S$ for each of 
the 16 adjectives was then determined. Since 
the 16 adjectives represented eight matched 
pairs of adjectives, the means of each of the 
eight matched pairs were added, yielding 
eight criterion scores per S, each of the eight 
scores corresponding to one of the eight per- 
sonality scales of the inventory. Validity co- 
efficients were determined for each of the 
eight personality scales under each of the four 
instructional sets by correlating the Ss’ in- 
ventory scale scores with their behavior rat- 
ing scores. For purposes of taking into ac- 
count the varying reliabilities of the trait rat- 
ings and of the inventory scales, all validity 
coefficients were initially corrected for at- 
tenuation. However, since the corrected coef- 
ficients revealed the same patterns of relation- 
ships among validities as did the raw coef- 
ficients, only the raw validity coefficients will 
be presented. 

Table 1 presents the validity coefficients for 
the total sample (V = 94). The Endorsement 
set yields the highest validities (Wdn r= 
40), the Desirable in Others set, the next 
highest (Mdn r = .20), the Desirable in Self 
set, the next highest (IMdn r = .17), and the 
Frequency set, the lowest (Mdur = .06). 

For purposes of conserving space, only the 
median validity coefficients will be presented 


TABLE 1 


VALIDITY COEFFICIENTS FOR THE TOTAL SAMPLE 











Nurt Exh Auto Play Agg Ach Imp Order Mdn 
Endorsements 29 56 24 35 44 32 51 40 
Desirable in Others 19 26 10 36 O1 21 12 20 
Frequency 19 —07 —06 19 09 —02 09 06 
Desirable in Self 19 Sl 05 21 12 01 14 17 


Note.—N = 94. Decimals omitted. 


The .05 and .01 significance levels of y are .21 and .27, respectively. h . . 
Significant differences between the Endorsement and Judgment sets as determined by the Wilcoxon signed-rank test : Endorse- 
ments—Desirable in Others, p < .02; Endorsements—Frequency, » < .01; Endorsements—Desirable in Self, p < .01 


Abbreviations: Nurt = Nurturance; Exh = Exhibition; 
Achievement; Imp = Impulsivity; Order = Order. 


Auto = Autonomy; 


Play = Play; Agg = Aggression; Ach = 
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TABLE 2 


MEDIAN VALIDITY COEFFICIENTS OF EIGHT PERSONALITY 


SCALES 


UNDER JUDGMENT AND ENDORSEMENT SETS 








: veers ; High Low High Low 
a ate a = b 
Instructional set Live-in Livercule defensive*® defensive® desirability® | desirability® 

Endorsement 47 34 37 id 36 48 
Desirable in Others 11 17 13 24 18 26 
Frequency 14 —03 —O1 18 07 13 
Desirable in Self 17 10 07 28 07 18 

Note.—Decimals omitted. 

aN = 36. 

dN =58. 

cN = 47. 


for the other six groups which were studied.” 
These correlations are presented in Table 2. 
From Table 2 it is seen that in each group 
the Endorsement set yields higher validities 
than any one of the three judgment sets. The 
Wilcoxon matched-pairs signed-rank test re- 
vealed that each of the three judgment sets 
differed significantly from the Endorsement 
set in all six groups. 

When the endorsement validities of each 
pair of subgroups were compared (Mann- 
Whitney U test) live-ins obtained signifi- 
cantly higher validities than live-outs (p< 
05); the validities of the Defensiveness sub- 
groups and the validities of the Social Desir- 
ability subgroups did not differ significantly. 
When the validities of each pair of subgroups 
were compared under each of the three judg- 
ment sets (Mann-Whitney U test) no signifi- 
cant differences were found between the 
live-ins and the live-outs and no significant 
differences were found between the Social 
Desirability subgroups. However, the Low 
Defensive group obtained significantly higher 
validities than the High Defensive group 
under the Desirable in Others (p < .01) and 
under the Frequency (p < .01) sets. 

When the individual validity coefficients of 


The entire 48 X 48 correlation matrix including 
all validity coefficients and all off-diagonal cor- 
relations has been deposited with the American 
Documentation Institute. Order Document No. 9828 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washing- 
ton, D. C. 20540. Remit in advance $1.25 for 
photocopies or $1.25 for microfilm and make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 


Table 1 are examined they are seen to vary 
greatly in magnitude, especially under the 
judgment conditions. This variability indi- 
cates that certain Judgment Set x Scale Con- 
tent interactive effects must be operating. 
That is, the same item content is rated dif- 
ferently under different judgment sets. When 
the individual validity coefficients of the De- 
fensiveness subgroups were examined, certain 
Scale X Subgroup interactive effects were also 
seen to be operating, especially under the 
judgment sets. For example, for the traits Ex- 
hibition, Aggression, and Achievement, Low 
Defensive Ss obtained positive validity coef- 
ficients while High Defensive Ss obtained 
negative validity coefficients, under the same 
judgment sets. Similar interactive effects were 
found for the Social Desirability subgroups. 


Behavior Rating and Personality Scale Score 
Means 


Table 3 presents the mean behavior rating 
scores and the mean personality scale scores 
of the High and Low Defensive groups. 
These means are compared in the hope that 
they will shed some light on the pattern of 
validity coefficients obtained by these two 
groups. (It will be remembered that the Low 
Defensive group obtained significantly higher 
validity coefficients than the High Defensive 
group on two of the judgment sets—see 
above.) When the mean scores are examined 
on the basis of a general ‘“‘favorable-unfavor- 
able” trait dimension, it is seen that High De- 
fensive Ss obtain higher scores than Low De- 
fensive Ss on favorable traits and lower scores 
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TABLE 3 
MEAN Trait RATING AND MEAN PERSONALITY SCALE SCORES FOR HIGH AND 
Low DEFENSIVE GRoUPS 
Nurt Exh Auto Play Agg Ach Imp Order 
High defensive group (V = 47) 
Trait ratings 12.34 9.93** 10.14 eon 8.05 1'3:62"* 9.86 Dies 
Endorsements 119* 98** 95 104** Soa Mog Ost os 
Desirability in 
Others 130 100** 95 98 71 131 81 127 
Frequency 114 99 98 105 83 108 94. 109 
Desirability in Self | 132 100** 97 94* 63 137 76 131 
Low defensive group (V = 47) 
Trait ratings 11.96 10.97** 10.64 12.64** 8.45 2!65n5 10.41 Lashes 
Endorsements 113* AEE 94 1S OTe 106** LO7as OSG 
Desirability in 
Others 130 LO ea 94 102 al 131 82 128 
Frequency 116 99 95 105 86 111 91 116 
Desirability in Self | 129 ih 99 100* 68 137 80 129 





Note.—The possible score range for a trait rating score is 2-18 and the range for a personality scale score is 20-180. 


See Note of Table 1 for abbreviations. 


The entries marked with asterisks indicate statistically significant differences between the High and Low Defensive groups 


between the scales obtained under parallel instructional sets. 
a 


RD < 01. 


on unfavorable traits. For example, when the 
behavior rating scores of the two groups are 
compared it is seen that the High group ob- 
tains significantly lower scores than the Low 
group on Exhibition and Play (Play was de- 
fined as “horsing around,” always having a 
good time, etc.) and higher scores than the 
Low group on Achievement and Order. Under 
the Endorsement condition the High group 
again obtains lower scores on Exhibition and 
Play and higher scores on Achievement and 
Order, as well as obtaining lower scores on 
Aggression and Impulsivity and a higher score 
on Nurturance. However, High Defensive Ss 
do not respond much more favorably than 
Low Defensive Ss under the judgment sets: 
there were no significant differences between 
the two groups on the Frequency set, only 
one significant difference under the Desirable 
in Others set (Exhibition), and only two 
significant differences under the Desirable in 
Self set (Exhibition and Play) (see Table 3). 

Similar score differences were obtained be- 
tween the means of the Social Desirability 
subgroups but fewer of the differences were 
found to be statistically significant. 


Discussion 


It will be recalled that the major purpose 
of the study was to examine the validities of 
judgments and endorsements, especially in 
relation to the operation of dissimulation 
effects. The obtained results were in one 
sense supporting of the use of judgment sets 
for controlling response biases, but it became 
clear that much more work is needed to 
establish an unequivocal foundation for the 
use of judgments routinely in personality 
assessment. Thus, the judgment sets were 
found to be relatively free from dissimula- 
tion effects in that little evidence of biased 
judgments was found under any of the three 
judgment sets. However, the validities of the 
judgment sets were low in relation to the 
validities obtained with endorsements. En- 
dorsements yielded higher validities for all 
groups, even in the case of High Defensive 
and High Social Desirability respondents who 
endorsed items in a favorable direction. 

When the mean personality scale scores of 
the High and Low Defensive groups were 
compared under the Endorsement condition, 
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High Defensive Ss obtained more favorable 
scores than Low Defensive Ss. However, it 
was also observed that High Defensive Ss ob- 
tained more favorable behavior ratings than 
Low Defensive Ss. That is, not only do 
High Defensive individuals respond in a 
favorable direction to personality items, but 
they also seem to create, quite strongly, 
favorable impressions upon others in real life. 
This finding is important for several reasons. 
Although other factors may be operating, the 
fact that there is a correspondence between 
the favorable endorsement of personality 
items and favorable responding in real life 
(as reflected by the behavior ratings) may be 
why the endorsements in the present study 
show higher validities than the judgments 
for the Defensiveness subgroups. The fact 
that the behavior ratings of the Low Defen- 
sive group covaried with the judgments to 
a greater degree than did the behavior ratings 
of the High Defensive group would also pro- 
vide an explanation for the judgment validities 
being higher for the Low than for the High 
Defensive Ss. This follows from the fact that 
there were few significant differences between 
the judgment scores of the two groups but 
there were relatively more significant differ- 
ences between their behavior rating scores. 
The finding that High Defensive individ- 
uals present favorable impressions in real life 
poses an interesting theoretical question. Do 
these individuals create favorable impressions 
intentionally or are they unaware of creating 
them? If they are unaware of the way in 
which they present themselves, that is, if 
favorable responding is part of their “life 
style” (cf. Crowne & Marlowe, 1964), then 
it is clear that personality scales are reflect- 
ing, in part, this “life style” of positive pre- 
sentation. On the other hand, if they are 
aware of playing this favorable role, should 
one accept their scale scores or should one 
be skeptical and begin a search for their 
“real” or “true” personalities? If one commits 
himself to the definition of behavior of pri- 
mary research interest as that which is en- 
dorsed on structured tests, as the present 
researcher has done, then one must accept 
by this definition all reliable test responses 
as being indicators of “true” personality in 
the sense that each response is determined 
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by certain personality characteristics. In this 
sense one would accept the endorsement 
validity correlations of the High Defensive 
Ss as they stand. One would also be obliged 
to account for all reliable variance obtained 
under the judgment sets, including the rela- 
tively modest portion which proved to be 
related to criterion trait ratings. In this latter 
case, it is believed that the correlations be- 
tween the judgments and the trait ratings 
were small in relation to the endorsement 
validities, because the judgment tasks differ 
in nature from the endorsements. The judg- 
ment tasks, being indirect, did not require Ss 
to reveal their personality traits in the same 
sense as did the endorsements. The judg- 
ments asked for an opinion of how desirable 
an item is in oneself or in others, or how 
frequently others would endorse the item. 
One implication of the present study, there- 
fore, is that endorsement rather than judg- 
ment instructions be used if one desires a 
valid measure of personality by means of 
structured scales and trait-rating criteria. The 
relatively high criterion validities, especially 
in the case of High Defensive and High Social 
Desirability respondents, justify the use of 
endorsements. The empirical support for the 
uniqueness of judgments suggests that more 
work needs to be done in uncovering the 
properties of judgments before applying them 
to the assessment of personality traits. The 
unique properties of the individual judgment 
sets were revealed in the present study by the 
interactive effects obtained between judg- 
ments and traits. For example, under one 
judgment set the validity coefficient is high 
while under another judgment set the validity 
coefficient for the same trait is low. Similar 
interactive effects between judgments and 
scale content are reported by Jackson and 
Singer (1967). In their study Ss were found 
to attribute the more pathological traits to 
others to a greater degree than to themselves. 
Further support for the complexity of judg- 
ments comes from a study by Jackson 
(1964b). Jackson’s results suggested that in 
the case of certain traits a complementary- 
needs hypothesis may be warranted. For ex- 
ample, Ss scoring high on the trait Dominance 
under the endorsement condition were found 
to judge Submissiveness as being desirable in 
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others. The same held true for the comple- 
mentary traits Succorance and Nurturance. 
Findings such as these reveal some of the 
problems involved in using judgments for 
assessing personality traits. It appears that 
a fruitful approach would be to examine each 
type of judgment set separately in relation to 
specific item content. In this way the com- 
plementarity-of-needs hypothesis as well as a 
similarity-of-needs hypothesis could be more 
precisely studied and the effects of different 
judgment sets on different traits more clearly 
delineated. 
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EFFECTS OF THE TYPE OF STIMULUS EMPLOYED 
AND THE LEVEL OF SUBJECT AWARENESS ON 
THE DETECTION OF DECEPTION * 


RICHARD I. THACKRAY 2 ann MARTIN T, ORNE 2 


Institute of the Pennsylvania Hospital and University of Pennsylvania 


An experiment was designed to compare detection of personal material with 
that of neutral material made relevant only in the experimental context. Each 
S acted the role of an espionage agent and attempted to conceal both his 
personal identity and certain code words he had learned. Personally relevant 
material was found to be significantly more detectable than the neutral ma- 
terial, although consequences of detection were the same for both types. In 
addition, the study attempted to provide exploratory data concerning physi- 
ological responsivity to lie-detection stimuli when Ss were unaware that 
responses were being monitored. While there was no evidence that detection 
was inferior under the nonaware condition, difficulties in achieving a com- 
pletely convincing nonaware situation suggest caution in generalizing from 


these findings. 


Previous studies in detection of deception 
have reported varying rates of detection 
which appear to be related to the type or 
nature of the stimulus situation employed. 
Lykken (1960), for example, obtained 100% 
accuracy in matching idiosyncratic material 
(mother’s name, father’s name, etc.) to a 
particular S$. Somewhat lower detection rates, 
ranging from 94% (Lykken, 1959; Marston, 
1921) to 50% (Landis & Wiley, 1926), are 
reported when mock crimes are employed. 
Still lower rates, ranging from 83% (Ruck- 
mick, 1938) to 40% (Landis & Wiley, 1926), 
are found with such “neutral” stimuli as 
numbers, letters, etc. While these studies 
would suggest that the greater the degree of 
affect associated with the type of stimuli 
employed the greater the obtained detection 
rate, differences between studies in methods 
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of stimulus presentation, analysis of data, 
particular physiological measures used, etc., 
make valid comparisons virtually impossible. 

Since systematic comparison of different 
types of stimulus material within a single 
study has not previously been reported, a pri- 
mary aspect of the present study was to ex- 
plore the importance of the stimulus dimen- 
sion as a determinant of detection rate. 
Specifically, detection rates derived from a 
mock crime type of paradigm similar to one 
previously used (Thackray & Orne, 1967) 
were compared with rates obtained from 
personal or idiosyncratic material. 

The study was also designed to provide 
some exploratory data about S’s physiolog- 
ical responsivity to lie-detection stimuli when 
unaware that his responses are being moni- 
tored. A number of studies have suggested 
that detection of deception is difficult or im- 
possible if S is not motivated to deceive or 
if there are no consequences to being detected 
(Chappell, 1929; Gustafson & Orne, 1963). 
Thus, by eliminating S’s awareness of being 
monitored, he might not at that time be at- 
tempting to deceive, would therefore have 
nothing to hide, and would be difficult to 
detect. Under nonaware conditions, a differ- 
ence between personally relevant words and 
neutral words which achieve relevance only in 
the context of the experiment might be par- 
ticularly striking. For example, hearing one’s 
own name is likely to result in a greater 
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physiological response than hearing other 
names independent of the detection-of-decep- 
tion situation. On the other hand, when decep- 
tion is not being attempted, such differentially 
greater physiological responsivity might not 
occur with material that is relevant only in 
the context of the experiment. 

The physiological response measure em- 
ployed in the present study was the galvanic 
skin response (GSR), since previous labora- 
tory studies have shown it to be the single 
best index of deception (Ellson, Davis, Saltz- 
man, & Burke, 1952; Kubis, 1962; Kugelmas 
& Lieblich, 1966; Violante & Ross, 1964). To 
use this measure under a nonaware condition, 
however, presents certain difficulties. There 
appear to be no completely adequate tech- 
niques for insuring uniform and continuous 
electrode contact without having electrodes 
physically attached to S. It was felt, though, 
that a situation might be devised in which 
electrodes were attached, but in which S 
would believe that no recordings were actually 
being taken. The situation chosen approxi- 
mated closely the procedure followed in actual 
field polygraph examinations. In the field 
procedure, each question is carefully reviewed 
with the suspect prior to the actual examina- 
tion. In the present experiment, GSRs were 
surreptitiously recorded during this review 
phase in the hopes of approximating a non- 
aware condition. 


METHOD 
Subjects 


Twenty-eight male undergraduate college students 
were recruited from local universities. All were 
volunteers and none had previously participated in 
detection-of-deception experiments. 


Procedure 


The critical stimulus material consisted of three 
common words and three personally relevant items 
oi information: S’s first name, last name, and date 
of birth. Each critical stimulus was embedded 
among five similar stimuli. Prior to S’s arrival, an 
assistant randomly selected one card from each of 
three sets of six cards containing names of com- 
mon birds, trees, and colors, respectively, which 
would subsequently serve as “code words” to be 
learned by a given S.4 He also selected five last 


4The actual stimuli employed were as follows: 
Colors—PURPLE, ORANGE, BROWN, GRAY, GREEN, BLUE; 
Birds—crow, PIGEON, OWL, WREN, ROBIN, SPARROW; 
Trees—OAK, HEMLOCK, FLM, SPRUCE, MAPLE, PINE. 
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names from a large list of names of similar ethnic 
origin, five first names from a list of common male 
names, and five random dates (month and day). 
The S’s own last name, first name, and date of 
birth were then randomly positioned in their re- 
spective sets. The only restriction placed on the 
sets was that the first item served as a dummy in 
the interrogation and consequently could never be 
a significant stimulus. 

Each S was seen by &; who explained that the 
effectiveness of the polygraph as a lie detector was 
being studied. He was told that he was to play the 
role of an espionage agent who knows certain 
critical code words and that he would be given 
a polygraph examination subsequently by the inter- 
rogator (Hs). It was further explained that, due to 
a security leak, the interrogator was in possession of 
a pool of code words as well as the names and 
birthdates of six agents. His own name and the code 
words to be learned would be contained in this 
list. To maintain his anonymity, however, he would 
be introduced as “S 27” and the interrogator would 
not know whether he (the S) was one of the in- 
dividuals whose names he had or whether he was in 
fact innocent (belonged to a control group who did 
not receive such information). The S’s task was to 
convince the interrogator that he was innocent and 
that none of the code words or personal informa- 
tion had any meaning to him. It was explained to 
him that while it was difficult to deceive the 
polygraph, some highly intelligent and mature in- 
dividuals had been able to do so. The S was then 
given a booklet which was constructed to maximize 
overlearning of the three code words. 

After S had completed the initial phase of the 
experiment, he was introduced to E2. as S 27. The 
E, simultaneously gave Es the set of stimulus cards 
specific for the S. While no innocent Ss were used 
in this study, H2 was actually blind as to the relevant 
code words and personal information concerning 
the S. As S entered the room, the polygraph was in 
the process of being “calibrated.” The technician 
completed her task and turned off the instruments 
as S was being seated in a stratolounger chair facing 
away from the equipment. The £2 treated S 
analogously to the way an interrogator might. He 
explained the basic operation of the polygraph and 
stressed the virtual infallibility of the instrument. 
He explained to S that if he were guilty the equip- 
ment would certainly register this fact, and he 
could save considerable time and trouble by con- 
fessing (no one did). The interrogator explained 
that he would go over each of the questions with 
S to make certain there was no ambiguity, surprise, 
or confusion regarding any of the questions or the 
procedure itself. The S was told that after the 
familiarization phase, the polygraph equipment 


5 The “calibration” was to acquaint S with the 
sound of the polygraph and to reinforce his belief 
that no recording was being attempted prior to 
the actual examination. 
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would be turned on, adjusted, and the examination 
itself carried out. 

Electrodes and transducers for recording GSR 
and six other variables were then attached. (The 
remaining variables will be discussed in a separate 
paper.) The leads from the various electrodes, 
transducers, and associated equipment were not 
connected to the polygraph and were left loosely 
arranged on S’s lap. One of these polygraph leads 
was actually a dummy, however, and led from one 
of several small transducer boxes located on S’s 
chair. The GSR electrodes led to this box which con- 
tained the transmitter and bridge circuit of an FM 
telemetry system.6 The GSR was telemetered to a 
separate room and recorded on a separate polygraph 
elsewhere. It was explained to S that the electrodes 
had to be attached at this time in order to allow 
sufficient time for them to “stabilize” prior to the 
later polygraph examination. 

After all electrodes were attached to the S, E> 
informed him that the questions would now be 
reviewed with him. Since the purpose of the pre- 
interrogation was to familiarize him completely with 
all aspects of the polygraph examination which 
would subsequently follow, S was told not only to 
listen to the questions, but to answer each with a 
“yes” or “no” just as he would later during the 
examination. The S was then questioned on each 
word in the six stimulus sets. The items were pre- 
sented by £2 as informally as possible, allowing 
approximately 6 sec. between each item to provide 
sufficient time for the telemetered GSRs to be 
recorded. Half the Ss received the code-word ma- 
terial first followed by the personal material, while 
the order was reversed for the remaining half. 

Upon completion of the preinterrogation phase, 
the various leads were connected to the polygraph and 
the equipment was adjusted by an assistant acting 
as the polygraph operator. The S was then informed 
that the examination would begin. Questions were 
asked in the same order employed in the pre- 
interrogation, and S was again asked to answer each 
with a “yes” or “no.” A 16-sec. interstimulus interval 
was used to allow for differing latencies in the 
physiological response measures employed. Follow- 
ing completion of the questions, S was disconnected 
from the polygraph and taken to &: for a post- 
experimental inquiry. 

The data consisted of the telemetered GSRs ob- 
tained during the preinterrogation as well as dur- 
ing the “actual” polygraph examination. Since the 
telemetered data did not yield absolute values of 
resistance, the magnitude of response was obtained 
by measuring in millimeters the maximum pen 
deflection to occur within 5 sec. following the onset 
of each verbal stimulus. The response to the critical 
stimulus was ranked against the four control words 
in its respective set, and a rank of 1 was assigned 
to the largest response. Mean ranks were then de- 
termined for each S on both the code words and 
the personal items. 


6 Onyx Designs, Inc. 
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RESULTS 


Table 1 displays the mean ranks obtained 
for both types of stimulus material under 
the nonaware and aware conditions. A mean 
rank of 1 would indicate perfect detection; 
it is noteworthy that in every comparison 
personal material yields a smaller mean rank 
than code-word material. To test the pos- 
sibility of combining orders, comparisons were 
made between ranks of personal words under 
both orders of presentation, and likewise 
between code words under both orders. Sepa- 
rate comparisons were made for the nonaware 
and aware series. None of the differences 
was significant at the .05 level (Mann- 
Whitney U; two-tailed). The orders were 
then combined and the results shown in 
Table 2. As indicated in this table, Wilcoxon 
matched-pairs signed-rank tests revealed a 
significant superiority of personal words over 
code words. 

In order to determine whether detection 
rate differed as a function of awareness, chi- 
square tests were separately performed on the 
personal- and code-word distributions. Since 
the expected chance value of the mean ranks 
is 3, the number of mean ranks above and 


TABLE 1 


Mran GSR RANnKs ON CopE Worps (CW) Anp 
PERSONAL Worps (PW) FOR BotH ORDERS 
OF PRESENTATION UNDER BOTH 
NONAWARE AND AWARE 























CONDITIONS 
Condition Order Stimulus X rank 
material 
PW 1.38 
PW-CW2 
CW 1.67 
Nonaware 
CW 1.63 
CW-PW? 
PW 1.23 
PW 1.46 
PW-CW 
CW 1.70 
Aware 
CW 1255 
CW-PW 
PW 1.42 
NE aioe 
bN = 15. 
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TABLE 2 


Mean GSR Ranks ON ComMBINED CopE Worps 
(CW) AND PERSONAL Worps (PW) UNDER 
NoNAWARE AND AWARE CONDITIONS 








Stimulus 





Condition : X rank x? Le 
material 
PW 1.30 200374" 
Nonaware 20-000 
CW 1:65 DRS 
PW 1.44 DA oomean 
Aware So" 
CW 1.62 ZnO Snes 





Note.—N = 28. 

* » < .025, one-tailed test. 
¥#* b < .01, one-tailed test. 
arhip < .001. 


below this value was determined for each 
distribution and evaluated against the ex- 
pected frequencies. As Table 2 indicates, the 
four chi-square values were significant, in- 
dicating better than chance detection under 
both awareness conditions. To test for any 
relative superiority of the two conditions, a 
comparison was made between ranks of per- 
sonal words in the nonaware and aware series. 
Likewise, a similar comparison was made be- 
tween ranks of code words in both series. 
Wilcoxon tests yielded nonsignificant z values 
of 1.09 and .09, respectively. 

During the postexperimental inquiry each 
S rated how detectable he thought he was 
on the three personal-word and three code- 
word items. Mean ranks for these two types 
of stimulus material were determined for each 
S, and the difference between the distributions 
tested with a sign test. Results indicated Ss 
felt themselves to be more detectable on the 
personal-word material (p < .001). 


DIscUSSION 


Previous research has focused on the con- 
sequences of detection and S’s motivation 
to deceive as significant variables affecting 
the likelihood of differentially increased physi- 
ological responses (Gustafson & Orne, 1963, 
1965). In the present study, responding to 
either personal items or code words would 
lead to detection. The S was aware that a 
response to either type of stimulus would have 
the same consequences; that is, he would 
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reveal himself to be a guilty person. Yet, the 
results indicate that detection of personally 
relevant material is superior to relatively 
neutral stimuli which are made relevant only 
in the context of the experiment. If S’s mo- 
tivation to deceive and the consequences of 
deception are held constant, however, it seems 
reasonable that personal material would be 
more likely than code material to evoke some 
form of emotional response. These findings 
are not inconsistent with previously reported 
results that situational factors have marked 
effect on detection rates, since significant 
detection was achieved with neutral stimuli 
made relevant in the experimental context. 
Thus, one would expect a decrease in detec- 
tion rates with both kinds of stimuli in a situ- 
ation where motivation is markedly reduced. 
The findings do suggest, though, that situa- 
tional determinants alone are not sufficient to 
account for the obtained detection rates. If 
detection were only a function of its con- 
sequences, personal material would not have 
been detected better than code-word material. 

The differential response between personal- 
word and code-word material observed in the 
present study is consistent with the findings 
of Oswald (1962, pp. 47-50) that Ss during 
sleep are more likely to rouse to their own 
name than to other stimuli presented at equal 
volume even if motivating contingencies are 
associated with other stimuli, though mo- 
tivating contingencies do affect the prob- 
ability of arousal to previously neutral stimuli 
during sleep. Whether one wishes to con- 
ceptualize the observable difference of re- 
sponse to highly relevant personal material 
(such as one’s name) as due to past differ- 
ential affective conditioning or to differences 
in meaning is largely a matter of semantic 
preference. That personally relevant material 
tends to evoke a differential affective re- 
sponse has also been shown in quite a different 
context in the work of Wolff (1943). He 
found that individuals gave a greater emo- 
tional reaction to their own forms of self- 
expression even when these were not recog- 
nized as such. The detection-of-deception 
paradigm used here is one where S’s physio- 
logical response to a particular stimulus is 
compared with that to other similar stimuli. 
In past research (Gustafson & Orne, 1963), 
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it has been shown that previously neutral 
stimuli can be made relevant to the experi- 
mental situation and yield differentially 
greater physiological responses. The previous 
work by both Oswald and Wolff has shown 
that personally relevant stimuli yield greater 
affective responses than neutral stimuli in a 
situation other than a detection-of-deception 
experiment. The authors of this study would 
interpret their findings to suggest that the two 
sources of increased physiological respon- 
sivity, situational and preexisting personal 
relevance, may each independently contribute 
to detection. 

The Ss rated themselves as more detectable 
on personal than on code material in the post- 
experimental inquiry. A clear interpretation 
of this finding is difficult. They may have 
perceived themselves as responding to per- 
sonal items during the polygraph examination 
or they may merely have expected that per- 
sonal material should be more detectable than 
code words. Further research is needed to 
clarify the results. 

In designing the present study, it was 
hoped that the nonaware condition would 
allow a comparison between personal- and 
code-word stimuli in a situation where, as far 
as the S knew, a physiological response would 
not have the consequence of detection. Under 
these circumstances one might expect that 
the code words would fail to yield a greater 
physiological response than their appropriate 
control stimuli, whereas this might not be 
the case with idiosyncratic or personal ma- 
terial. This difference in relative response 
to the two classes of stimuli might become 
smaller or disappear entirely under the aware 
condition of interrogation, since at that time 
the S would be motivated to attempt to con- 
ceal his physiological responses to both code 
words and personally relevant words. 

The actual findings indicated no significant 
differences between these two conditions. 
Personally relevant words yielded differ- 
entially greater physiological response than 
code words in both conditions; however, 
code words themselves were identified as 
readily in their nonaware condition as in their 
aware condition. It is of course possible that 
Ss did not believe appearances and thought 
that somehow they might still be monitored 
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(as indeed they were) during the nonaware 
condition. Not only was every possible pre- 
caution taken to minimize suspicion, but very 
careful postexperimental inquiry was carried 
out on this issue. Only one S appeared to 
have any suspicion in this regard; most Ss, 
however, verbalized feelings of discomfort 
during the nonaware condition. They in- 
dicated they felt strange having all the elec- 
trodes attached to them and anticipating the 
forthcoming polygraph examination. Some Ss 
even indicated that if they had been recorded 
during this initial period detection probably 
would have been easier than during the actual 
examination. It would seem that having a 
great many electrodes attached and expecting 
the forthcoming polygraph examination 
created a condition for most of the Ss that 
was similar to that of the actual interrogation. 
It should be emphasized that Ss did attempt 
to deceive E during the preliminary part of 
the examination. Further work will be needed 
to clarify whether the augmented GSR as- 
sociated with the code words under the non- 
aware condition was a function of the S’s 
motivation to deceive E independent of his 
belief that records were being obtained or 
whether it was related to the number of 
electrodes that were attached to S and the 
special situation in which he found himself. 
A definitive answer about the effect of the 
awareness of being monitored on physiological 
responsivity will require considerably more 
investigation, and great care should be taken 
in generalizing from the findings reported 
here. 
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20 yr. of occupational validation on the General Aptitude Test Battery (GATB) 
are summarized in terms of average validity with (a) job proficiency and train- 
ing criteria and (b) concurrent and longitudinal studies. These 4 categories of 
studies are investigated from the standpoint of both single and multiaptitude 
prediction. The relative efficiency of many of the 9 GATB aptitudes in pre- 
dicting criteria is dependent on whether job proficiency or training criteria are 
used. Longitudinal studies tend to have higher-aptitude criterion correlations 
than concurrent studies. The median validity of batteries of GATB aptitudes 
tends to be higher with studies using training criteria or the longitudinal design 
than the validity of battertes using job-proficiency criteria or the concurrent 
design. The summary is based upon 424 studies involving over 25,000 em- 


ployees, applicants, trainees, and students. 


In his recent book, Ghiselli (1966) summa- 
rizes the success of researchers in predicting 
occupational performance with aptitude tests. 
Ghiselli’s book provided the impetus for this 
article which summarizes the United States 
Employment Service (USES) research results 
in terms of the average validity of each Gen- 
eral Aptitude Test Battery (GATB) aptitude 
against job proficiency and training criteria 
and in longitudinal and concurrent validation 
studies. The results obtained when batteries 
of GATB aptitudes are developed with each 
of these two types of criteria and two types 
of designs are also compared. The summary 
is based on 424 studies involving over 25,000 
employees, applicants, trainees, and students. 


BACKGROUND 


The USES test research program, initiated 
in 1935, led to the development of the USES 
General Aptitude Test Battery (GATB) pub- 
lished in 1947. Occupational test development 
analysts are studying aptitudinal require- 
ments, as shown by the GATB, for successful 
on-the-job performance in 36 state employ- 
ment services affiliated with the USES. To 
date, GATB validity studies have been con- 
ducted on over 500 occupations. Although 
USES aptitude test research has taken several 
new directions in recent years (Droege, 1966; 
Dvorak, 1965; Dvorak, Droege, & Seiler, 
1965), such as the development of literacy 
skills and nonreading aptitude tests, and 
achievement and proficiency tests are being 


developed, the validation of the GATB for 
specific occupations is still the major com- 
ponent of the USES test research program. 

The nine aptitudes measured by the GATB 
are General Learning Ability (G), Verbal Ap- 
titude (V), Numerical Aptitude (N), Spatial 
Aptitude (S), Form Perception (P), Clerical 
Perception (Q), Motor Coordination (K), 
Finger Dexterity (F), and Manual Dexterity 
(M). Each aptitude is scaled to a standard- 
ized mean of 100 and standard deviation of 
20 based on a general working population 
sample (United States Department of Labor, 
1967). 

Each of the research studies summarized 
below resulted in a specific occupational test 
battery composed of minimum cutting scores 
on two-four GATB aptitudes. These test bat- 
teries are currently in use by approximately 
1,600 local offices of the state employment 
services. (Information on each of these studies 
as well as a historical and technical descrip- 
tion of the GATB are available elsewhere 
[United States Department of Labor, 1967].) 
Data on: (a) older studies which have been 
replaced by new validations, (6) studies 
which did not yield norms, and (c) studies 
which yielded norms considered suitable for 
single plant use only (because of small sample 
size) are not included in this summary. (It 
is estimated that 10% of USES studies fail 
to yield norms.) While USES studies have 
employed numerous types of criteria, ratings 


240 


OCCUPATIONAL VALIDITY OF THE GATB 


by supervisors or instructors have been used 
most frequently. 


PROCEDURE AND RESULTS 
Job Proficiency versus Training Criteria 


The median correlation was computed for 
each aptitude for the 74 USES studies in 
which a training criterion had been used. 
Similarly, the median correlation was com- 
puted for each aptitude in a randomly se- 
lected 20% sample of the 345 USES studies 
in which a job-proficiency criterion was used. 
Grand medians were computed for each type 
of criterion. These results are shown in 
Table 1. 

Table 1 shows that the typical aptitude- 
criterion correlation obtained in USES studies 
is approximately .22 regardless of the type 
of criterion used. However, Table 1 also 
shows that the cognitive aptitudes (G, V, N, 
S) typically correlate to a higher degree with 
training criteria than they do with job-pro- 
ficiency criteria. The additional amount of 
training variance accounted for is greatest for 
Aptitude G (approximately 9%) and least 
for Aptitude S (approximately 3%), with 
Aptitudes V and N showing about the same 
additional amount (approximately 5%). On 
the other hand, the manipulative aptitudes 
(K, F, M) tend to correlate lower with train- 
ing criteria than with job-proficiency cri- 
teria. These correlations reflect differences in 
the amount of criterion variance accounted 
for of 24% (Aptitude F) and 34% (Ap- 
titudes K and M) in favor of the job pro- 
ficiency. The perceptual aptitudes (P and Q) 
tend to have above the same level of validity 
with either job-proficiency or training criteria. 
(It should be noted that all GATB aptitudes 
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are typically included in USES research re- 
gardless of the type of criterion.) 


Concurrent versus Longitudinal Validity 


Ghiselli’s calculation of the average valid- 
ities obtained with job-proficiency and train- 
ing criteria was only an aside in his attempt 
to summarize the predictive power of differ- 
ent categories of aptitude tests and the pre- 
dictability of different categories of jobs. 
However, it would have been interesting if he 
had calculated the average validity of another 
classification of studies—concurrent versus 
longitudinal. Longitudinal validation is usu- 
ally held to be preferable from the standpoint 
of experimental design because it yields “pre- 
dictive validity,” but because of practical con- 
siderations much of the USES research and 
much of the occupational validation done else- 
where is on a concurrent basis. 

The concurrent-—longitudinal validity com- 
parison is related to the job-proficiency—train- 
ing criteria comparison because in USES re- 
search many studies utilizing training criteria 
are longitudinal. However, studies utilizing 
job-proficiency criteria are both concurrent 
and longitudinal. The median correlation was 
computed for each aptitude for the 72 USES 
studies conducted according to a longitudinal 
research design. Similarly, the median cor- 
relation was computed for each aptitude for 
a randomly selected 20% sample of the 353 
USES studies conducted according to the con- 
current research design. These results and the 
associated grand medians are reported in 
Table 2. 

Table 2 shows that the typical aptitude- 
criterion correlation obtained in USES longi- 
tudinal studies is approximately .22 while the 
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Mep1An VALipities oF GATB APTITUDES 











Type of criterion | No. studies 


G Vv N 








64-70%» 25 19 21 
67-718 39 30 31 


Job proficiency 
Training 








Aptitude 
Grand 
Mdn 
S P Q K F 
17 23 20 DS 22 2S 22 
25 22 DD 14 16 14 22 





a Ns differ slightly by aptitude. 


b 20% sample of all studies using a job-proficiency criterion. 
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TABLE 2 


MeEpIAN Vatipities oF GATB AptiTupES 








Aptitude 
Design of study | No. studies Grand 
: , Mdn 
G V N S ie Q K i 
Concurrent 66-693" 27 22 24 18 15 21 16 19 20 20 
71-728 34 22 28 26 27 22 19 21 22 22 


Longitudinal 





® Ns differ slightly by aptitude. : 
b 20% sample of all concurrent studies. 


typical correlation is .20 in concurrent studies. 
Eight of the nine aptitudes also show slightly 
higher validities with the longitudinal design, 
while there is no difference on Aptitude V. 
The largest differences in Table 2 indicate that 
an additional 5% (Aptitude P), 34% (Ap- 
titude S), and 44% (Aptitude G) of the 
criterion variance is predicted in longitudinal 
studies than is predicted in concurrent studies. 
The slightly higher validities obtained in 
longitudinal studies are not due to a greater 
range of talent in this group as the medians 
of the aptitude standard deviations in the 
studies shown in Table 2 are either the same 
(S, K, F) or 1 point higher for the concurrent 
studies. The grand median standard devia- 
tions are .17 for concurrent studies and .16 
for longitudinal studies. 


Predictor Combinations 


One consideration which Ghiselli omitted 
was the average validity obtained when sev- 
eral predictors are used. In the typical study, 
the multiple correlation technique is used to 
determine the best possible weighting of pre- 
dictors. As pointed out by Dvorak (1956), 
the USES considers the multiple cutoff method 
to be superior to the multiple correlation 
method for USES occupational validation. 
Since that time USES has begun using the 
phi coefficient rather than the tetrachoric 
correlation to report the relationship between 
the variable dichotomies (“high” or “low” 
on the criterion versus “pass” or “fail” on a 
battery consisting of minimum qualifying 
scores on two—four aptitudes). 

Table 3 shows the median phi coefficients 
obtained in USES studies when divided into 
the following categories: job proficiency and 
training, and concurrent and longitudinal. 


The difference between the median phi coef- 
ficient for the concurrent type of study (.40) 
and the longitudinal type of study (.45) is 
greater than the difference between job-pro- 
ficiency criterion studies (.40) and training 
criterion studies (.42). These results are gen- 
erally consistent with those shown in Tables 
1 and 2. 


Additional Analyses 


Two preliminary analyses were performed 
in an attempt to relate USES results to differ- 
ences between studies with job proficiency and 
training criteria reported by Ghiselli. 

Ghiselli reports grand average validities of 
.19 with job-proficiency criteria and .30 with 
training criteria. Ghiselli’s figures appear to 
be based upon average correlations obtained 
in 107 matched sets of studies in which both 
job-proficiency and training criteria had been 
collected. The few USES batteries in which 
both job-proficiency and training criteria had 
been collected on the same occupation (11 
matched sets) were similarly summarized. 
The results showed a median correlation of .24 
for all aptitudes with job-proficiency criteria 
(97 correlations) and .26 with training cri- 


TABLE 3 
MEDIAN VALDITY oF Batrertes or GATB AptirupEs 








Type of study Mdn phi coefficient 
Job proficiency (V = 345) 40 
Training (VN = 74) 42 
Concurrent (V = 353) 40 
Longitudinal (V = 71) 45 





Note.—Five studies in which a combined job-proficiency— 
training criterion was used were eliminated from the job- 
proficiency-training comparison but included in the concur- 
rent—-longitudinal comparison. 
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teria (93 correlations). Although the differ- 
ence between the two median validities is 
slight, it is in the same direction as that re- 
ported by Ghiselli. 

Before all of the data upon which the 
medians reported in Table 1 and 2 were 
readily available, an exploratory study was 
conducted. In this study, the highest of the 
nine aptitude-criterion correlations obtained in 
each of 282 USES studies with a job-pro- 
ficiency criterion and 48 studies with a train- 
ing criterion (a total of 330 studies) was 
singled out and the median of these distribu- 
tions was obtained as a method for comparing 
training and a job-proficiency criteria. The re- 
sults showed a higher median validity for 
studies with a training criterion (.44) than 
for studies with a job-proficiency criterion 
(.38). A parallel study was conducted with 
these data to compare median correlations 
from concurrent and longitudinal studies. 
Similar results were obtained. (Longitudinal 
studies yielded the higher validity.) 


CONCLUSIONS 


Cognitive aptitudes (G, V, N, S) are more 
useful for predicting training criteria than for 
predicting job-proficiency criteria. Manipula- 
tive aptitudes (K, F, M) are more useful for 
predicting job-proficiency criteria. However, 
considering the grand median validity for all 
aptitudes, the same level of prediction (.22) 
is obtained regardless of the type of criterion. 
The difference in the types of criteria which 
the cognitive and manipulative aptitudes 
predict best points to the need for a mul- 
tiple-hurdle criterion for many validation 
studies. Different aptitudes (or other pre- 
dictors) may be correlated with success in the 
book-learning and the work-performance 
aspects of learning the same job. A multiple- 
hurdle criterion has been used in some USES 
studies where it was recognized that the job- 
performance criterion could not be considered 
the sole measure of occupational success. 

Most aptitudes (especially G, S, and P) 
tend to yield better validity in longitudinal 
studies than in concurrent studies. This may 
mean that validity coefficients based upon 
concurrent studies tend to give underestimates 
of the predictive validity of a test used in 
selection, ignoring of course the possibility of 
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validity shrinkage in cross-validation. This 
fact and the median aptitude standard devia- 
tions obtained in longitudinal and concurrent 
studies seem to cast some doubt on the inate 
superiority frequently assumed for longitudinal 
studies. If the validity reported for a longi- 
tudinal study is based upon the aptitude test 
battery-criterion relationship for only those 
people who complete the minimum training 
time and are on the job at criterion collection 
time, the range of talent in the research sam- 
ple may be no different than that which would 
be found in a comparable, but completed 
sooner, concurrent study. Procedures are being 
instituted in USES research to try to identify 
individuals who do not remain in the research 
sample until criterion collection time because 
of inability to learn or perform the job. Such 
individuals will be included in the low cri- 
terion group or the final four-fold table used 
to reflect the validity of test batteries de- 
veloped by USES. 

The above criticism of the longitudinal 
design is not meant to belittle other ad- 
vantages of this design such as: (a) elimina- 
tion of possible length of service contamina- 
tion in the criteria, (b) elimination of pos- 
sibility of higher scores on a predictor due to 
information or tasks learned in conjunction 
with the job, and (c) similarity of research 
sample to applicant sample. 

Plans are now being made for additional 
analyses of USES data which will be similar 
to Ghiselli’s summary of aptitude-criterion 
correlations obtained with various categories 
of jobs. For these analyses jobs will be 
grouped according to the new USES occupa- 
tional structure (United States Department of 
Labor, 1965) rather than the earlier USES 
structure or the special structure which were 
both used by Ghiselli. 

Research along several lines must be in- 
tensified if the modest but useful validities 
reported above are to be raised. These lines 
are: 


1. Methods of criterion improvement (about 
which much is said but little is done). 

2. Use of moderator variables (recent work 
by Hobert & Dunnette, 1967, is encouraging). 

3. Use of variables such as interest tests 
and biographical inventories in addition to 
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aptitude tests in the prediction equation (in- 
terest tests offer a possibility for a significant 
and early increase in validity coefficients only 
if a systematic and acceptable way of using 
interest tests in routine selection situations 
can be derived). 


None of the three areas of research men- 
tioned above is new or unexplored, but they 
have not received anywhere near the same at- 
tention as aptitude test research either by 
USES or other researchers. USES is now 
moving into a research program which will at- 
tempt to improve criteria and explore the use 
of moderator variables and nonaptitude pre- 
dictors. 


TECHNICAL NOTES 


The median correlation is used to summa- 
rize the USES data rather than the weighted 
numerical average used by Ghiselli because 
of the computational problem involved in 
transforming phi coefficients for purposes of 
averaging. However, the weighted numerical 
average was computed for a number of sub- 
samples of aptitude-criterion correlations and 
was found to be approximately the same as 
the median. 

Although a few studies are based on sam- 
ples of less than 50 individuals (older studies 
or cross-validation studies) and a few studies 
are based on samples of several hundred in- 
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dividuals, the majority of the studies (about 
80%) are based on Ns of 50-75. The bound- 
aries of this range are dictated at the lower 
end by a long-standing USES requirement 
that norms must be based on samples of at 
least 50 in order to be used nationally and at 
the upper end by the practical problem of 
trying to find more than 75 people performing 
one specific job. 
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Hedonic, quality, action tendency, and quality-action tendency rating scales, 
each designed to evaluate the taste of domestic water, were constructed. Near 
linear relations were found between equal appearing and successive interval 
scale values for the items comprising each scale. Average reliability coefficients 
for scales ranged from .62 to .77 in the laboratory work here reported. Mean 
scale ratings of sodium solutions were found to differ significantly by anion and 
by concentration level on each scale. In consumer survey research, the quality 
rating scale was found to correlate .83 with previously developed attitude 
scales, and to be significantly and linearly related to the total amount of com- 
mon mineral present in a water supply. Use of the rating scales to develop 
limiting standards for minerals, and for other constituents commonly found 


in domestic water, was discussed. 


A research project has been initiated (Bru- 
vold, Ongerth, & Dillehay, 1967; Dillehay, 
Bruvold, & Siegel, 1967) which is intended 
to produce objective data that can be used to 
recommend limiting standards for common 
mineral content in domestic water. The de- 
sired standards are intended to insure the 
potability of water served to the public. Lack 
of research regarding the relationship be- 
tween amount of common mineral content in 
water and potability has prevented the United 
States Public Health Service (1962), and re- 
sponsible state agencies, from establishing 
definitive standards for these minerals in 
drinking water. The absence of definitive 
standards means that new public water supply 
permits may be issued for unpotable waters. 
This problem is particularly acute in rapidly 
growing population centers located in the 
southwestern United States. Also, the absence 
of the needed standards serves to impede the 
action necessary to provide all consumers with 
fully potable water for daily consumption. 

The present report outlines the construc- 
tion and evaluation of four rating scales that 
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can be used to describe quantitatively the re- 
lation between common minerals, or other dis- 
solved constituents, and the taste of water. 
Three of the scales were new, and one repre- 
sented a refinement of a scale used in an 
earlier work (Bruvold & Pangborn, 1966). 
Specifically, the present paper has three major 
aims. The first is to describe procedures used 
to construct the four scales. The second is to 
report results from a laboratory study de- 
signed to evaluate the four scales. The third 
is to report results of consumer surveys de- 
signed to evaluate the quality rating scale. 


SCALE CONSTRUCTION 


Combination Scale 


A rating scale constructed by rank ordering 
statements used by Ss to describe the taste 
of water in a free response setting was em- 
ployed in initial laboratory research (Bruvold 
& Pangborn, 1966). This rating scale con- 
tained 14 items, each item consisting of a 
quality statement in combination with an 
action tendency statement. These items were 
the first to be scaled. 


Method. The 14 combination items were randomly 
arranged on a rating form with 81 additional items 
describing the taste of water in general terms. All 
95 items were presented as the third part of a 
mimeographed booklet. The first part of the booklet 
contained 105 adjectives evaluating the taste of 
water, while the second part contained 73 statements 
generally evaluating water quality. The three sets of 
items, minus the combination statements dealt with 
here, formed the item pools from which three at- 
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titude scales have been constructed (Dillehay et al., 
1967). 

The items were judged by 44 Ss (Dillehay et al., 
1967). Judgments were made on the usual 11-cate- 
gory scale employed in the method of equal ap- 
pearing intervals; standard instructions associated 
with this method were used (Edwards, 1957). All 
Ss judged all items in the same order. 

Results. The median category assignments 
and Q values were determined for each com- 
bination item using standard analytical pro- 
cedures (Edwards, 1957). Also, means and 
standard deviations for each item were de- 
termined by computer using the iterative 
least-squares method of successive intervals 
developed by Diederich, Messick, and Tucker 
(1957). Weights of 2, 1, and 0 were employed 
in the latter analysis with cutoff points of 
+0.7 and +1.4. The relationship, over all 
items, for equal appearing and successive in- 
terval values, was very nearly linear. 

Nine items were selected from the total 
group so that scale distances between succes- 
sive items were approximately equal. Items 
forming the combination scale are shown in 
Table 1; scale and dispersion values are 
listed in the last two columns of Table 2. 


New Scales 


Three new rating scales were constructed 
after the work on the combination scale had 
been completed. One scale contained hedonic 
statements, one quality statements, and one 
action tendency statements. The three scales 
constructed were analogous to those described 
by Schutz (1965) for use with food products. 
The procedures employed to rate statements 
were those suggested by Torgerson (1958) in 
a discussion of subjective estimate methods. 


Method. All items were phrased to refer directly 
to water. Nine hedonic items were constructed by 
rephrasing items from the scale used by Schutz 
(1965); 23 new hedonic items were also constructed. 
Thirty-four quality and 18 action tendency items 
were constructed using the same free responses to 
water samples employed to develop the original 
combination scale. An attempt was made to cover 
systematically the entire range of reaction to the 
taste of water in preparing each group of items. 

All items from each group were typed, one to a 
card on small plain white cards and labeled, at ran- 
dom, with two digit numbers. The three groups of 
cards were presented, in a random order, to 53 Ss, 
all of whom were employees of the California State 
Department of Public Health. The Ss placed each 
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item into 1 of 11 categories. Standard equal appear- 
ing interval instructions were used. The Ss worked 
alone to complete the judgment task after receiving 
appropriate instructions. Item order was newly ran- 
domized for each group of cards before an S per- 
formed the judgment task. 

Results. The judgments of three Ss were 
discarded due to an apparent reversal of the 
prescribed method of assigning statements to 
categories. Median and Q values (Edwards, 
1957), and means and standard deviations 
(Diederich et al., 1957), were determined 
separately for each set of items employing 
the judgments of the remaining 50 Ss. Weights 
of 2, 1, and O were used with cutoff points 
of +0.8 and +1.6 in the successive intervals 
analysis. Plots made separately for each set 
of items showed near linear relations between 
successive and equal appearing interval values 
in all three cases. The plot for hedonic items 
is shown in Figure 1. Median, mean, Q, and 
standard deviation values for all 98 statements 
are available (Bruvold, 1967). 

Quality and action tendency scales were 
constructed to provide approximately equal 
distances between items. Items for the hedonic 
scale were selected to match those forming 
the scale commonly used in the sensory evalua- 
tion of foods (Schutz, 1965) even though the 
distance between successive items was not 
approximately equal. The items comprising 
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SI Values 
SI Values (Jones & Thurstone, 1955) 





EAI Values 


Fic. 1. Plot of equal appearing and successive in- 
terval scale values for standard hedonic items. 
(Circles represent Jones and Thurstone, 1955, data. 
Filled squares represent items chosen for the water 
taste hedonic scale. The lines were fitted by eye.) 


SCALES FOR RATING THE TASTE OF WATER 


TABLE 1 


ITEMS FOR THE Four TASTE SCALES 


Hedonic scale 


Quality scale 





I like this water extremely. 

I like this water very much. 

T like this water moderately. 

I like this water slightly. 

T neither like nor dislike this water. 
I dislike this water slightly. 

I dislike this water moderately. 

I dislike this water very much. 

I dislike this water extremely. 
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. This water has an excellent taste. 

. This water has a very good taste. 

. This water has a good taste. 

. This water has a slightly good taste. 
. This water has a neutral taste. 

. This water has a slightly bad taste. 
. This water has a bad taste. 

. This water has a very bad taste. 

. This water has a horrible taste. 
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Action Tendency scale 


Combination scale 





1. I would be very happy to accept this water as my 
everyday drinking water. 

2. I would be happy to accept this water as my 
everyday drinking water. 

3. I am sure that I could accept this water as my 
everyday drinking water. 

4. I could accept this water as my everyday 
drinking water. 

5. Maybe I could accept this water as my everyday 
drinking water. 

6. I don’t think I could accept this water as my 
everyday drinking water. 

7. I could not accept this water as my everyday 
drinking water. 

8. I could never drink this water. 

9. I can’t stand this water in my mouth and I 
could never drink it. 





1. This water tastes real good. I would be very happy 
to have it for my everyday drinking water. 

2. This water tastes good. I would be happy to have 
it for my everyday drinking water. 

3. This water has no special taste at all. I would be 
happy to have it for my everyday drinking water. 

4, This water seems to have a little taste. I would be 
satisfied to have it for my everyday drinking water. 

5. This water has a mild bad taste. I could accept it 
as my everyday drinking water. 

6. This water has a fairly bad taste. I think I could 
accept it as my everyday drinking water. 

7. This water has a bad taste. I don’t think I could 
accept it as my everyday drinking water. 

8. This water has a real bad taste. I don’t think 
I would ever drink it. 

9. This water has a terrible taste. I would never 
drink it. 








each scale are shown in Table 1. Scale and 
dispersion values are listed in Table 2. 


LABORATORY STUDY 


Research in a laboratory setting was per- 
formed with the four scales in order to ob- 


tain an assessment of the reliability of each 
rating scale and to compare, across scales, the 
magnitude of mean reliability coefficients. 
Also the research was designed to yield some 
information regarding the validity of the 
scales. In general, judges should rate synthetic 


TABLE 2 
SCALE AND DISPERSION VALUES FOR SELECTED TASTE SCALE ITEMS 








Hedonic scale Quality scale 


Action Tendency scale Combination scale 











Item no. EAI SIb EAI SI EAI SI EAI SI 

Mdn Q M SD | Mdn Q M SD | Mdn Q M SD | Mdn O M SD 
1 10.66 | 1.02 As |\e2O tO. 1G 2.07 | .42 ODOM eta de 1.39} .58] 9.54 | 1.76 ISS) 36 
2 9.94 | 1.49 1.48 | .34 9.79 | 1.31 een: 9.20 | 1.86 1.17 | .45 |] 8.04 | 1.67 15 Ose oS 
3 7X0) Rasy -38 | <9 8.45 | 1.44 1.32 | .37 8.07 | 2.01 sii} ok || CHE |) ate On mea O. 
4 6.98 -60 .04 | .23 7.16 74 SO eke) 7.39 9 | 2.40 0M 2929) 6:25 |) 2502 Oe et 
) 5.98 54 Zn es) 6.01 96 -38 | .09 5.64 | 2.02 =A 04.) 5.00 -84 —.02 |) «3S 
6 4.87 -66 Sat leer 4.61 | 1,11 20K. Sz 4,21 1.63 — WOM 4527 1.83 Sil 609 
7 3.98 | 1.02 5G) | ede) 2.95 | 1.40 SOM eS 2.65. | 1.68 | —1.29' |, .47 | 3.18 | 1.86 —.64| .34 
8 TSOm | 1e2O))) AS81231 DOOM el2 eel O7 29) tc OL A dno Wert 1.95) |e1s05) |) 05) 9.37, 
2 1.24 | 1.07 | —1.98 | .84 1.16 66 | —1.54] .40 1.05 56 | —2.90 | 1.02 | 1.33 | 1.47 | —1.61 | 1.16 





a Equal appearing intervals. 
b Successive intervals. 
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water samples containing a lesser amount of 
solute more favorably than samples contain- 
ing a greater amount of the same solute 
(Bruvold & Pangborn, 1966). Further, it is 
expected that ratings of equal concentration 
sodium solutions should vary significantly by 
anion, with sulfate and bicarbonate solutions 
receiving the most favorable ratings, chloride 
intermediate ratings, and carbonate the least 
favorable ratings (Bruvold & Pangborn, 
1966). 


Method. Twenty-four individuals served volun- 
tarily as Ss in the laboratory study. All were em- 
ployees of the California State Department of Public 
Health. Twelve Ss were male and 12 were female. 
Some Ss had participated in earlier research which 
involved rating the taste of water samples. None, 
however, was asked to serve in the present study on 
the basis of quality of performance in the earlier 
work. No S was highly experienced in the sensory 
evaluation of taste, odor, or flavor. 

The nine statements comprising a scale appeared, 
unnumbered, along the left-hand edge of a mimeo- 
graphed data sheet. The statements were ordered 
from favorable to unfavorable, starting with the 
uppermost statement. To the right of each statement 
there appeared a row of 10 cross marks. The nine 
rows of 10 cross marks each were also ordered by 
column to correspond to 10 water samples labeled 
by code letters. Instructions for the rating procedure 
appeared at the top of each mimeographed data 
sheet. 

Ten water samples were used in the laboratory 
work. Double distilled water was used as one sample 
as was Berkeley tap water. The latter is an odor- 
free water containing approximately 85 mg/l of 
total dissolved solids (TDS). Solutions prepared in 
the laboratory comprised the eight remaining water 
samples. Reagent grade NaHCOs, NazSO., NaCl, and 
NasCOs; were used as solutes, and double distilled 
water as the solvent, in preparing solutions of 1,000 
and 2,000 mg/1]. Each solute was represented at each 
concentration level. 

Each S$ took part in nine separate rating sessions. 
Successive sessions for each S were separated by 1 
wk. The first session was used only as a practice 
session. The remaining eight sessions were divided 
into two blocks of four sessions each. Each scale was 
used once by each S during each block. A list of the 
24 possible orders of presentation for the four scales 
was tabulated. The order in which Ss used the scales 
was determined by random matching of Ss and 
orders separately for Block 1 and for Block 2. The 
practice session involved the same scale as Session 
1 of Block 1. The average number of interpolated 
sessions intervening between like scales, over all Ss, 
was 3.00 for the hedonic scale, 3.00 for the quality 
scale, 3.08 for the action tendency scale, and 2.92 for 
the combination scale. 

The actual rating task required S to take three 
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TABLE 3 


RELIABILITY COEFFICIENTS FOR THE 
Four TAstTE SCALES 








Frequencies 
Reliability Ne ite 
coefficient : 7 ction “ombi- 
Hedonic | Quality Tendency! snatite 
— acale scale scale 
.90-.99 1 2 1 2 
.80-.89 4 6 4 8 
.70-.79 5 8 5 9 
-60-.69 4 4 4 4 
.50-.59 5 D, 5 1 
.40-.49 3 1 1 0 
.00-.39 2 1 4 0 














separate mouthfuls of each sample, to record his 
rating on the data sheet by circling the appropriate 
cross mark, to rinse his mouth thoroughly with 
Berkeley tap water three times, and to rest for 30 
sec. before rating the next sample. The instruc- 
tions were read to each S$ at the start of the practice 
session by the experimenter who remained with S$ 
throughout this initial session to see that all pro- 
cedures were correctly followed and that the 10 
samples were rated in the order specified. The Ss 
worked alone in all remaining sessions. 

The rating was conducted in a small, quiet, air- 
conditioned room where the temperature was main- 
tained at 70° +2°F. All water samples were served 
at room temperature in letter-coded 100-ml. beakers 
filled to the 75-ml. level. Order of sample presenta- 
tion was randomized for each individual rating ses- 
sion. A different coding system was used for each of 
the 9 wk. of the study. 


Results. All analyses performed upon the 
laboratory results used equal appearing in- 
terval scale values as raw data. Standard 
integer values of one through nine were not 
employed because of inequality of item spac- 
ing especially evident in the hedonic scale. 

Pearson product-moment correlation coef- 
ficients were computed for all combinations 


TABLE 4 


AVERAGE INTERCORRELATIONS BETWEEN THE 
Four RatTinc SCALES 











Scale Hedonic | Quality eae aa 
Hedonic .63 
Quality .70 af 
Action 
Tendency .66 .69 .62 
Combination .70 af) mila SO 
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TABLED S 
MEAN RaTINGs FOR WATER SAMPLES USED IN THE LABORATORY STUDY 
. Concentration . , Action Tendency} Combination 
Solution inmesl Hedonic scale Quality scale eee seule 
Distilled nil 7.01 Feo 7.60 6.91 
Berkeley tap 85 7.59 7.67 8.36 138 
NaeSO4 1,000 6.07 5.79 6.88 6.08 
NaHCO; 1,000 5.40 Sal 5.79 5.36 
NaC1 1,000 4.85 4.49 5.43 5:33 
Na2CO3 1,000. 3.34 ei 3.58 3.30 
NaSO4 2,000 SE 4.80 5.90 5533 
NaHCO; 2,000 4.68 4.70 eS 4.65 
NaC1 2,000 Salo 3.61 3.98 Siu 
Na2COs3 2,000 1.82 1.81 1.95 2.14 




















of scales and sessions separately for each S. 
Thus, 28 correlation coefficients were obtained 
per S. Each correlation coefficient describes 
the relation between ratings given identical 
water samples for a particular pair of rating 
sessions. Of the 28 correlation coefficients, 4 
involve like scales, and therefore these coef- 
ficients may be taken as measures of the con- 
sistency or reliability of rating by a single 
S. The reliability coefficients for all Ss and 
scales are reported in Table 3. Individual re- 
liability coefficients were transformed from 7 
to z values and then subjected to a Treat- 
ment (Scales) X Subjects analysis of vari- 
ance. An F ratio of 4.48 (df = 3/69) was 
obtained for scales. This ratio is significant at 
the .05 level for a conservative test using 1 
and 23 degrees of freedom (Winer, 1962). 
The average intercorrelations between scales, 
over all sessions and Ss, are shown in Table 4. 

Mean ratings averaged over sessions and 
Ss for each sample and each scale are shown 
in Table 5. The results of repeated-measures 
analyses of variance (Winer, 1962), analyses 
using each S’s average rating for each water 
sample over the two sessions involving like 
scales as raw data, are shown in Table 6. 
Sex of S, solute concentration, and type of 
mineral solute formed the three factors in these 
analyses. Ratings for distilled and Berkeley 
tap water were not included in the analyses 
of variance. 


CONSUMER SURVEYS 


The rating scales developed can be used by 
Ss to rate water samples in a laboratory set- 


ting, and they can also be used by consumers 
to rate the taste of water received for daily 
consumption. The consumer research was de- 
signed to provide an assessment of the re- 
liability of the quality rating scale by cor- 
relating results from it with attitude scale 
scores. Further, the surveys were designed to 
provide some evaluation of the validity of the 
quality scale by correlating scale ratings with 
TDS in the waters studied. Theoretical no- 
tions concerning consumer attitudes toward 
water clearly indicate that there should be an 
imperfect, but substantial, negative correla- 
tion between TDS and quality scale ratings 
(Dillehay et al., 1967). 


Method. Four water systems serving communities 
in the Central California coastal area were selected 
as testing sites for the quality rating scale. Each 














TABLE 6 
ANALYSES OF VARIANCE FOR LABORATORY RATINGS 
: . Action |Combi- 
Source df dee ey Tendency | nation 
MS MS 
Sex (A) 1 0.28 0.40 1.30 2.59 
Ss within groups] 22 3.16 1.82 6.61 4.43 
Concentrations 
B 1 | 53.47** | 30.13** 64.03** | 52.21** 
AXB 1 0.16 1.47 1.61 1.05 
B X Ss within 
groups 22 0.51 1.78 1.15 0.47 
Minerals (C) 3 | 84.30** | 76.69** | 114.91** | 77.85** 
AG 3) 0.39 1.59 0.52 0.93 
C X Ss within 
groups 1.30 1.60 Dah: 1.30 
BEGG 3 1.47 2.14 2.82* 1.93* 
ACXTBE XC 3 0.79 1.34 1.66 0.21 
B XC X Ss 
within groups | 66 0.95 0.94 1.02 0.62 
#5 <.05. 
¥*k D <.001. 
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TABLE 7 
DATA FOR THE FouR COMMUNITIES SURVEYED 
Midrange mineral content in mg/l 
Water Mean | Mean | Mean No. re- 
system 7 : ATS |AATS| TSR | spondents 
TDS Ca Mg Na || HCO; | ‘Cl NO 3 SO4 
Los Osos 123 5 5 V9 32 26 9 3} 8.84 | 8.74 9.80 548 
San Miguel 585 41 42 82 286 73 14 Pit) 7664 (aod 8.51 55 
El Rio 1,001 140 53 80 266 45 12 438 | 7.28 6.89 7.42 59 
Guadalupe | 2,078 | 287 128 122) 288 154 14 LOLGH | S226u eam toe20 518 


a One respondent’s TSR missing. 


system surveyed was supplied by odor-free water 
sources as determined by on-site odor detection 
threshold tests (American Public Health Association, 
1965), and no source contained iron or manganese 
in amounts greater than .05 mg/I. No system chlo- 
rinated, and all were supplied by wells whose com- 
mon mineral content was relatively constant over the 
system and across the seasons. Chemical analyses and 
other data for each community water system sur- 
veyed are shown in Table 7. 

Selection of respondents from each community was 
accomplished by a careful sampling procedure that 
involved listing all dwelling units within the geo- 
graphical area served by the water system under 
study, the selection of approximately 60 dwelling 
units by a random sampling procedure modified to 
distribute selected dwelling units uniformly over the 
area, and, finally, the random selection of one Eng- 
lish-reading individual 15 yr. of age, or older, from 
all such individuals residing at each selected dwelling 
unit. Each respondent participating in the survey 
was first given an interview, then three attitude 
scales (Dillehay et al., 1967), and finally the quality 
rating scale. The interview and scales were ad- 
ministered at the respondent’s home by a trained 
interviewer working for the California State Depart- 
ment of Public Health. 

Each participating respondent drew a glass of water 
from his kitchen tap before the taste scale rating 
procedure was initiated. Having obtained a glass of 
water, the respondent was given a copy of the qual- 
ity scale in which the nine items appeared unnum- 
bered and ordered from most to least favorable. 
The interviewer read each item comprising the scale 
to the respondent. The respondent was then in- 
structed to take three mouthfuls of water from the 


TABLE 8 


INTERCORRELATIONS BETWEEN TDS anp 
CONSUMER SCALE SCORES 








AATS ATS TSR 
ATS 93 
TSR 83 84 
DS Od —.6/ —.67/, 


glass, to study the scale items carefully between 
mouthfuls, and to indicate, after taking the third 
mouthful, which statement most accurately expressed 
his rating of the taste of the water. 


Results. In El] Rio, 100% of the selected 
respondents participated in the survey. Anal- 
ogous percentages were 93, 95, and 95, respec- 
tively, for San Miguel, Los Osos, and Guada- 
lupe. 

Intercorrelations between quality taste scale 
ratings (TSR), attitude taste scale scores 
(ATS), attitude adjective taste scale scores 
(AATS), and TDS as milligrams per liter are 
shown in Table 8. Each correlation coefficient 
is based upon scale responses obtained from 
all four towns. Correlation coefficients for 
taste scale ratings involve 217 respondents, 
while those for attitude scale scores involve 
219 respondents. Equal appearing interval 
scale values were used in all TSR analyses. 

An analysis of linear regression (McNemar, 
1962) was performed for each set of scale 
scores and TDS midranges. The results of 
the analyses of variance are shown in Table 9. 
A plot of TSR, ATS, and AATS means against 


TABLE 9 


ANALYSES OF VARIANCE FOR CONSUMER 
SURVEY RESULTS 











ATS AATS TSR 
Source 
df MS af MS df MS 

Linear 

regression 1 | 340.88* 1 | 333.74* 1 | 573.63 
Deviation from 

line 2 2.68 2 4.71 Z 2.25 
Between-array 

means Si |et15.50% 3 | 114.38* 3 | 192.70* 
Within arrays | 215 1.90 DiS 2.35 213 Oe, 
Residual 217 1.91 217 add 215 3.31 


NEE Eee eee 
= Pe< 001, 
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TSR(Y=-0.00230 X+9.91) @ 
ATS (Y'=-0.00I77 X + 8.92) @ 


a AATS(Y=-0.00174 X+ 8.69) 


Mean Scale Scores 





° 1,000 2,000 


TDS in Milligrams per Liter 


Fic. 2. Lines of best fit for TDS midranges and 
mean scale scores. 


TDS midranges, and the line of best fit for 
each scale, is pictured in Figure 2. 


Discussion 
Scale Construction 


Edwards (1957) holds that the relation be- 
tween equal appearing and successive interval 
scale values for the same set of items will be 
linear only in the middle range, with succes- 
sive interval values becoming more extreme 
near the ends of the continuum. Edwards’ 
prediction was clearly borne out by the rela- 
tion between successive interval values for 
general hedonic scale items reported by Jones 
and Thurstone (1955) and equal appearing 
interval values for water taste hedonic scale 
items. This result is pictured in Figure 1. 
However, none of the plots relating successive 
and equal appearing interval values for the 
four sets of items here scaled gave clear sup- 
port to Edwards’ assertion. Instead, the overall 
linearity obtained between equal appearing 
and successive interval values shows that the 
two methods yielded equivalent results for 
the statements scaled in this research. 


Laboratory Study 


The reliability coefficients based upon the 
laboratory work varied substantially across 
Ss. Such a result was not unexpected since Ss 
for this study were volunteers not highly ex- 
perienced or skilled in sensory evaluation 
work. Nevertheless, 55 of the 96 individual 
reliability coefficients were equal to, or greater 


Zs 


than, .70. The results indicate that, using 
water samples similar to those employed in 
this research, it should be quite easy to ob- 
tain a taste panel of Ss whose individual re- 
liability coefficients exceed .70 for any of the 
four scales developed, and that it would be 
reasonable to seek a panel having individual 
reliability coefficients of .80, or higher, for the 
quality or the combination scales. 

Schutz (1965) found an action tendency 
scale to be superior to an hedonic scale in rat- 
ing food products. In terms of reliability, no 
such advantage was found for the action 
tendency scale in the present study. The sta- 
tistically, and practically, significant differ- 
ences between mean reliability coefficients 
were not fully anticipated. The lower relia- 
bility of the action tendency scale may be due 
to the difficulty an S encounters in predicting 
his actual response to a certain mineralized 
water solely on the basis of laboratory ex- 
posure to that water. The lower reliability of 
the hedonic scale may be due to the rather 
gross inequality of item spacing along this 
continuum. Whatever the cause of the find- 
ings, the results of the reliability study, the 
spacing of items, the relative size of the dis- 
persion values, and simplicity of phrasing, all 
indicate that the quality scale is likely the 
most useful of the four here investigated. This 
judgment must be tempered by the purpose 
of the research in which the scale is to be 
employed, and also, perhaps, by future find- 
ings involving additional assessments of scale 
quality. 

The analyses of variance performed upon Ss’ 
average ratings for sodium solutions showed 
that there was a significant difference between 
means associated with sample concentration 
on each of the four scales. This finding is well 
illustrated by Table 5. Each 1,000 mg/1] sam- 
ple received a more favorable mean rating 
than the corresponding 2,000 mg/l] sample. 
This result supports the validity of the four 
scales as it was expected that general taste 
quality would deteriorate as amount of min- 
eral dissolved in water increased. 

The analysis of variance on Ss’ average 
ratings also showed that there was a significant 
F ratio associated with solution anions. This 
result parallels earlier findings with the 14- 
item combination scale (Bruvold & Pang- 


Az 


born, 1966). It now seems well established 
that mean taste scale ratings for equal con- 
centration sodium solutions will clearly rank 
by anions, with sulfate and bicarbonate rank- 
ing highest, chloride intermediate, and car- 
bonate lowest. As pointed out in connection 
with the earlier work, this rating is identical 
with the rank order of detection threshold 
values for sodium salts in water (Bruvold & 
Pangborn, 1966). The congruence of present 
findings with earlier ratings and with detection 
threshold results further supports the validity 
of the rating scales here described. 


Consumer Surveys 


The correlations between the quality rating 
scale and the two attitude scales may be con- 
sidered as reliability coefficients since both 
procedures were designed to measure the 
same attribute. Such a reliability coefficient 
is most closely related to a coefficient of 
equivalence obtained by administering equiva- 
lent forms of an instrument in close succes- 
sion. In addition to usual error variance due 
to chance factors and differences in item con- 
tent, the present measure of reliability will be 
affected by differences between the rating and 
attitude scale methods of obtaining an in- 
dividual score. Since the obtained correlation 
coefficients of .83 and .84 are substantial in 
spite of the additional source of unreliability, 
it may be concluded that the quality rating 
scale possesses reliability sufficient for use in 
survey research of the kind reported in this 
paper. Further, it should be noted that the 
correlation between TDS and scale scores was 
of similar magnitude for both techniques of 
measurement. 

Reliability, in the broader sense of con- 
sistency of results, may also be evaluated by 
comparing the regression of rating and at- 
titude scale scores on TDS. The analyses re- 
ported in Figure 2 and Table 9 show that 
means for all three scales were linearly related 
to TDS midranges for towns. The rating 
scale means fit a line of somewhat steeper 
slope than those fitting attitude scale means; 
however, such a difference in slopes would 
not be of much practical importance in recom- 
mending a limiting standard for total mineral 
content in domestic water. The degree of con- 
sistency obtained for the regression results 
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provides further evidence that the rating 
scale possesses reliability sufficient for use in 
consumer survey research. 

The significance of differences between 
mean ratings for scale scores over towns, and 
the significance of the negative slope fitting 
the scale means and TDS midranges, provides 
evidence supporting the hypothesis that there 
is an inverse relationship between favorability 
of consumer attitudes toward water and its 
total common mineral content (Dillehay et 
al., 1967). These results support the validity 
of the rating scale, and provide further sup- 
port for the validity of the attitude scales. 

The linear relation between TDS midranges 
and rating scale scores for the four water sys- 
tems studied may appear to be at variance 
with laboratory work which showed that im- 
portant differences existed between mean rat- 
ings for equal concentration mineral solutions. 
The apparent difficulty is resolved by noting 
that, with the exception of bicarbonate for 
San Miguel and chloride for El] Rio, the 
relations between ionic concentrations and 
TDS were approximately linear across towns. 
Nothing in the laboratory work completed to 
date suggests that a nonlinear relation will be 
obtained between TDS and mean taste scale 
ratings when each mineral ion present main- 
tains a relatively constant proportion to TDS. 
It is also worth noting that the more mineral- 
ized natural waters surveyed were relatively 
low in chlorides and high in sulfates. This fact 
probably explains the rather moderate mean 
ratings given El Rio and Guadalupe water. 
Had these two waters been relatively high in 
chlorides and low in sulfates, more negative 
ratings would very probably have been ob- 
tained. 


CONCLUSIONS 


The performance of the quality rating scale 
in the laboratory and consumer survey re- 
search reported here indicates that it may 
now profitably be employed in future work 
designed to recommend limiting standards 
for common mineral content in domestic water 
(Bruvold et al., 1967). Consumer ratings ob- 
tained with the quality scale represent mea- 
surement of the attribute of interest by the 
judgment method (Torgerson, 1958). The 
previously constructed attitude scales (Dil- 


SCALES FOR RATING THE TASTE OF WATER 


lehay et al., 1967) obtain a measure of the 
attribute of interest by the response method. 
Use of both judgment and response methods 
in the consumer survey research will give ad- 
ditional confidence to the standards finally 
recommended. 

Taste panel ratings obtained using the 
quality scale and water samples from each 
community surveyed may also be extremely 
useful in establishing the desired standards. 
The taste panel ratings may be considered 
analogous to chemical analyses performed 
upon water samples from a particular com- 
munity in the laboratory. Results from the 
chemical analyses, such as TDS values, are 
employed to describe the relation between 
mineral content in water and potability as 
operationally defined by consumer attitude or 
rating scale scores. If it is found that the 
mean taste panel ratings allow for significant 
improvement in describing the relation be- 
tween consumer responses and results from 
laboratory analyses, taste panel ratings may 
be incorporated into a regression equation 
that would be used to predict the potability of 
mineralized waters not assessed in the con- 
sumer survey work. Here the regression equa- 
tion relating results of laboratory analyses to 
consumer response comes to embody the de- 
sired standard, and, as indicated, inclusion 
of taste panel data in this equation may con- 
siderably increase its accuracy. 

The newly developed attitude and rating 
scales can also be used to assess flavor in 
water produced by dissolved constituents other 
than common minerals. In fact, increasing in- 
terest in defining and protecting water quality 
(Bean, 1962) indicates that considerable re- 
search will be performed in order to establish 
standards for taste and odor-producing con- 
stituents commonly found in water used for 
domestic supply. The scales developed make 
possible an interlocking consumer evaluation- 
taste panel approach to the assessment of 
flavor in domestic water. Results from the 
scales may be used to recommend a limiting 
standard in terms of an upper limit of ac- 
ceptability whenever the threshold approach 
is not feasible (Bruvold et al., 1967). When 
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the threshold approach is both desirable and 
feasible, assessment of detectable flavor by 
consumers or taste panel judges will provide 
fuller knowledge of the effects of the con- 
stituent upon water quality. Such additional 
information would create a more complete 
data base for recommending the desired stan- 
dards by indicating the suprathreshold con- 
centration at which a particular constituent 
in water becomes unacceptable for daily 
drinking. 
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HOW TO SUCCEED IN BUSINESS ACCORDING TO 


BUSINESS STUDENTS AND MANAGERS 


BERNARD M. BASS 


Management Research Center, University of Rochester 


The Organizational Success Questionnaire was developed to see the extent 
graduate business students and middle managers accept the opinions of the 
social theorists and the political theorists about what it takes to succeed in 
large organizations. 6 factors emerged in a 1st analysis but failed to remain 
sufficiently independent. Consequently, a social approach scale was constructed 
with a mean internal consistency reliability of .72 and a mean retest reliability 
of .52. A political scale yielded similar reliabilities. Slight but meaningful rela- 
tions were found between scale responses and responses to other inventories of 
orientation and value. Both social and political approaches are needed, fairly 
or very often, for success according to the respondents (the social somewhat 
more than the political). But initially students favored less activity of either 
kind compared to middle managers. Following training the managers remained 


unchanged but the students became more like the managers. 


When your boss asks you to push your sub- 
ordinates to work harder and your subordi- 
nates complain they already have too much 
to do, what do you do? What do you think 
the successful executive does? Faced by these 
and countless other conflicts of interests 
among those above, below, and next to you 
in the system, do you concentrate on being 
open, trusting, and communicative, as prob- 
ably would be recommended by most socially 
oriented management experts such as Gibb 
(1964), Argyris (1962), or Bennis (1964); 
or, on the contrary, do you become the astute 
dissembling politician, maintaining your dis- 
tance, controlling the outflow of information, 
indulging in minor insincerities, and making 
superficial compromises, as probably would be 
recommended by most politically oriented 
management experts, such as Martin and 
Sims (1956), Jameson (1945), or Pfiffner 
(1951)? Gibb argues that before effective in- 
terpersonal decision-making procedures can 
be developed that lead to real commitments, 
a trusting relationship must be established 
between the persons. Argyris gives highest 
priority to “owning up” to one’s own feelings 
and needs. On the other hand, Jameson em- 
phasizes that to be effective in such circum- 
stances, one needs to wear a “mask,” while 
Pfiffner calls for hiding one’s own feelings, 
engaging in benign chicanery, and being a 
good actor. The present investigation set out 
to develop a brief attitude scale which would 


make it possible to see how managers and 
students felt about the issue, and what ef- 
fects a particular training program had on 
these attitudes. 


DEVELOPMENT OF THE ORGANIZATIONAL 
SUCCESS QUESTIONNAIRE 


As a first step in development of an attitude 
scale, that is, the Organizational Success 
Questionnaire (OSQ), 200 undergraduate stu- 
dents enrolled in an elementary psychology 
section were asked to complete a 16-item 
questionnaire under the following instructions: 

People differ in what they think it takes to get 
ahead today in most large organizations. Listed be- 
low are ways of behaving that may or may not be 
important for success. Indicate how frequently you 
think managers, staff members, or administrators 
ought to behave as described in order to be successful. 

Eight items were composed primarily from 
an article by Martin and Sims (1956) on the 
utility of the political approach and eight 
other statements were drawn from the writings 
of Gibb (1964) advocating the social ap- 
proach. 

A typical item was: 

Successful persons try to establish in advance mu- 


tually satisfactory objectives with others with whom 
they must work. 


5 —— alwaysl1] 
4 __ very often 


1 The frequency scale was based on a magnitude 
estimation study of 28 adverbs of frequency. Each 
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—— fairly often 
— sometimes 
— seldom 

—— never 


OrN W& 


Factor Analysis 


The 16-item questionnaire was given to 107 
graduate business students. A principal com- 
ponents factor analysis with Varimax rotation 
yielded six factors which accounted for 42% 
of the variance after 29 iterations (Table 1). 
All loadings of items on the factors above .40 
are italicized in Table 1. The six factors ob- 
tained were: 


I. Emphasize personal gain. 
II. Emphasize organizational gain. 
III. Share decision making. 
IV. Emphasize candor, openness, trust. 





of 71 undergraduate students had been asked to as- 
sign a number of his own choosing to “sometimes,” 
then to indicate what number would best fit each 
of the 28 other adverbs. On the average, in relation 
to “sometimes,” “always” was seen as 2.533 times as 
frequent; “very often” was 2.093 times as frequent; 
“fairly often” was 1.683 times as frequent; “seldom” 
was .425 times as frequent and “never” was .000. 
Thus, these particular six adverbs of the 28 studied 
bore an approximate relation to each other of 
5:4:3:2:1:0 and were selected as the response al- 
ternatives for OSQ items. 
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V. Bluff, obscure issues. 
VI. Maintain social distance and preroga- 
tives. 


For a new sample of 120 graduate business 
students, factor scores were calculated using 
the items that correlated most highly with 
each factor to obtain six factor scores for 
each student. Table 2 shows the correlations 
among the factors, as well as correlations with 
retests scored the same way. It can be seen 
that Factors I, V, and VI—dealing with per- 
sonal gain, bluffing, and social distance— 
clustered together, while Factors II, ITI, and 
I1V—dealing with organization gain, sharing 
decisions, and trust—negatively related to 
the cluster. 

The test-retest reliabilities ranging from .35 
to .48 indicated that the number of items con- 
tributing to each factor score would have to 
be multiplied many times to achieve reliable 
assessments of individual attitudes, factor by 
factor. 


Relations of the Factor Scores with Other 
Assessments 


The six factor scores of 113 business 
students were correlated with a variety 


TABLE 1 


PRINCIPAL COMPONENTS Factor ANALYSIS (DECIMALS OMITTED) OF 16 OSQ ITEMs ANSWERED 
BY 107 GRADUATE BUSINESS STUDENTS 























Factor 
Item I2 
I II iil IV Vv VI 

1. Listen only when you ask —20 DD) 29 32 06 —07 54 
2. Level, be candid 34 —14 16 i 08 05 50 
3. Withhold information 72 02 —04 23 —01 —04 57 
4, Share decisions —08 11 Te 07 —02 —18 58 
5. Bluff 21 15 —08 09 75 27 72 
6. Make allies 76 —16 14 —12 01 09 64 
7. Commit yourself fully 06 21 —17 75 12 —08 67 
8. Set mutual objectives 18 18 40 LS —41 08 42 
f 9. Retard and delay 39 06 —19 18 —30 46 52 
10. Push people to limits 01 77 —05 —14 —13 10 64 
11. Maintain distance —02 02 07 04 10 84 73 
12. Foster mutual trust —02 —06 13 69 —27 24 62 
13. Compromise, but delay 61 10 as 09 —40 35 68 
14. Hold group discussions 16 —10 65 —08 —07 35 59 
15. Set clear goals 33 28 03 02 —65 21 65 
16. Organization comes first 56 od 08 14 02 —13 62 




















Note.—All loadings of items on the factors above 40 are italicized. 


Factor I = Emphasize personal gain; II = 


Emphasize organizational gain; III = Share decision making; IV = Emphasize 


candor, openness, trust; V = Bluff, obscure issues; VI = Maintain social distance and prerogatives. 
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TABLE 2 
INTERCORRELATIONS AMONG THE FACTOR SCORES 
Correlation 
, 7 , with same 
Factor I Lt III IV V VI fackaeee 
retest 
I. Personal —.13 —.13 — .23** PS Gee Om A4** 
II. Organizational allifs 20g —.24** — .14* aon 
III. Share 00 — .22** —.12 oon 
IV. Trust — .22** — .20 ae 
V. Bluff oan 45% 
VI. Distance A8** 

















Note.—N = 120. 
*p <.05,r = .14 for 118 df. 
rey Olin = 2d fon 11S df. 


of other assessments. Results are shown in 
Table 3. 

Other assessments. Verbal and numerical 
aptitude were measured on the Admissions 
Test for Graduate Schools of Business. Self-, 
interaction, and task orientation were as- 
sessed by the Orientation Inventory (Bass, 
1962), a Kuder-type inventory of 27 triads. 
Values were assessed by the Allport-Vernon 
Study of Values (Allport, Vernon, & Lindzey, 


1960), and budgeting decisions were examined 
by Exercise Objectives, an exercise in which 
participants each choose whether or how to 
spend money on each of five problems: safety, 
labor relations, management development, 
product development, and stream clean-up. 
For each problem, one could choose a “soft” 
social solution, that is, fix equipment to avoid 
a possible accident, or a “hard” economic 
solution, that is, let the machine operator be- 


TABLE 3 


INTERCORRELATIONS OF OSQ FAcTOR SCORES 


AND ASSESSMENTS OF ABILITY AND VALUES 











Factor scores 








Assessment 
I II Te IV V VI 
Verbal aptitude (ATGSB) .00 05 —.12 —.01 .10 .06 
Numerical aptitude (ATGSB) .00 —.09 — .24** 03 .06 —.03 
Greater human and social con- — .20* .19* roi si: —.17* —.25** 
cern in five budgeting decisions 
Orientation Inventory 
Self-orientation 14 —.15* —.14 —.24** 05 —.02 
Interaction orientation .06 .08 .08 —.03 07 .04 
Task orientation —.18* .04 —.01 .20* —.10 —.01 
Allport-Vernon Study of Values 
Theoretical —.13 —.14 — .25** .20* 02 .O1 
Economic als —.260** —.15* —.07 .08 nl Os 
Aesthetic —.11 .08 a0) —.05 .08 —.13 
Social —.05 .10 a5 .16* —.09 —.08 
Political .19* 08 —.04 —.09 — .02 ALS) 
Religious .04 pS BIS .00 03 —.12 








_Note.—Factor I = Emphasize personal gain; II = Emphasize organizational gain; III = Share decision making; IV = Em- 
phasize candor, openness, trust; V = Bluff, obscure issues; VI = Maintain social distance and prerogatives. 


Kp ODsge = tonton dil dare 
repo OL fee eee) TOT 1idvasa 
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ware. The total number of ‘‘Social” choices 
was calculafed and correlated with OSQ re- 
sponses. 

Several other exercises of the Program of 
Exercises (Bass, 1967) were also scored 
against OSQ responses, but failed to yield 
significant or meaningful patterns of results, 
possibly because of the low reliabilities of the 
factor scores. 

OSQ and aptitude. Numerical aptitude cor- 
related —.24 with sharing. No other sig- 
nificant relations were observed. 

OSQ and orientation. From Table 3 it can 
be seen that self-orientation on the Orienta- 
tion Inventory—the tendency to seek recogni- 
tion and personal rewards and to prefer to 
work alone rather than in groups was nega- 
tively related (—.15, —.14, and —.24) with 
OSQ Factors II, III, and IV, concerned with 
emphasizing organizational gain, sharing de- 
cisions, and fostering trust. On the other hand, 
task-orientation—interest in the challenge of 
the work to be done—was positively associated 
(.20) with the OSQ trust score (IV) and also 
unlike self-orientation was negatively related 
(—.18) with the OSQ factor score for per- 
sonal gain (1). 

OSQ and values. Theoretical value on the 
Allport-Vernon—appreciation of understand- 
ing—was negatively associated (—.25) with 
sharing of decisions (III) but positively 
(.20) with trust (IV). Valuing economic ac- 
tivity was positively associated with emphasiz- 
ing social distance (.19) and negatively with 
emphasizing organizational gain (—.26) and 
sharing (—.15). Contrary to expectations, 
strong social values were only slightly related 
to belief in the importance to success of shar- 
ing (.15) and trust (.16), while strong politi- 
cal values were only slightly related to em- 
phasizing personal gain (.19) and social dis- 
tance (.15) and not at all to bluffing. Strong 
religious values correlated .15 with emphasiz- 
ing organizational gain. While relations here 
were in the direction expected, it can be seen 
that the brief OSQ factor scores could in no 
way substitute for the much more reliable All- 
port-Vernon Study of Values and would be 
severely limited to survey and research ap- 
plication. 

OSO and budgeting decisions. Most interest- 
ing are the positive correlations of .30 and .25 


ZSi 


between the tendency to make “social” bud- 
geting decisions and the OSQ Factors III and 
IV concerning sharing decisions and fostering 
trust. Correspondingly, there were three nega- 
tives correlations, of =—.20,"—.17; and —.25 
between these “soft” decision tendencies and 
belief in the utility of political approaches to 
organization emphasizing personal gain (I), 
bluffing (V), and social distance (VI). 


Rescoring 


The unreliability of the factor scores, their 
tendency to cluster rather than remain inde- 
pendent, and the patterns of their correlations 
with other assessments led to the pooling of 
items from the social factor scores into one 
social approach score and to the pooling of 
political factor items into one political ap- 
proach score.” 

Twelve of the original 16 items were clus- 
tered into two composites, a social and a 
political, to form the final scale. 

The four items which were eliminated 
loaded highly on more than one factor or 
loaded only on Factor II dealing with or- 
ganizational gain (which could be pursued 
politically or socially) or loaded highly nega- 
tively on a factor. 

The final set of 12 items and whether each 
is scored as social (S) or political (P) are 
shown in Figure 1. 


Scale Reliabilities and Intercorrelations 


For a total of 196 cases, test-retest relia- 
bilities were .52 and .51, respectively, for the 
social and political scores. The correlations 
between the social and political scales were 
A9, .63, .33, and .38, respectively, for samples 
of 200 undergraduates, 24 night business 
students, 89 day business students, and 83 
middle managers. Thus the same respondent 
who calls for more frequent social approaches 
to be successful also is likely to call for more 
frequent political approaches. How much of 
this may be mere rating bias is unknown. 


2For two reasons, the alternative of developing 
six longer, more reliable, and possibly more dis- 
criminating factor scales was rejected. First, the 
social and political dichotomy was in line with the 
rationale with which the author had begun. Second, 
the resulting lengthy questionnaire would be much 
more difficult to administer in the future to large 
samples. 
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ORGANIZATIONAL SUCCESS QUESTIONNAIRE 


People differ in what they think it takes to get 
ahead today in most large organizations. Listed be- 
low are ways of behaving that may or may not be 
important for success. Indicate how frequently you 
think managers, staff members, or administrators 
ought to behave as described in order to be success- 
ful. 

Use the following scale to indicate your opinion: 


5—They always ought to do this. 
4—They ought to do this very often. 
3—They ought to do this fairly often. 
2—They ought to do this sometimes. 
1—They seldom ought to do this. 
O—They never ought to do this. 


. Level with others; be open, frank and can- 
did in their communications with others. 

. Withhold the release of information or 
time its release for when it will do the most 
good. 

. Share in decision-making with their sub- 
ordinates whenever possible. 

. Act confidently when they are personally 
unsure about matters or when they lack 
relevant information. 

. Make political alliances with superiors and 
subordinates to foster and protect mutual 
interests. 

. Completely and openly commit themselves 
to a position or program. 

. Try to establish in advance mutually satis- 
factory objectives with others with whom 
they must work. 

. Initiate actions which they are personally 
against but retard and delay carrying out 
the actions so that the actions are in 
progress but never completed. 

. Maintain social distance; remain aloof, de- 

tached, uninvolved with others; always 

remain the boss when interacting with sub- 
ordinates. 

Foster mutual trust with others. 

Openly compromise, yet privately divert or 

delay compromise plans so that their own 

aims will be pursued despite the stated 
compromise. 

. Arrange for their superiors and their sub- 
ordinates to meet together to encourage 
group discussions among others above and 
below them in the organization with easy 
participation by all. 


S 10 
Pal le 


Fic. 1. Organizational Success Questionnaire with each 
item scored as social (S) or political (P). 
EFFECTS OF ORGANIZATIONAL EXPERIENCE 


Before they entered training programs? in 
organizational psychology, OSQ data were col- 


3 The training program focused on completing 10 
small group exercises dealing with budgeting, expecta- 
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TABLE 4 


How To SuccEED IN Business ACCORDING TO GRADU- 
ATE BUSINESS STUDENTS AND MippLE MANAGERS 
BEFORE AND AFTER TRAINING 





Mean attitude 























Sample - 
p Brat Post- Adjusted 
training | training i 
Social items 
83 graduate business 3.62 3.82 3.90 
students (days) 
24 graduate business 3.56 3.84 3.96 
students (night) 
89 middle managers 4.02 4.07 3.93 
Political items 
83 graduate business 3.30 3.26 S302 
students (day) 
24 graduate business SL 3.60 S18 
students (night) 
89 middle managers 3.61 3.69 3.59 


lected from 89 graduate business students, 
most of whom had had no industrial experi- 
ence; these data were contrasted with a sam- 
ple of 24 night students who had had an aver- 
age of approximately 5 years of experience 
and a sample of 89 upper-middle managers 
of large firms who had had about 20 years of 
experience, on the average. 

Table 4, Column 1, contrasts the mean dif- 
ferences found between the three samples on 
social approach items, then again on political 
approach items. All social means were signifi- 


tions, compensation, life goals, supervision, group 
dynamics, planning, communication, negotiations, and 
evaluation. Lectures and T-groups were interwoven 
in the program. Typically, discussions following 
each exercise noted that there was no one best way 
to manage and that a manager had to balance many 
objectives, be flexible in relating to subordinates, 
cope with the conflict between individual participa- 
tion and organizational demands, etc. Neither a 
complete social nor a complete political point of 
view was advocated. For more detail, see Bass (1967). 
The students completed the approximately 40-hour 
program in 15 weeks; the managers, in 1 week. 
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cantly higher than the corresponding political 
means for the same samples. 

Table 5, Column 1, contains the two ap- 
propriate analyses of the variance of these 
means. In both cases, the obtained results 
could not be accounted for by chance. Middle 
managers felt that to be successful in large 
organizations required greater frequencies of 
the social as well as the political approaches 
described by the items. The mean social score 
for managers was 4.02, while it was 3.62 and 
3.56, respectively, for the student samples. 
What managers saw as needed very often was 
perceived to be needed between fairly and 
very often by the students. Somewhat lower 
frequencies of political activity were per- 
ceived as needed, but again, managers (3.61) 
felt more of this activity was required for 
success than did the graduate students (3.30 
and 3.17). The correlation of social and politi- 
cal approach means among the three groups 
probably reflected to some extent the correla- 
tion mentioned earlier among the individuals 
within the groups. 

To sum up, prior to training, experienced 
middle managers called for more frequent use 
of social and political approaches for success. 
All three groups favored somewhat greater 
use of the social approaches compared to the 
political (3.62 versus 3.30, 3.56 versus 3.30, 
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and 4.02 versus 3.61). All three samples were 
a bit more in agreement with Gibb, Bennis, 
and Argyris than with Martin, Sims, Jameson, 
and Pfiffner. Yet, they subscribed almost as 
much to the latter men’s point of view as 
to the former. 


Specific Differences (before Training) 


Table 6 shows how the three samples dif- 
fered on each of the 12 items studied. Each 
of the 12 items yielded a standard deviation 
of .70 to 1.55 with a mean standard deviation 
of 1.07. Such results signified that almost the 
whole scale from 0 to 5 was used by re- 
spondents. The estimated standard errors of 
the displayed means ranged from .050 to .076 
suggesting that a difference between samples 
on any of these two means was significant at 
the 1% level if it was as large as from .128 
to .285. 

All groups were least disposed toward bluff- 
ing, that is, acting confident even when unsure 
about matters. Probably this political item 
was the least socially acceptable behavior on 
the list of 12 items. At the same time, the 
three samples as a whole were most supportive 
of the value of fostering mutual trust. 

The widest divergence of opinion between 
the three samples for the various social ap- 
proach items occurred in the question about 


TABLE 5 


ANALYSES OF VARIANCE AND COVARIANCE OF THE SOCIAL AND POLITICAL ScoREs on OSQ 
or GRADUATE BusINESS STUDENTS AND MIppLE MANAGERS 








First analysis 


Second analysis 


























Source oes 
jus 
a = te posttraining SS 16S Gi 
Social 
Between samples 2 34,52 103.5 3.28 1.64 2 
Error 193 459.31 14.2 1543.31 8.04 192 
Total 195 493.83 ne oge 1546.60 Pe) 20 194 
Political 
Between samples 2 49.20 197.6 171.00 85.50 2 
Error 193 292.07 9.1 2698.36 14.05 192 
Total 195 341.27 eer 2869.40 F= 6.08* 194 
*p <.05 
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TABLE 6 
SPECIFIC DIFFERENCES BETWEEN GRADUATE BUSINESS STUDENTS AND MIDDLE 
MANAGERS BEFORE AND AFTER TRAINING 
89 graduate 24 graduate 9 midd 
Approved ere one business business stu- sen All (V = 196) 
6 students (day) | dents (night) 
Social 
Level, be candid Before 3.69 3.46 4.20 3.88 
After See 3.85 4.04 3.86 
Share decisions Before 3.19 3.50 4.17 3.64 
After 3.76 4.02 4.23 4.08 
Commit oneself fully Before 3.20 Sale 3509) 3.36 
After 2.98 3.38 3.83 3.38 
Set mutual objectives Before 4,28 3.96 4.35 4.27 
After 4.36 4.29 4.25 4.29 
Foster mutual trust Before 4.58 4.58 4.75 4.65 
After 4.64 4.73 4.86 4.73 
Hold group discussions Before 2.76 DS 3.06 2.89 
After 3.45 3.28 SD 3.28 
Political 
Withhold information Before 2.85 2.46 3L25 2.87 
After 3.13 3.63 3.61 3.40 
Bluff Before 1.98 Oe 2.33 25) 
After 2.06 2.38 2.25 2.18 
Make alliances Before 3.42 3.04 4.01 3.62 
After 3.26 3.38 4.13 3.64 
Retard and delay Before 4.27 4.17 4.37 4.30 
After 3.99 4.13 4.28 4.13 
Maintain distance Before 3.07 3.00 Sol 3.16 
After 3.12 3.83 3.22 3125 
Compromise but divert Before 4.21 4.17 4.39 4,28 
After 3.97 4.29 4.61 4.29 





Note.—S = always; 4 = very often; 3 = fairly often; 2 = sometimes; 1 = seldom; 0 = never. 


the need to share decision making with sub- 
ordinates whenever possible. The experienced 
managers saw this as needing to be done very 
often (4.17), the night students with some 
experience felt less strongly about this (3.50), 
while the experienced day students called for 
this action only fairly often (3.17). Again, 
managers were most in favor of leveling, 
openness, candor, and full commitment com- 
pared to the students. 

On political approaches the managers and 
students were widest apart on the utility of 


forming political alliances to suit personal 
ambitions. The managers saw this as needed 
very often (4.01), the students less so (3.04 
and 3.42). The managers also saw the need 
to withhold information more _ frequently 
(3.25) than did the students (2.46 and 2.85). 


EFFECTS OF TRAINING IN ORGANIZATIONAL 
PsycHOLOGY 
Table 4, Column 2, shows the actual mean 


scores of the three samples following training, 
and Table 4, Column 3, shows the same 
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mean scores after the effects have been sub- 
tracted from them of the differences among 
the participants prior to training. Table 5 dis- 
plays the appropriate covariance analysis of 
these adjusted posttraining means. From these 
results we infer that the graduate students, 
who initially had been significantly (F = 
7.28) lower than the managers in calling for 
social approaches, increased in this point of 
view during training while the managers re- 
main unchanged. At the conclusion of train- 
ing students had become like the managers 
(F = .20) on this dimension. With reference 
to political approaches, both day students as 
well as the managers remained unchanged in 
outlook, but as a consequence of training 
more experienced night students increased in 
their belief in the efficacy of political activity 
so much so that their actual final scores on 
this dimension matched those of the managers 
and their adjusted mean surpassed that of the 
managers. 


Specific Differences (after Training) 


Table 6 shows the responses to each of the 
12 OSQ items after as well as before the stu- 
dents and managers completed the course 
which included the Program of Exercises in 
Management and Organizational Psychology. 

Two multivariate analyses of covariance 
were completed on these data. In the first, 
the final six social approach responses were 
adjusted according to their multiple correla- 
tions with the initial social and political re- 
sponses of the trainees. In the second analysis, 
the final six political approach responses of 
the trainees were adjusted according to their 
multiple correlations with the initial social and 
political responses of the trainees. In both 
analyses there remained at the .01 level of 
confidence significant sources of variance due 
to whether the trainees were day or night 
graduate students or middle managers. The 
respective F's were 3.85 and 2.54 with 12 and 
352 degrees of freedom for each analysis.* 

Social analysis. The most interesting shifts 
in stress on social approaches occurred with 
reference to sharing decisions and holding 


4For a discussion of the multivariate analysis of 
covariance see W. W. Cooley and P. R. Lohnes, 
Multivariate procedures for the behavioral sciences. 
New York: Wiley, 1962. 
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group discussions. It can be seen that although 
training raised all samples on how much they 
thought sharing decisions with subordinates 
was necessary, the students were affected 
much more than the managers. The student 
means rose toward those of what the managers 
had been. On the other hand, while there was 
a sharp increase by students in emphasis on 
holding group discussions, the comparable in- 
crease for managers was so much smaller that, 
after training, students stressed this more 
than did managers, a reversal of their respec- 
tive positions before training. 

Political analysis. All three samples in- 
creased significantly in their valuing the util- 
ity of withholding information or timing its 
release as a consequence of the training ef- 
fort. Bluffing remained as unpopular as be- 
fore training. While inexperienced students de- 
creased their support for forming alliances, 
experienced students and managers increased 
in their favoring this approach. There was a 
slight overall decline in all samples in valuing 
delaying tactics. Among the night graduate 
students, there was a sharp increase (3.00- 
3.83) in the opinion that maintaining social 
distance and aloofness was important for suc- 
cess in the large organization; the other two 
samples were unchanged. Finally, the middle 
managers seemed to have increased in their 
support of open compromise but secret diver- 
sion of plans. 


CONCLUSIONS 


A meaningful consistent set of responses 
emerge when students and managers are asked 
to indicate what they think it takes to succeed 
in large organizations. On the whole, they see 
that the fellow who gets ahead is one who 
adopts fairly or very often both social and 
political approaches, the former somewhat 
more than the latter. He is one who will try 
to foster trust, share in the decisions with 
others, and consider the organization’s needs. 
At the same time, to a somewhat lesser extent 
he is ready when necessary, to maintain his 
distance and his prerogatives, to act confident, 
even if he is not sure of himself, and to con- 
sider his own personal advantage. 

Orientation and values, per se, only slightly 
relate to these responses, that is, self-oriented 
respondents favor more political activity, and 
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the attitudes reflected in these responses tend 
to account for whether socially acceptable 
rather than economically determined budget- 
ing decisions will be made. 

Middle managers tend to see more of both 
the social and political approaches required 
of the successful man in the large organization 
than do graduate business students, but gen- 
erally the students become more like the man- 
agers as a consequence of a training program 
in organizational psychology while the man- 
agers seem less affected by the program. 
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JOB SATISFACTION AMONG GRADUATE STUDENTS: 
INTRINSIC VERSUS EXTRINSIC VARIABLES 


EDWARD L. LEVINE anp JOSEPH WEITZ 1 


New York University 


A survey of “job” satisfaction was conducted among graduate students in 2 
universities. A major source of dissatisfaction at both universities was found 
to be student voice in influencing department policy. When the responses were 
factor analyzed, 1st- and 2nd-order factors did not indicate an intrinsic or ex- 
trinsic dimension, nor did the findings support a 2-factor theory of “job satis- 
faction.” Furthermore, the alternative to the 2-factor theory, namely that in- 
trinsic variables are more importantly related to measures of overall satisfaction, 
was similarly not supported. The conclusion is that both the 2-factor theory 
and its alternative are oversimplifications. Finally, the survey results of general 


interest are presented. 


As in so many recent studies, much of our 
analysis takes departure from the two-factor 
theory of job satisfaction outlined by Herz- 
berg, Mausner, and Snyderman (1959). The 
results of tests of the theory, which was far 
from exact to begin with (Ewen, 1964), have 
been mixed, for and against. Burke (1966) 
offers an objective and fairly thorough sum- 
mary of much of the relevant literature, while 
Herzberg (1966), in his more recent book, 
offers studies supporting the theory and criti- 
cizes, sometimes effectively, the research 
methodology employed by the theory’s op- 
ponents. 

One of the main objections to the two- 
factor theory is that it is method bound. Most 
studies which support the theory utilize the 
same open-ended “critical incidents” approach 
employed in the original study (cf. Herzberg, 
1966, and the review by Hulin, 1966b). On 
the other hand, evidence against the two- 
factor theory has come from several sources— 
studies of dimensionality, including Q analy- 
ses (Dunnette, Campbell, & Hakel, 1967), 
factor analysis (Friedlander, 1963; and, as a 


1 The authors wish to thank Byron Fiman, Jeffrey 
Greenhaus, Robert Schoen, Theodore Venet, and 
Virginia Schein who helped construct the question- 
naire and collect the data. 





more direct test, Graen, 1966b), rotated prin- 
cipal components (Ewen, 1964), Coombs’ un- 
folding technique (Burke, 1966), and studies 
utilizing the overall satisfaction measures 
neglected by Herzberg et al. (1959) in their 
original study. Among the latter, analysis of 
variance (Friedlander, 1966; Graen, 1966a), 
multiple regression (Halpern, 1966), and a 
series of hypothesis tests by Ewen, Smith, 
Hulin, and Locke (1966) have been employed 
in evaluation of the two-factor theory. 

By way of results, the dimensionality 
studies have not succeeded in establishing any 
clear-cut intrinsic-extrinsic dichotomy. Fur- 
thermore, the other group of studies, along 
with evidence presented by Wernimont 
(1966), Dunnette et al. (1967), Hinrichs and 
Mischkind (1967), and Mischkind (1967), 
have revealed the weaknesses of the two-factor 
theory and have pointed to an alternative 
which postulates that intrinsic (to the job) 
variables are more important in producing 
overall satisfaction, as well as overall dis- 
satisfaction. 

Our study, a survey of job satisfaction 
among graduate students majoring in psy- 
chology, proposes to extend the work of these 
investigators. We intend to do a first- as well 
as a second-order factor analysis, as an addi- 
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tion to Ewen’s (1964), Friedlander’s (1963), 
and Graen’s (1966b) work, in the supposition 
that an intrinsic-extrinsic dichotomy may 
emerge in second-order factors. 

Moreover, Halpern’s (1966) test of the 
two-factor theory, in which he found that 
motivators account for most of the variance 
in overall satisfaction with a satisfied sample, 
has been vitiated to an extent by the emer- 
gence of the above-mentioned alternative the- 
ory. His results support the two-factor theory, 
as well as an alternative theory. The two- 
factor theory predicts that for people on the 
satisfied end of the continuum, “motivators” 
should be more strongly related to overall 
satisfaction than “hygienes.” On the other 
hand, for people on the dissatisfied end of the 
continuum, “hygienes” should explain more 
variance in overall satisfaction. 

The alternative theory, which might be 
called the two-factor-plus-importance theory, 
predicts that intrinsic variables should relate 
more strongly to overall satisfaction than 
extrinsic variables regardless of the level of 
overall satisfaction; that is, they are more 
important. 

Our results will bear on the two-factor-plus- 
importance prediction, since, in an extension 
of Halpern’s (1966) test to determine whether 
intrinsic variables explain more variance in 
overall satisfaction, we do not restrict our 
sample only to respondents who are satisfied 
overall. 

Survey data are rich in analytic possi- 
bilities, and we would be remiss in reporting 
only the results bearing on the controversial 
issues just considered. One interesting area we 
plan to explore is that of the moderator 
variable. 

Recently, moderator variables have come 
into prominence as a way of improving pre- 
dictions (Ghiselli, 1963; Weitz, 1966), and 
we will employ traits assumed to covary with 
sex for this purpose. 

Hypothesizing that females are more so- 
cially oriented than males, and males more 
“independent,” the predictions are: for fe- 
males, satisfaction with social life, satisfac- 
tion with the opportunity for instructor-stu- 
dent discussion, and satisfaction with their 
social interaction with fellow students will be 
more highly correlated with overall satisfac- 
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tion than for males. For males, satisfaction 
with the voice they have in influencing de- 
partmental policy, satisfaction with their in- 
come relative to other graduate students, and 
satisfaction with the opportunity for inde- 
pendent thought and action will be more 
highly related to overall satisfaction, than for 
females. 

Finally “raw” survey results considered of 
interest to the scientific community will be 
presented. Graduate students, of course, will 
become the psychological community of the 
future, and their present attitudes toward 
their training, their projected careers, their 
mentors, may play a large role in determin- 
ing the characteristics of that community. To 
the extent that our results are generalizable, 
some insights may be obtained from the 
present data. 


PROCEDURE 
The Questionnaire 


The Graduate Student Questionnaire was designed 
in the following manner: 


1. Of the 78 items on the questionnaire, 35 covered 
a number of specific areas of possible satisfaction or 
dissatisfaction along with overall satisfaction. For 
the majority of the items, a 7-point scale was used, 
ranging from very satisfied to very dissatisfied. Of 
the 35 items, 5 were overall measures, for example, 
“How satisfied are you with your overall graduate 
training?”; 7 items were rated intrinsic to the stu- 
dents’ work, for example, “How satisfied are you 
with the intellectual stimulation provided by your 
graduate training?”; 19 items were rated extrinsic to 
the students’ work, for example, “How satisfied are 
you with your social life?” The remaining 4 items 
could not be classified into any of the above cate- 
gories, for example, “How satisfied are you with 
your decision to become a psychologist ?” 

The classification of items into intrinsic, or mo- 
tivator, variables and extrinsic, or hygiene, variables 
followed the scheme presented by Herzberg et al. 
(1959). A consensus of opinion among the authors 
and our colleagues (see Footnote 1) was required for 
the assignment of items to each of the categories 
mentioned. Where consensus could not be reached, 
or where the item did not lend itself to any meaning- 
ful classification, no assignment was made. 

2. Five items asked what the respondent’s primary 
career goals were. The item format specified that the 
respondent should divide the 100% of his work time 
among the categories “teaching,” “research in an 
academic setting,” “research in a non-academic set- 
ting,” “professional practice,’ “administration,” and 
the ubiquitous “other.” 

3. The respondents were also asked to suggest the 
foundation courses which all students in psychology 
should take. 
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4. A number of other items covered a variety of 
topics such as respondent’s satisfaction with his 
grades, to how many other schools he applied and 
was accepted, whether his present university was his 
first choice, and whether, if given another chance, 
he would still choose his present university. 

5. Another block of items listed certain profes- 
sional skills a psychologist might need in his work, 
for example, experimental design, statistical analysis, 
etc. The respondent was asked whether this skill was 
important or not important to him, and whether he 
was satisfied or dissatisfied with his training in that 
skill. 

6. There were two open-ended questions concern- 
ing what the student liked best and least about his 
graduate training (‘““Bouquets” and “Brickbats’’). 

7. The last group of items concerned the follow- 
ing demographic characteristics: major area of 
study, sex, age, marital status, student status, years 
pursuing graduate study, form of university financial 
assistance, if any, distance from place of residence 
to the graduate school, which university the re- 
spondent was attending, and whether or not he was 
a matriculant only (taking no courses), or a full-time 
student. 


This questionnaire, along with a cover letter which 
explained the purposes of the questionnaire and gave 
its sponsorship, was mailed to all psychology gradu- 
ate students at two universities. Anonymity was 
guaranteed. 


Sample 


The sample consisted of graduate students enrolled 
in the psychology department at two universities, 
ene rather large graduate department, and one rela- 
tively small. 

While the total response rate for the larger uni- 
versity was 53%, the primary results to be pre- 
sented here will be on 93 cases, all of whom are 
full-time students taking courses. This represents 70% 
of the respondents. 

For the small university a 61% return was ob- 
tained. 


RESULTS ? 


Tot. refers to the total group in University 
A taking courses, V = 93. 

MO refers to the group maintaining matric- 
ulation only, VN = 40, at University A. 

B refers to the second University, N = 19. 


Factor Analysis 


Responses of Tot. on the 35 items concern- 
ing specifics of graduate training and overall 
satisfaction were subjected to a factor analy- 


2 The authors gratefully acknowledge the contribu- 
tion of John D. Hundleby to the analyses reported. 
Tom Keenan, Chuan-Yu Chen, and Thanton Porter 
helped prepare the data for computer processing. 
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sis. The method of principal axes was used 
with an Equamax rotation.* The orthogonal 
factors which emerged were then rotated so 
that they became oblique by a Promax rota- 
tion (Hendrickson & White, 1964), and sec- 
ond-order factors were extracted by Varimax 
(Kaiser, 1958). Squared multiple correlations 
were placed in the diagonals, and extraction 
was halted when the number of loadings of 
items on a factor above .30 was less than two. 
That the number of factors extracted in the 
first order is the correct number is given 
credence by the finding that the number ex- 
tracted would have been the same if another 
criterion independent of this one had been 
used, that is, the number of eigenvalues of the 
correlation matrix, with unities in the di- 
agonals, greater than one. As many second- 
order factors were extracted and rotated suc- 
cessively as first-order factors. An upper- 
bound estimate of the reliability of these 
items, as given by the Spearman-Brown for- 
mula, using the average item intercorrelation 
(Guilford, 1954), was .91. To the extent that 
the assumptions of the formula are not met, 
the coefficient is overstated. However, Guil- 
ford (1954, p. 378) points out that a coef- 
ficient above .60 will generally not be more 
than .05 in error, and reliabilities of this mag- 
nitude are not extraordinary for items of this 
type (Edwards, 1957). 

Missing data were handled by ignoring 
those pairs of observations where one or both 
observations in a pair were not included. The 
items containing missing data were primarily 
concerned with assistantship jobs and could 
not, therefore, be expected to apply to every- 
one. However, none of the Ns was lower 
than 70. 

The factors themselves emerged as follows: 
(Only loadings above .30 are considered in 
view of the small V. Furthermore, the items 
loading on each factor will be listed by their 
questionnaire number, and following each 
item in parenthesis will be the designations I 
for intrinsic, E for extrinsic, O for overall, 
and X for unclassifiable measures, as well as 
the factor loadings. ) 


Factor 1. General Satisfaction—measures 
satisfaction with: 


3 David Saunders has devised this criterion. 
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65. Graduate student life, everything con- 

sidered (O, .74) ; 

1. Overall graduate training (O, .71); 

. Preparedness to achieve goals (O, .64) ; 

6. Decision to become a psychologist (X, 

0) 

The degree to which the department 

lived up to expectations (O, .57); 

Quality of instruction (E, .56); 

The training in major area of interest 

(O, .52); 

3. Anticipated future as a psychologist (I, 
50)$ 

7. Intellectual stimulation from graduate 
training (I, .48); 

8. The intellectual climate in the depart- 
ment (E, .41). 


Factor 2. A complicated factor which loaded 
in order of magnitude on satisfaction with: 


Zon 


Shoe 


10. 
32. 


Academic performance of fellow stu- 
dents (E, .79); 

26. Intelligence of fellow students (E, .74) ; 
9. Prestige of department (E, .59); 

8. Intellectual climate in the department 
(E, .51); 
Work and study interaction with fellow 
students (E, .45); 

7. Intellectual stimulation from training 
(I, .45); 
The degree to which the department 
lived up to expectations (O, .44) ; 
Quality of instruction (E, .42); 
The training in major area of interest 
(0, .33) ; 
The amount of required work in your 
courses (E, .32). 


This factor has been named Satisfaction 
with Intellectual Stimulation. However, it also 
seems to measure the quality of peers, the de- 
partment, and the instruction, along with in- 
tellectual stimulation. 


20%. 


O05 


10. 
oe 


30s 


Factor 3. Satisfaction with Assistantship 
Job—measures satisfaction with: 


4. Social relationship with your assistant- 
ship supervisor (E, .83) ; 

27. Recognition for work as an assistant 
Cis.8 1): 

28. Feedback on performance as an as- 
sistant (I, .64); 
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15. Income relative to other graduate stu- 
dents (X, .59); 

5. Competence of 
visor (E, .41); 

17. Research space available to you (E, 
34), 


Factor 4, Satisfaction with Physical En- 
vironment and Setting—measures satisfaction 
with: 

16. Study space available (E, .72); 

18. The overall physical plant at your uni- 
versity (E, .71); 

Recreational and cultural facilities pro- 
vided by your university (E, .58); 

The library facilities (E, .58) ; 

Research space available to you (E, 
56). 


One item, feedback on your performance as 
an assistant, loaded .40 on this factor; how- 
ever, the other item loadings were above .55, 
so that in this case we thought it justifiable to 
exclude this item. 


assistantship super- 


19. 


14. 
Lis 


Factor 5. Satisfaction with Constraints 
Placed upon the Student—measures satisfac- 
tion with: 


29. Emphasis placed on grades (FE, .67); 
22. Freedom in choosing course work (1, 
eo Lk 
12. Progress toward the PhD (I, .48); 
9. Prestige of the department (FE, .45); 
21. Opportunity for independent thought 
and action (I, .41); 
13. Voice in setting departmental policy 
(X,.41); 
Time limits placed on the student (E, 
39) ; 
Amount of required work in courses (E, 
36) 
1. Overall graduate training (O, .35); 
17. Research space available (E, .32); 
11. Opportunity for instructor-student dis- 
cussion (X, .30). 


The only item that causes difficulty is Item 
9, prestige of the department. Perhaps we can 
resolve the presence of this item on the basis 
that if department prestige is higher the stu- 
dent will have greater latitude in choosing a 
position, in publishing research, in making 
contacts. In sum, a department’s prestige can 
“open doors” for the student. 


ols 


30. 
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Factor 6. A complicated factor, loading on 
satisfaction with: 


24. Social interaction with fellow students 
(E, .70) ; 
23. Work and study interaction with fellow 
students (E, .62); 
2. Social life (E, .57); 
3. Future as a psychologist (I, .48); 
6. Decision to become a psychologist (X, 
AT JE 
12. Progress toward the PhD (1, .43). 


For want of a better label this factor will be 
called Satisfaction with Social Future. 


Factor 7. Satisfaction with Intellectual 
Stimulation and Freedom to Pursue Intellec- 
tual Interests—measures satisfaction with: 


5. The competence of your assistantship 
supervisor (E, .59); 
21. The opportunity for 
thought and action (1, .54); 
7. The intellectual stimulation provided 
by graduate training (I, .46); 
8. The intellectual climate in the depart- 
ment (E, .44); 
19. The recreation and cultural facilities 
provided by your university (E, .42); 
32. The training in major area of interest 
(O, 42); 
31. The time limits placed on the student 
(E, 37); 
30. The amount of required work in your 
courses (E, .30); 
22. The amount of freedom in choosing 
course work (I, .30). 


independent 


The three second-order factors, which ac- 
count for virtually all the variance in the 
first-order factors’ correlation matrix, are in- 
teresting in how they group those of the first 
order (see Table 1). 

Second-Order Factor 1. This factor groups 
general satisfaction (Factor 1), intellectual 
stimulation (Factor 2), and _ constraints 
placed upon the student (Factor 5). In terms 
of intrinsic and extrinsic variables, all the in- 
trinsic items (3, 7, 12, 21, 22) are represented 
here except the two concerning feedback as 
an assistant and the recognition received as 
an assistant, while only 10 of 19 extrinsic 
femoeappear §(S,°9, 1017) 23, 25, 26,029, 
BO, sok 
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TABLE 1 
SECOND-ORDER FACTORS 
Factor 1 2 | 3 
1 562 254 — .003 
2 455 — .096 297 
3 033 485 130 
4 146 239 540 
5 570 156 .056 
6 203 373 tine = 241 
7 081 356 | 084 





Second-Order Factor 2. This factor groups 
satisfaction with assistantship job (Factor 3), 
satisfaction with social future (Factor 6), 
satisfaction with intellectual stimulation and 
freedom to pursue intellectual interests (Fac- 
tor 7). In terms of extrinsic and intrinsic 
factors, all of the intrinsic items appear (3, 7, 
12, 21, 22, 27, 28), while only 10 of 19 ex- 
{rinsic: items appear (2, 4,°5,.8..17,.19) 23, 
24m30431): 

Second-Order Factor 3. This factor isolates, 
to a large extent, satisfaction with physical 
environment and setting, although satisfaction 
with intellectual stimulation is represented 
and, to a small degree, satisfaction with so- 
cial future. Only two of the seven intrinsic 
items appear (considering only First-Order 
Factors 2 and 4 which load .30 or above). 
Twelve of 19 extrinsic items are represented 
Gleon10814.e16ml 71s 19723, 95 26°30) 

Some variance of general satisfaction is ac- 
counted for by Second-Order Factors 1 and 2, 
where all of the intrinsic items appear, but 
none at all in Second-Order Factor 3 where 
only 2 of 7 intrinsic and 12 of 19 extrinsic 
items appear. 

Finally, that First-Order Factors 2 and 6, 
the social factors, were complicated (cf. Hulin, 
1966a) is made plainer by the second-order 
analysis. Factor 2, satisfaction with intellec- 
tual stimulation, has some variance repre- 
sented on Second-Order Factors 1 and 4. Fac- 
tor 6, satisfaction with social future, has its 
variance fairly evenly divided among Second- 
Order Factors 1, 2, and 4. One can only con- 
jecture that the “social” aspects of these di- 
mensions are represented on the more “ex- 
trinsic” Second-Order Factor 4, while the more 
job-related, self-actualizing, intellectual por- 
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TABLE 2 


INTERCORRELATIONS OF OVERALL SATISFACTION MEASURES 














Question 1 32 65 40 33 
How satisfied are you with: 

1. overall graduate training? 1.00 9 66 50 7 
32. training in major area? 1.00 48 44 .62 
65. graduate student life, overall? 1.00 61 .60 

40. Will you be prepared to reach your goals? 1.00 AS 
33. Did department meet expectations? 1.00 


tions are represented in Second-Order Factors 
1 and 2, the more “intrinsic” second-order 
factors. 

Interestingly, Item 20, satisfaction with 
recreational and cultural facilities in the city 
in which your school is located, does not have 
a salient loading on any factor, the only vari- 
able for which this is true. 


Multiple-Regression Analysis 


A Wherry stepwise multiple-regression anal- 
ysis (Wherry, 1940) was employed to de- 
termine the relationships of intrinsic and ex- 
trinsic items to overall satisfaction. The five 
overall measures, whose intercorrelations are 
presented in Table 2, were employed as de- 
pendent variables. 

The analysis was done in three stages. 
First, all seven intrinsic variables were cor- 
related with overall satisfaction, then the 19 
extrinsic variables, and finally all 26 extrinsic 
and intrinsic variables together. The multiple- 
regression coefficients (see Table 3) are ob- 
viously in contrast to what the two-factor-plus- 


TABLE 3 


COMPARISON OF THE RELATIONSHIPS OF INTRINSIC AND 
EXTRINSIC VARIABLES TO OVERALL SATISFACTION 
MULTIPLE-REGRESSION COEFFICIENTS 








Criterion Intrinsic Extrinsic 
variable variables variables 

1 .68 HAS (OH) 

32 .62 73 (.69) 

65 ao 79 (.76) 

40 57 50 (.47) 

33 67 85 (.83) 

Note.—The coefficients in parentheses are the correlations 


yielded by considering only the best seven extrinsic variables, 
a See Table 2 for item stems, 


importance hypothesis predicts, since the 
extrinsic variables tend to predict overall satis- 
faction better in four out of five cases. The 
differences in coefficients do not seem espe- 
cially great, but the differences magnify some- 
what when one considers the variance ac- 
counted for. 

A note of caution is in order, for one might 
well question this evidence on the basis that 
we “stacked the deck” against the intrinsic 
variables, since we reported the results for all, 
and the best seven, variables from the ex- 
trinsic set. However, such an objection cannot 
be sustained when the regression equations of 
both intrinsic and extrinsic variables on over- 
all satisfaction are computed. The sum of the 
contributions of each variable to the variance 
of the overall measures will then give a truer 
indication of the relative importance of each 
set of variables. We will arbitrarily set the 
number of variables included in this analysis 
to 10, since most of the systematic variance 
will be accounted for, and the possible effect 
of the larger number of extrinsic variables 
will be offset. 

The results, as shown in Table 4, are still 
contrary to the two-factor-plus-importance 
theory, for in three of five cases extrinsic 
variables account for more variance. Such a 
result also implies that the use of a one-item 
measure of overall satisfaction in tests of a 
theory is a far from adequate strategy. 

Furthermore, while these results were un- 
expected, they were not unprecedented. Hin- 
richs and Mischkind (1967) report that, 
contrary to the two-factor and the two-factor- 
plus-importance theory, “hygienes” were re- 
ported more often than ‘“‘motivators” as rea- 
sons for both satisfaction and dissatisfaction, 
by both high- and low-satisfaction people. 


JoB SATISFACTION AMONG GRADUATE STUDENTS 


Survey Results 


1. The predictions concerning the moderat- 
ing effects of sex are presented in Table 5. 
Two of the predictions hold up: for males, 
satisfaction with independent thought and ac- 
tion (p< .05) and voice in departmental 
policy (p < .07) are more importantly related 
to overall satisfaction than for females. How- 
ever, the reverse of the prediction that satis- 
faction with instructor-student discussion 
would yield a higher relationship with overall 
satisfaction for females was borne out (p < 
Ol): 

2. “Raw” survey results for variables which 
have been judged important by content, and 
by their importance to overall satisfaction, 
reveal the following for B, MO, and Tot.: 


All three groups answer all overall satis- 
faction items “‘slightly satisfied,” with B some- 
what more satisfied on the average. 

All respondents were slightly satisfied to 
satisfied, on the average, with their anticipated 
future as a psychologist, their decision to be- 
come a psychologist, the intellectual stimula- 
tion provided by their training, the intellec- 
tual climate in their department, their prog- 
ress toward the PhD, the opportunity for 
independent thought and action, and the 
amount of required work in their courses. 
However, the major, and indeed the only out- 
standing, source of dissatisfaction for all 
groups was with their voice in influencing de- 
partment policy, a result consistent with the 
recent nationwide student unrest. 


3. Statistics, psychometrics, and research 
design, along with history and systems, were 
endorsed most often by all groups as desirable 
basic courses for all students in psychology 
to take. 

4. According to the open-ended question 
requesting “Bouquets” and “Brickbats,” the 
faculty, its general quality, supervisory and 
technical competence, and breadth of outlook, 
was the most important source of satisfaction 
as well as dissatisfaction in both universities. 


DISCUSSION 


Within the limits set by the sample size, 
our questionnaire seems to be a fairly reliable 
instrument, and quite interesting factorially. 
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TABLE 4 


COMPARISON OF THE PROPORTION OF VARIANCE AC- 
COUNTED FOR BY INTRINSIC AND 
EXTRINSIC VARIABLES 





Criterion Intrinsic Extrinsic 
variable* variables variables 
1 17 45 
ou 13 46 
65 45 oe 
40 Sil .06 
33 O01 Biel 





8 See Table 2 for item stems. 


It is gratifying that, for the most part, the 
factors are clear cut; only in Factor 2 and 
Factor 6 are the interpretations strained, 
though we are convinced that they are not 
mere hodgepodges of variables; so that Mc- 
Nemar’s (1951) caveat concerning overinter- 
pretation perhaps does not apply here. Fur- 
thermore, the factors correspond quite closely 
to the coding categories set up for the ‘“Brick- 
bat” and ‘‘Bouquet” items: freedom, assistant- 
ship job, physical environment, level of over- 
all training, peers, are all clearly represented. 

Our results, as they bear on the con- 
troversy surrounding the two-factor theory 
and its alternative, add more evidence that 
the intrinsic-extrinsic dichotomy is not em- 
pirically useful. Our Second-Order Factors 1 
and 2 can be at best characterized as more 
intrinsic than extrinsic, and vice versa for 
Second-Order Factor 3. The complexity of 


MAB EES 

COMPARISON OF THE RELATIONSHIPS BETWEEN VARI- 
ABLES 2, 11, 13, 15, 17, 21, 24 AND OVERALL 

SATISFACTION—VARIABLE 1 





x? (corrected) Variable in x? (corrected) 
for males question for females 
04 Social life 18 
6.89:;5'¢ = .363 Instructor student 28; 6 = .09 
p< 01 discussion 
3.42; ¢ = .25; | Voice in depart- AAR Cy iI 
p < 07 ment policy 
40 Income relative 43 
to others 
5.10; ¢@ = .32; | Independent pl Lex —=5 00 
pi <= .09 thought and 
action 
18 Social interaction ea 
with peers 








270 


this intrinsic-extrinsic dichotomy is further 
shown by the fact that First-Order Factor 2, 
a predominantly extrinsic factor, loads most 
highly with the Second-Order Factor 1, where 
the general satisfaction factor appears. It 
seems that the context within which a variable 
is embedded must be taken into account in 
terms of its relevance for the satisfaction of 
self-actualization, or of more basic needs. For 
example, quality of instruction, for a graduate 
student, bears heavily on intellectual stimula- 
tion and overall satisfaction, while it may 
mean little to a sales trainee taking a 1-week 
course. Advancement is quite the “motivator” 
for an upwardly mobile business executive, 
but for certain civil servants, whose promotion 
may depend more upon time, it may have 
little, if any, motivational implications. 

Furthermore, in terms of effects on overall 
satisfaction, the hypothesis that intrinsic fac- 
tors are more important, which must make 
the damaging assumption of an intrinsic-ex- 
trinsic dichotomy, does not hold up. Does it 
not seem strange to expect that, when more 
sophisticated means of measuring importance 
have failed (Ewen, 1965), merely classifying 
variables as intrinsic or extrinsic should allow 
one to achieve meaningful results? 

Nature is frequently unparsimonious. Al- 
though a simple explanation is desirable, it 
must not be so simple as to mask, rather than 
mirror, the explicandum (Chein, 1963; cf. 
also Dunnette et al., 1967). What is needed 
in the area of job satisfaction is a broadsided 
attack on problems of definition and theory 
where expectations (Weitz, 1956; Wernimont, 
1966), community characteristics (Hulin, 
1966a), and other situational parameters, in- 
cluding employee needs (Katzell, Barrett, & 
Parker, 1961), are taken into account. (For 
the most sophisticated attempts along these 
lines see Hinrichs and Mischkind, 1967, and 
Mischkind, 1967, where a model of job satis- 
faction based on social learning theory is at- 
tempted.) Indeed, is it reasonable to ap- 
proach the study of job satisfaction in a more 
simple-minded fashion than the study of 
modern personnel selection and placement 
(Dunnette, 1963), or to assume that the 
dynamics at work should be any less complex? 
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CRITERIA USED BY RESEARCH AND DEVELOPMENT 
ENGINEERS IN THE SELECTION OF AN 
INFORMATION SOURCE* 


PETER G. GERSTBERGER anp THOMAS J. ALLEN 


Massachusetts Institute of Technology, Sloan School of Management 


The criteria employed by engineers in the selection of various technical informa- 
tion channels in problem-solving endeavors are investigated. The relationships 
of certain criteria identified in past research to such factors as frequency of 
channel utilization and the rate at which engineers accept or reject technical 
information received from specific channels are the focal points of the study. 
A direct relationship is found between perceived accessibility of information 
channels and several objective measures of utilization, whereas no definite sup- 
port is found for the hypothesis that the channels perceived highest in technical 


quality are those used most frequently. 


Before information technologists are able 
to design effective systems to provide the 
scientist and engineer with the information 
necessary for his work, a great deal must be 
learned about the behavior of the user popu- 
lation. Such research is requisite both to 
assess needs and to improve predictions of 
the impact on the total information system 
of new components or modifications of exist- 
ing ones. 

As a result, a large number of what have 
come to be called “user studies” (investiga- 
tions of the information-seeking behavior of 
scientists and engineers) have been performed 
over the past 10 years. Such studies endeavor 
to determine (a) the effectiveness of cur- 
rently used information channels, or (0) the 
criteria governing the selection of information 
channels. Most such studies, until very re- 
cently, have not, however, differentiated these 
two goals. They have implicitly assumed 
channel effectiveness to be the single criterion 
governing channel selection. Operating from 
this assumption, it was necessary only to 
observe the users and to assign performance 
measures to the channels on the sole basis of 
the extent to which they are used. 

Allen (1966a, 1966b), applying an inde- 
pendent measure of effective channel per- 
formance, found no relation whatever between 
this measure and the extent to which each 
of eight channels were used. These results 

1 The research reported in this paper was supported 


by a grant from the Office of Science Information 
Service, National Science Foundation (GN 597). 


counter any assumption of information qual- 
ity as the single criterion upon which channel 
selection is based. Allen proceeds to demon- 
strate the by now obvious fact that there 
exists in such a decision process a second 
parameter of considerable consequence. In 
addition to the value of a given channel, there 
is a certain cost associated with using it. The 
present study is directed at the measurement 
of relative costs associated with the use of 
each of nine information channels by R&D 
engineers and the relative weights assigned 
cost and value in this decision process (Gerst- 
berger, 1967). 

Cost in this sense is a rather complex con- 
cept. It may consist of many dimensions— 
economic, psychological, and physical. This 
study measures perceived cost to the user in 
terms of channel accessibility and ease of use. 
There is considerable support for the idea 
of measuring cost on these two dimensions. 
First, Allen’s (1966a) data led him to hy- 
pothesize that the frequency with which a 
channel is used is principally determined by 
the accessibility of the channel to the engi- 
neer seeking information. Second, both 
Werner (1965) and Rosenberg (1966) con- 
clude that the behavior of the users they 
studied appears to be influenced by percep- 
tions of the ease with which an information 
channel might be used. 


ResearcH Metruops 


The study was conducted in two divisions of a 
large electronics firm. A sample of 33 engineers was 
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Massachusetts Institute of Technology 
Solution Development Record 








Name: Date: — 
Problem: 
a. estimate of probability that a given 

alternatives under consideration alternative will be employed (please 


circle a probability estimate for 
each alternative) 





0 al! eD 3 4 a) 6 ed 8 9 1.0 





0 sil ee 2S 4 Bs) 6 all 8 9 1.0 





0 sil 2 co) 4 5) 6 Pt 8 aS 1.0 





0 a! 2 3 4 S 6 ai 8 9 1.0 
(Circled Probabilities Should Add to 1.0) 





b. If technical information having an impact upon your visualization of the problem or any of its potential solutions 


was received at any time during the past week, please circle the source(s) of that information on the line below. 
Information Source(s): L V c ES LS CR G E D 


Please try to recall two instances during the past week when you were actively searching for technical informa- 
tion related to the problem under consideration. For each case, please fill in a description of the type of informa- 
tion you sought and indicate the search order of all information channels consulted even though some or possibly 
all yielded little or no relevant information. (Place a “1” in the space provided in front of the source you con- 
sulted first; place a “2” in front of the source you consulted second; and so forth.) : 


(1) Type of information sought : 








Search order: Ts V Cc ES cas CR G E D Other 


(please specify) : 





(2) Type of information sought: 








Search order: TG, V. Cc ES aS GR G E. D Other 


(please specify) : 





Did you obtain the information sought in (1): pe Vies INO in (2): pee VES SaeNo 


Comments: , 5! 





Thank you! 


Fic. 1. The Solution Development Record. (This device is used to instrument and record activities during Research 


and Development projects. Respondents report, on a weekly basis, the probability of acceptance of solution alterna- 
tives and the use of information sources. Information sources are abbreviated as follows: L = literature; V = 
vendors; C = customer; ES = external sources; TS = technical staff; CR = company research; G = group 
discussion; E = experimentation; D = other division. The information sources are explained to the respondent 
on the reverse side of this device.) 
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TABLE 1 


INFORMATION CHANNELS 


Channel Description 

Books, professional, technical, and trade journals and other publicly accessible written 
material. 

Representatives of, or documentation generated by, suppliers or potential suppliers of 
design components. 

Representatives of, or documentation generated by, the government agency for which 
the project is performed. 

Sources outside the laboratory or organization which do not fall into any of the above 
three categories. These include paid and unpaid consultants and representatives of 
government agencies other than the customer agency. 

Engineers and scientists in the laboratory who are not assigned directly to the project 
under consideration. 

Any other project performed previously or simultaneously in the laboratory or organiza- 


L = literature 
V = vendors 
C = customer 


ES = external sources 


TS = technical staff 


CR = company research 


G = group discussion 
group. 
E = experimentation 


D = other division 





initially selected; of these, 19 remained fully coopera- 
tive over the 15-wk. period of the study.2 Of the 
19 cooperating engineers, 8 are employed in one di- 
vision, 11 in another. Those who are employed in 
the first division are further subdivided into those 
who work in a components laboratory and those who 
are with a systems laboratory. The two laboratories 
are separated by about 3 mi.; the second division is 
18 mi. away. The mean age of the 19 cooperating 
engineers is 33 yr. and the range 24-43 yr. Six have 
MS degrees; the remainder all hold BS degrees in 
engineering. They have been with their present com- 
pany an average of 5% yr. 

Data were gathered by means of a modified ver- 
sion of the Solution Development Record (Allen, 
1966c) (Figure 1). In addition to the questions about 
the probability of accepting alternative-solution can- 
didates to the S’s problem and the use of specific 
information channels, each S was asked each week 
to report in more detail on two information searches. 
Specifically, Ss were asked to report the type of in- 
formation sought, the order in which information 
channels were approached, and an indication of the 
success of the search. 


2A comparison of the 19 cooperating engineers 
with the 14 noncooperative ones shows no reason to 
suspect that they differ in terms of any background 
characteristics or (from the evidence available) in 
terms of their information-gathering behavior. 

3 The two divisions operate independently of on» 
another, and the nature of their business is quite dif- 
ferent. One division is in the missile systems business; 
the other is in the business of developing and pro- 
ducing electronic components. All relations reported 
in this paper, unless otherwise indicated, hold true 
for both divisions, when each is considered separately. 


tion regardless of its source of funding. This includes any unpublished documentation 
not publicly available, and summarizing past research and development activities. 
Ideas which are formulated as the result of discussion among the immediate project 


Ideas which are the result of test or experiment or mathematical simulation with no im- 
mediate input of information from any other source. 
Information obtained from another division of your company. 


The modified Solution Development Record was 
supplemented by periodic questionnaires in which 
each S was asked to rank nine information channels 
(Table 1) on the basis of the following four criteria: 

1. Accessibility—accessibility of the nine informa- 
tion channels (without giving consideration to the re- 
liability or quality of the information expected). 

2. Ease of Use—ease of use of the nine information 
channels listed. 

3. Technical Quality—technical quality or reliabil- 
ity of the information obtainable from each of the 
nine channels listed (without giving consideration to 
the accessibility of the channel). 

4. Degree of Experience—degree of experience the 
engineer has had with the nine channels during his 
career. 

Each of the four channel-ranking questionnaires 
was administered at a separate point in time over 
the course of the study. The minimum separation 
between questionnaires was 1 wk. Upon completion 
of the study, each S was questioned to determine 
whether he understood clearly the meaning of each 
of the four dimensions. In a few cases, in which an 
S exhibited confusion over the meaning of a dimen- 
sion, this difficulty was corrected by the interviewer, 
and S was allowed to rerank the channels. 

The resulting rank order of information channels 
for each criterion provides subjective estimates of 
“channel cost” and “channel payoff,” or value. These 
channel rankings can be tested for correlation with 
rank orders based on the frequency-of-use determined 
from Solution Development Record data. This tech- 
nique makes possible the identification of the relative 
importance of various criteria in information-channel 
selection. The postproject interview was also used 
to determine the information sources which had 
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brought to mind each of the solution alternatives 
considered by each engineer. 


RESULTS 


Seven hypotheses are tested in the study. 
All are concerned with predicting the criteria 
which determine both the frequency with 
which an information channel is used and the 
order in which it is consulted by an engineer 
requiring information: 

1. Those information channels which are 
considered more readily accessible will be 
used more frequently than less accessible 
channels. 

2. Those information channels which are 
considered easier to use will be used more 
frequently than less easy to use channels. 

3. Those information channels which are 
believed to provide information of higher 
quality will be used more frequently than 
those providing lower quality, less reliable 
information. 

4. Those information channels which are 
considered more readily accessible will be 
used first in the course of an active informa- 
tion search. 

5. Those information channels which are 
considered easier to use will be used first in 
the course of an active information search. 

6. Those information channels which are 
believed to provide information of higher 
quality or reliability will be used first in the 
course of an active information search. 

7. A greater proportion of ideas (measured 
by the ratio of messages accepted to those 
received) will be accepted from those in- 
formation channels which are believed to 
provide information of higher quality or 
reliability. 


Frequency of Use 


Testing the first hypothesis, a strong rela- 
tion (Kendall Tau = .67, p< .01) is found 
(Table 2) between channel accessibility and 
frequency of use. This first-order test pro- 
vides substantial support for the first hy- 
pothesis. Turning to the second criterion, 
ease of use, a somewhat weaker relationship 
(r= 44, p< .06) is found between the 
channel rankings. The application of partial 
correlation analysis (Table 3) to the data 
clearly reveals the magnitude and direction 


BTS 


TABLE 2 


COMPARISON OF FREQUENCY OF USE witH MEDIAN 
ORDERINGS ASSIGNED TO INFORMATION CHANNELS 
ON THE Basis OF THREE CRITERIA 














Criterion 
Information “dae 
channel z a seo 
Accessi- | Ease of | Technical} of Use 
bility use quality 
Literature 3 3 1 1 
Vendors 4 5 8 3 
Customer 7 8 9 6 
External sources 8 9 6 7 
Technical staff 2 1 3 4 
Company research 6 6 5 9 
Group f i 4 2 
Experimentation 5 4 2 5 
Other division 9 7 7 8 





Note.—N = 19 R&D engineers. Median rank orders for the 
nine information channels are derived in the following manner. 
The channel rankings assigned by the 19 engineers for each 
criterion considered are summed to yield a total score for each 
of the information sources. For example, if every engineer had 
ranked ‘‘Literature’’ as being the most accessible channel, the 
total score for literature would be 19. The channel having the 
lowest total score is ranked first, the next lowest second, and 
so forth. 


of the relationship. Accessibility is definitely 
the dominant cost criterion determining the 
relative frequency with which information 
channels are used. When accessibility is held 
constant, ease of use actually shows a slightly 
negative relation with frequency of use. Simi- 
larly, in the case of perceived technical qual- 
ity, there is a weak positive relation with 
frequency of use, but when accessibility is 
controlled, the relation approaches zero. Ap- 
parently, in the minds of the Ss, there is some 
relation between their perceptions of tech- 


TABLE 3 


RELATION OF THREE SELECTION CRITERIA TO 
FREQUENCY OF INFORMATION CHANNEL USE 




















Correlation be- Kendal atte Variable 
tween frequency Tau (ant partialed 
par 
of use and tialed) out 
Perceived ac- 
cessibility ON 98 Ease of use 
64 Technical 
quality 
Perceived ease 
of use 44* — .18 Accessibility 
36 Technical 
quality 
Perceived tech- 
nical quality .28 .03 Accessibility 
08 Ease of use 
*b <.05 
pr < 01 
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TABLE 4 


RELATION OF THREE SELECTION CRITERIA TO 
FREQUENCY OF CHANNEL SELECTION AS A 
First SOURCE 

















Correlation be- Kendall ; 
tween frequency Kendall law Variable 
of selection as Tau (oar partialed 
a first source \P out 
tialed) 
and 
Perceived ac- 
cessibility sOem 57 Ease of use 
61 Technical 
quality 
Perceived ease 
of use A4* —.16 Accessibility 
soi! Technical 
quality 
Perceived tech- 
nical quality 39 19 Accessibility 
pl Ease of use 
* pi <= .0 
D> < 01 


nical quality and channel accessibility, but 
it is the accessibility component which al- 
most exclusively determines frequency of use. 


First Sources 


Each § was asked once a week to report 
two instances during the preceding week in 
which he actively sought information. In each 
instance, Ss reported whether or not the 
search was “successful” and the order in 
which they contacted channels. There were 
154 searches reported. 

Table 4 presents the correlations between 
frequency of selection as a first source and the 
three criterion variables. Here again acces- 


TABLE 5 


MeEpIAN RANK ORDERING OF INFORMATION CHANNELS 
ON THE Basis OF DEGREE OF EXPERIENCE 





Information channel Ranking assigned 
Literature 

Group 
Experimentation 
Technical staff 
Vendors 
Company research 
Customer 

Other division 
External sources 


OOIAMNPWNHH 





Note.—N = 19 R&D engineers. 
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sibility appears as the dominant criterion 
upon which selection is based. Engineers turn 
first to that channel which is most accessible; 
perceived technical quality influences this 
decision only to a minor extent. 


Effect of Experience 


One would expect that the degree of experi- 
ence which an engineer has with a given 
information channel would influence his per- 
ception of both costs and value associated 
with that channel. He might never use the 
library, for example, because of a mispercep- 
tion that it contains nothing of value, or that 
it is really too difficult to find what you are 
looking for. Given some moderate amount of 
experience with library use, however, he may 
come to adjust these perceptions so that they 
are more in accord with reality. Similarly 
with interpersonal channels, increased experi- 
ence should adjust perceived costs. The ad- 
justment can, of course, mean either an in- 
crease or decrease in perceived cost, and may 
be entirely random, increasing for some indi- 
viduals or channels and decreasing for others. 
Should the latter situation hold, correlation 
analysis will fail to detect any systematic 
trend, and a zero correlation will be found 
between experience and perceived cost. If, 
however, experience brings about a systematic 
shift (most Ss shifting in the same direction 
with respect to most channels), correlation 


TABLE 6 


CORRELATION AND PARTIAL CORRELATION Co- 
EFFICIENTS FOR RELATION BETWEEN DEGREE 
oF EXPERIENCE AND PERCEIVED Costs 
oF Usinc INFORMATION CHANNELS 





Correlation be- Kendall 





tween degree | Kendall| Tau Variable 
of experience Tau (par- partialed 
and tialed) ut 
Accessibility oe 37 Ease of use 
.53 | Frequency of use 
Ease of use nose .37 | Accessibility 
.60 | Frequency of use 
Frequency of use | .61** 25 | Accessibility 
SL | Ease of use 
Technical quality} .56* 43 Accessibility 


.32 | Ease of use 
.39 | Frequency of use 
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analysis can detect this phenomenon and will 
show a positive or negative correlation be- 
tween experience and cost. 

Table 5 shows the nine channels arranged 
according to the amount of experience the 
engineers report having had with each of 
them. Table 6 demonstrates quite clearly that 
a strong positive relationship holds between 
the degree of experience an engineer has had 
with a given channel and both perceived 
accessibility and ease of use. 

Of course, the engineer may simply refer 
more frequently to those channels which he 
sees as more accessible or easier to use. In 
other words, the direction of causality may 
be from cost to experience. Lower-cost chan- 
nels are used more, and he thereby accumu- 
lates greater experience with them. To test 
this possibility and determine causal direc- 
tion, the frequency with which the engineers 
used the nine channels is controlled and the 
relation between experience and _ perceived 
cost again tested (Table 6). This reduces 
both of the original relations somewhat, but 
reasonably strong correlations remain, thus 
indicating that the degree of experience that 
an engineer acquires with an information 
channel does tend to lower his perception of 
the cost of using that channel. 


DISCUSSION 


Any assumption that engineers act in 
accord with a simple instrumental learning 
model in which they turn most frequently to 
those information channels which reward 
them most often should now clearly be laid 
to rest. Engineers, in selecting among infor- 
mation channels, act in a manner which is 
intended not to maximize gain, but rather to 
minimize loss. The loss to be minimized is 
the cost in terms of effort, either physical 
or psychological, which must be expended 
in order to gain access to an information 
channel. 

Their behavior thus appears to follow a 
“law of least effort” (see, e.g., Zipf, 1949). 
According to this law, individuals, when 
choosing among several paths to a goal, will 
base their decision upon the single criterion 
of least average rate of probable work. In 
other words, to minimize his average rate of 
work expenditure over time, “. . . an indi- 
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vidual estimates the probable eventualities, 
and then select(s) a path of least average 
rate of work through these [Zipf, 1949, p. 
6) 2? 

In the selection of information channels, 
the engineers certainly appear governed by a 
principle closely related to this. They attempt 
to minimize effort in terms of the work re- 
quired to gain access to an information chan- 
nel. However, they either ignore probable 
future work (searching through other chan- 
nels, upon failure to find the needed informa- 
tion through the first channel), or they are 
unable to estimate it. If, in fact, engineers 
were to consider future effort in making their 
decision, one would expect to find a negative 
correlation between the mean number of 
channels which must be used to gain the 
desired information after selecting one chan- 
nel as the first source. In other words, for 
each time that a given channel is chosen as 
a first source, the number (including zero) of 
channels which had to be used before the 
information was in hand can be counted. The 
mean number of additional channels used can 
be computed for each channel. This mean 
score serves as an index of the long-term 
effort (over and above that effort connected 
with gaining access) associated with each 
information channel. That correlating this 
index with the frequency of use of each chan- 
nel produces a near-zero correlation indicates 
that engineers do not take this long-term 
effort into account in selecting channels. So 
we are left with engineers behaving according 
to a simplified version of the law of least 
effort, in which they take only their immedi- 
ately predictable effort into account and 
minimize that parameter in making their 
decision. 

The implications of this finding are very 
important. Improving the quality or perform- 
ance of a particular information service will 
not, in and of itself, lead to increased use 
of the service. More investment in library 
holdings, for example, will be wasted unless 
at the same time this material is made more 
accessible to the user. Engineers will simply 
not be attracted to the library by improve- 
ments in the quality or quantity of the 
material contained there. The library must, 
in a sense, come to them. 
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Those concerned with R&D management 
whether they are laboratory directors, librar- 
ians, or administrators of professional engi- 
neering societies have both dimensions of this 
problem, to varying degrees, within their con- 
trol. To some extent they can control the 
quality of the information which is available 
through certain channels. The laboratory di- 
rector, for example, can through hiring make 
available a more competent technical staff to 
improve the quality of internal consulting. 
The librarian can improve the extent and 
quality of the material contained in the 
library. The professional societies can through 
careful editing and review improve the qual- 
ity of their present publications or make new 
publications or services available to the mem- 
bership. Improvements in any or all of these 
directions may well be desirable. But it must 
be remembered that such improvement, by 
itself, will not guarantee any improvement in 
the performance of the organization or of the 
engineering community. Before the improved 
information service can lead to increased per- 
formance, it must be used! And the only way 
to increased use is through increased acces- 
sibility. Fortunately, however, this parameter 
can usually be quite readily controlled by 
management. The laboratory director must 
not just hire more competent technical staff, 
he must encourage their use as consultants 
by rewarding them for this function, and by 
encouraging his engineers to go to them for 
information. He must make it known that 
no one will downgrade a man for seeking 
help and take steps to reduce the psychologi- 
cal risk and cost in internal consulting. The 
librarian must make his services more readily 
available. This may involve some rather ex- 
pensive additions to personnel (library assist- 
ants specializing in particular technical areas, 
special messenger services, mobile libraries, 
etc.). But unless this additional investment is 
made, the money spent on library holdings 
and facilities will be wasted. The engineering 
societies, too, can make their services more 
accessible. Many such improvements, as for 
example selective dissemination, demand de- 
posit, and the publication of critical annual 
reviews with extensive reference lists, have 
been discussed in recent years and many of 
these hold great promise. But there is a more 
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important step which the societies might take 
to make their material more accessible. The 
principal reason for the reluctance of engi- 
neers to use their professional literature is 
that, for the most part, they cannot under- 
stand it. Most of the professional engineering 
literature is too mathematically sophisticated 
for the average engineer to comprehend. It is 
therefore inaccessible to him. Now if one 
wishes to bring engineers into greater contact 
with their literature, there are two obvious 
approaches. The first is to operate on the 
engineer and enhance his mathematical so- 
phistication; equip him so that he can under- 
stand the literature as it now exists. The 
second and perhaps more tractable approach 
is to bring the literature to the engineer. The 
professional societies should publish a litera- 
ture form, whose technical content is reason- 
ably high, but which is understandable by the 
audience to whom it is directed. This is not 
an easy task. And it does not necessarily im- 
ply that the societies forego all of their pres- 
ent program of publications. What it does 
mean is that these publications are at least 
supplemented by a series of more readable 
works. That this can be done is amply demon- 
strated by the success of a large number of 
profit-making, controlled circulation publica- 
tions, which engineers do read (Allen, 1966a). 
The engineering societies should give up their 
style of emulating the scientific societies. 
Their goals are different. There is nothing 
wrong with publishing nonoriginal contribu- 
tions in engineering. The goal of engineering 
publication is to inform, not to stake out 
claims. Only when the engineering societies 
come to realize this will they serve their 
audience properly. 

Furthermore, since engineers perceive chan- 
nels to be more accessible as they gain greater 
experience in using them, any steps to in- 
crease use through improving accessibility 
will be self-reinforcing. As laboratory man- 
agers make their internal consultants more 
accessible, they will come to be used more 
and the engineers’ perception of their acces- 
sibility will be further enhanced. As engineers 
gain more experience with the more accessible 
professional literature, their perception of in- 
creased accessibility may generalize to the 
more sophisticated literature and encourage 
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them to work toward improving their ability 
to comprehend this literature. 

Engineers, it is true, do compensate, by 
varying the degree of skepticism with which 
they filter the ideas received, for their lack 
of attention to quality when selecting an 
information channel. This is an interesting 
phenomenon in which technical quality is 
recognized, but its consideration is delayed 
until after the channel has been selected. 
Accepting or rejecting messages is a process 
which takes place over a considerable period 
of time. The focus during this time is tech- 
nical quality (including, of course, cost and 
hardware availability). The alternatives under 
consideration are quite rigorously compared 
in terms of the quality dimension. It is no 
wonder then that the perceived reliability of 
the source is taken into account, and that 
ideas are accepted or rejected, at least in 
part, on that basis. 

From a simple engineering viewpoint, this 
seems to be an appallingly inefficient way of 
doing things: using the wrong criterion early, 
and later correcting for it. This is one more 
reason why steps must be taken to aid the 
engineer in overcoming the channel selection 
problem. Many of the suggestions made ear- 
lier in this section should more than pay for 
themselves in terms of engineering effort 
conserved. 


CONCLUSIONS 


Empirical support is found for the model 
of the information-channel selection process 
first proposed by Allen (1966a). In this 
process, channel selection is based almost 
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solely on channel accessibility, the -most 
accessible channels being used first. 

The following specific conclusions are de- 
rived from the study: 

1. Accessibility is the single most impor- 
tant determinant of the overall extent to 
which an information channel is used. 

2. Both accessibility and perceived tech- 
nical quality influence the choice of first 
source, 

3. Perception of accessibility is influenced 
by experience. The more experience an engi- 
neer has with a channel, the more accessible 
he perceives it to be. 
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Preference ratings of 10 familiar professional, 10 familiar skilled, 10 unfamiliar 
professional, and 10 unfamiliar skilled labor jobs, presented as either titles or 
descriptions, were obtained from 100 male high school students. Professional or 
familiar jobs were preferred as titles but these effects were absent or reduced 
in ratings of descriptions of the same jobs. Changes in job preferences in a 
2nd rating were greater when Ss were shifted from titles to descriptions, or 
the reverse, than when the identical type of material was rated, indicating non- 


equivalence between stimulus materials in study of job attitudes. 


Implications 


for personality and interest tests using job titles as items are discussed. 


In interest and personality tests such as the 
Vocational Preference Inventory (Holland, 
1965), Minnesota Vocational Interest Inven- 
tory (Clark, 1965), and the Strong Vocational 
Interest Blank (Strong, Campbell, Berdie, & 
Clark, 1966), the S states his occupational 
preferences. Frequently, however, only job 
titles are presented to S without further in- 
formation. Quite some time ago, several stud- 
ies (e.g., Bateman & Remmers, 1939; Nick, 
1942; Recktenwald, 1946, 1948) demon- 
strated the effect of information presented as 
lectures, discussion, reading, etc., on job-atti- 
tude change. Hoppock (1967) argues that 
providing individuals with such information 
assumes that attitudes are affected by what 
they know about vocations. He contends that 
simple job tities afford only limited cues, and 
suggests guidelines for providing occupational 
information, including title, description, quali- 
fications, demands, etc. However, these guide- 
lines are obviously uneconomical when more 
than a few jobs are to be presented for atti- 
tudinal evaluation, and thus have not been 
used to construct materials for interest tests 
or job-preference studies. 

Information and one of its resultants, fa- 
miliarity, are not the only variables affecting 
attitudes toward job titles. Using a few limited 
titles, Osgood and Stagner (1941) found that 
occupational stereotypes conform closely to a 
framework based on prestige, the latter being 
imputed to titles per se on the basis of brains, 
being noticed, financial return, excitingness, 
etc. Moreover, although not immediately rele- 


vant to the present study, recent factor-an- 
alytic research using titles alone disclosed 
differences among them on five major factors, 
which in one study (Gonyea, 1961) failed to 
resemble conventional interest groupings or 
job classifications, but which in a second in- 
vestigation (Gonyea & Lunneborg, 1963) with 
different methodology resembled a classifica- 
tion proposed by Holland (1959). In a study 
designed to examine the role played by job 
descriptions in interaction with titles, Osipow 
(1962), using the semantic differential, found 
differences in attitude toward differing job 
titles (Janitor or Building Superintendent) for 
the same job on evaluative and potency di- 
mensions which disappeared when descriptions 
were given. Osipow suggests that perceptions 
of jobs may be changed with minimal descrip- 
tions. 

Attitudes toward job titles (vocational 
stereotypes) have been employed by Holland 
as the basis of a personality inventory (Hol- 
land, 1958) and as a means of developing a 
theory of personality types based in part on 
vocational choice. Holland (1966) in fact 
states: 


The assumption that vocational stereotypes have re- 
liable psychological and sociological meanings is im- 
portant because most interest inventories rest heavily 
on its validity. If vocational stereotypes were unre- 
liable and inaccurate, the validity of interest inven- 
tories would be seriously reduced. In the case of the 
present theory, the validity of this assumption lends 
credence to the first assumption—vocational choice 
is an expression of personality—and to several sub- 
sequent assumptions [p. 5]. 
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The recent work with job titles invites fur- 
ther investigation of variables affecting atti- 
tudes toward them, such as familiarity, pres- 
tige level, and information provided by job 
descriptions. Regarding the latter, it was felt 
that somewhere between the completely re- 
stricted cues of titles alone and the detailed 
guidelines suggested by MHoppock (1967) 
should lie brief but informative kinds of occu- 
pational materials: job descriptions condensed 
from the Dictionary of Occupational Titles 
(DOT) (United States Employment Service, 
1949) .1 Such descriptions should serve to con- 
vey the general “flavor” and essential details 
of job duties, without being elaborate and 
time consuming. They should also be particu- 
larly useful when job titles are unusual or 
misleading, and should tend to reduce the in- 
fluence of prestige level and occupational 
stereotypes on measures of job attitudes and 
preferences, as Osipow (1962) clearly demon- 
strated. 

The present study was designed principally 
to compare titles and descriptions in terms of 
the influences of job familiarity and prestige 
level on occupational preference ratings. In 
addition, results were obtained concerning the 
stability and equivalence of ratings of these 
two kinds of materials. 


MeEtTHOpD 
Materials 


All jobs in the DOT 0 code (professional) and 
DOT 4 and 5 codes (skilled labor) were scanned, in 
order to select, on an a priori basis, familiar or un- 
familiar jobs in each category. An attempt was made 
to match at each code level a familiar with one or 
more quite unfamiliar jobs in the same “job family.” 
For example, Electrician, a skilled familiar job, was 
paired with Boardman, an unfamiliar job in the 
same three-digit group. Sixty-eight titles were thus 
selected, representing a wide range of occupations, 
and preliminary ratings of familiarity were obtained 
from small groups of college freshmen (n’s of 16 
and 17). Inspection of mean ratings caused attrition 
of 20 titles because no matched job was sharply 
discriminated on familiarity. The preliminary ratings 
were also used to determine which matched job 
offered the greatest familiarity difference for each 
original title (e.g., Controller was much more un- 
familiar than Comptroller as a match for Account- 


1The 1949 edition of the DOT rather than the 
latest edition was used in this study in order to as- 
sure completely unfamiliar titles, as called for in the 
design. 
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TABLE 1 


FAMILIAR AND UNFAMILIAR Jogps, Dor CoprEs, AND 
REsuLts OF SiGN TESTS OF FAMILIARITY 






































DIFFERENCES 
Pamili DOT ays DOT | Sign 
Familiar ans Unfamiliar eas eae 
Professional jobs 

Accountant 0-01.10} Controller 0-01.70} 3.9** 
Reporter 0-06.71 | Continuity 0-06.23 | 4.8** 

Writer 
Chemist 0-07.00 | Enologist 0-07.10 | 4.9%** 
Actor 0-02.11 | Elocutionist 0-02.25 | 4,4** 
Medical Doctor 0-26.10} Pathologist 0-26.10} 4,1** 
Psychologist 0-39.17 | Etymologist 0-39.16} 4.4** 
Criminal Lawyer | 0-22.10] County 0-22.50} 4.8** 

Surrogate 
Biologist 0-39.31 | Ecologist 0-39.33 | 4.97% 
Physicist 0-39.48 | Entomologist 0-39.65 | 4.0** 
Civil Engineer 0-16.01 | Structural 0-16.01 | 2.7** 

Designer 

Skilled labor jobs 
Auto Mechanic 5-81.61} Millwright 5-78.10 | 5.1** 
Electrician 4-97.01 | Boardman 4-97.11 | 4.9** 
Jeweler 4-71.01 | Gemologist 4-71.22 | 4.4** 
Locomotive 5-41.06 | Hostler 5-41.03 | 4,3** 
Engineer 

Gunsmith 5-83.54 | Armorer 5-83.54 | 3.8** 
Stone Engraver 4-46.10 | Lithograph Artist] 4-46.30 | 2.5* 
Welder 4-85.01] Brazer 4-85.31] 4,4** 
Engraver 4-73.51 | Pantographer 4-73.51]| 4.4** 
Stone Mason 5-24.21 | Terrazzo Worker | 5-24.51 | 4.4** 
Carpenter 5-25.11 | Pitwright 5-25.83 | 5.6** 





Note.—DOT = Dictionary of Occupational Titles (United 
States Employment Service, 1949). 


p <.05 
¥ED < 201. 


ant). The remaining 48 jobs, half professional and 
half skilled, were then submitted by title to another 
group of freshmen (n= 116) who again rated these 
jobs on a 7-point scale as to their degree of famili- 
arity with them. By means of the sign test (Siegel, 
1956), 40 titles were then selected which were sharply 
discriminated as to familiarity. These 40 jobs, evenly 
divided into 10 professional familiar (PF), 10 pro- 
fessional unfamiliar (PU), 10 skilled familiar (SF), 
and 10 skilled unfamiliar (SU), their DOT codes, 
and results of the sign test, are presented in Table 1. 
In all except 1 of the 40 jobs, the match of familiar 
with unfamiliar occurred exactly for the first three 
digits of the DOT code, revealing the “job family” 
closeness of the 20 pairs. 

In order to have a well-controlled stimulus for 
job descriptions, each of the longer DOT descriptions 
of these 40 jobs was condensed to a length of 20-30 
words. Mean length was 22.5 words for professional 
jobs and 23.1 for skilled. In rewriting descriptions, the 
title of the job was, of course, scrupulously avoided. 
Thus, the two types of presentation in the main ex- 
periment had one which gave the title and no other 
cues, and one which gave a brief description of the 
job duties but no title. 


Subjects and Design 


Four groups of volunteer Ss rated the 40 occupa- 
tions on two separate occasions spaced 1 mo. apart. 
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The groups differed only in the type of material, 
that is, titles or descriptions, rated in each session: 
Group I, Title-Title (TT); Group II, Description- 
Title (DT); Group III, Title-Description (TD) ; 
Group IV, Description-Description (DD). The basic 
plan was thus a 2 X 2 crossover wiih respect to type 
of material and order of presentation, with additional 
repeated measured factors of familiarity (familiar 
versus unfamiliar jobs), and prestige level (profes- 
sional versus skilled jobs) as described above. 

A total of 240 Ss, all male high school seniors, 
had been randomly preassigned with 60 in each 
group. At the first session, 224 Ss appeared, with 
slightly uneven amounts in each group. For adminis- 
trative reasons, Ss were tested as a single group in 
both rating sessions conducted in the school cafeteria 
during specially assigned periods. They were aware 
only of being assigned to one of four groups (not 
the rationale of the groups) who were given an ap- 
propriate booklet in which titles or descriptions, as 
the case might be, were arranged in random order. 
The Ss were instructed to rate the stimuli on a 5- 
point Likert-type scale from 1 (like very much) to 
5 (dislike very much). In order to stimulate interest 
prior to rating, they were asked to indicate, in addi- 
tion to their names and ages, their father’s job, own 


TABLE 2 


MEAN PREFERENCE RATINGS OF JoB TITLES 
AND DESCRIPTIONS 


























Virst rating Second rating 
Type of Job M SD M | SD 
material class = 
Groups TT Groups TT 
& TD & DT 
Titles PE 3.00 al! 3.03 76 
PU 345 .66 3.50 12 
SF 3.43 5) 3.43 74 
SU 3.70 .69 3.76 .69 
Groups DT Groups TD 
& DD & DD 
Descriptions | PF 3.16 09 3.09 70 
Pin esr oO q00 3.46 |. .68 
SF 3.24 .62 Soil 61 
SU 3.42 58 Sal foi 
Total PF 3.08 .66 3.06 73 
EO eel .60 3.48 70 
SF 3.34 .69 3.40 .68 
SU 3.56 64 3.64 .63 











Note.—Abbreviations: PF = professional familiar; PU 
= professional unfamiliar; SF = skilled familiar; SU = skilled 
unfamiliar. TT = Title-Title; TD = Title-Description; DT 
= Description—Title; DD = Description—Description. 
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goal job, and college-attendance intent. They were 
also told that Es were interested in their attitudes 
toward certain jobs, that their cooperation would 
aid research in this area, and to do their best in rat- 
ing all stimuli presented to them. The Ss were not 
informed of the type of material rated by other Ss. 
Of the number who participated in the first session, 
100 failed to return for the second rating (50 from 
the “title” condition and 50 from the “description” 
condition) but no significant differences were ob- 
served in the first session data between second session 
returnees and nonreturnees. An additional 24 Ss were 
randomly eliminated to bring the 2 of each group 
to equality at 25 Ss. 


RESULTS 
Job Preferences 


The mean ratings for each job class and 
type of material are shown in Table 2. There 
were no statistically reliable effects attributa- 
ble to order of presentation, and therefore the 
data of the groups rating titles and of those 
rating descriptions, respectively, were com- 
bined for the first and second ratings, as the 
case might be. 

The ratings of titles consistently showed 
professional to be preferred to skilled labor 
jobs, and familiar to be preferred to unfa- 
miliar jobs. These differences, however, ap- 
peared to be considerably reduced in the 
ratings of descriptions. When titles alone were 
rated, the difference between means of pro- 
fessional and skilled jobs was .34; the same 
difference for descriptions was .08. Again, 
when titles alone were rated, the difference 
between means of familiar and unfamiliar jobs 
was .38; the same difference for descriptions 
was .26. Analysis of variance of the data of 
the first rating showed significant effects of 
familiarity (F = 67.5, df= 1/96, p< .001) 
and prestige level (F = 7.1, df =1/96, p< 
.01). Analysis of variance of the second rat- 
ings resulted in similar significant effects: fa- 
miliarity, F = 111.6,-af= 1/96" p= 0er 
prestige level, F = 10.8, df = 1/96, p < .01; 
but additionally there was a significant inter- 
action between these variables (F = 10.6, 
df = 1/96, p < .01). It should be noted that 
no particular meaning can be attached to this 
interaction since the familiar and unfamiliar 
jobs were not selected to insure an equal dif- 
ference in familiarity within skilled and pro- 
fessional categories, respectively. While the 
familiarity effect is somewhat larger for pro- 
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fessional jobs, there is no reversal of the basic 
pattern of the main effects. 

The only significant effect of type of ma- 
terial found in the first rating involved an in- 
teraction with prestige level (F = 7.1, df = 
1/96, p < .01). As Table 2 reveals, there was 
essentially no difference in preference between 
professional and skilled jobs when these were 
rated as descriptions, but a considerable dif- 
ference between them was obtained when they 
were rated as titles. However, a similar effect 
was not observed in the data of the second 
rating (F = 1:3, df= 1/96, p > .20). A sig- 
nificant interaction between type of material 
and familiarity was obtained in the latter an- 
alysis = 5.7; df = 1/96; p< .05)} appar- 
ently reflecting the reduction in the difference 
between familiar and unfamiliar jobs found 
for ratings of descriptions as compared to 
those of titles. This reduction must be attrib- 
uted largely to the ratings of skilled jobs pre- 
sented as descriptions, although the three-way 
interaction was not statistically significant 
eee —=1/96, p20). 


Preference Shifts 


Table 3 shows the mean absolute change in 
job preferences obtained between the first and 
second ratings. Since no statistically signifi- 
cant differences were obtained between the 
two groups rating the same type of material 
on both occasions, or between the two groups 
rating different types of material, the data 
were combined for groups having constant and 
changed materials, respectively. 

The preference shifts were clearly larger 
when the cue form was changed between the 
two ratings, as might be expected, but the 
effect was more pronounced for unfamiliar 
than for familiar jobs. An analysis of variance 
confirmed the reliability of these effects, show- 
ing significance for change in materials (F = 
15.5, df = 1/96, p< .001) and familiarity 
(F = 12.5, df = 1/96, p< .001), and inter- 
action between these variables (Ff = 12.0, 
df= 1/96, p< .001). No other significant 
effects were obtained in the analysis of the 
shift scores. 


DISCUSSION 


Looking at the absolute preference ratings 
first, Table 2 data show that familiar jobs 
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TABLE 3 


MEAN PREFERENCE SHIFTS FOR GROUPS RATING 
CONSTANT AND CHANGED MATERIALS 














Constant Changed 
Job [Groups TT & DD|Groups TD & DT| Total 
class M 
M SD M SD 

PF 62 oil .67 m0 .65 
PU 62 35 91 31 ih 
SF .65 ao at 30 iil 
SU .65 ESI 87 aso 16 





Note.—PF = professional familiar; PU = professional un- 
familiar; SF =skilled familiar; SU = skilled unfamiliar. 
TT = Title-Title; DD = Description—Description; 
Title—Description; DT = Description—Title. 


TD= 


were obviously preferred to unfamiliar ones, 
regardless of the occupational prestige level, 
in both the first and second ratings. This 
would be comprehensible in the title situation, 
where the cue was absolutely minimal, that is, 
one word. However, it is interesting that a 
similar difference, a sort of “clinging to the 
familiar,” was found in the description situa- 
tion where there was now far greater verbal 
elaboration. The only effect of cue form on 
familiarity was to reduce slightly, but not 
eliminate, its effect in the second rating. There 
are two possible reasons for the effect of per- 
sistence of the effect of familiarity despite 
the additional information provided by the 
description. On the one hand, there would 
seem to be a tendency for job descriptions of 
familiar jobs to specify relatively familiar ac- 
tivities with the result that S prefers these 
descriptions to those of the unfamiliar jobs. 
(This hypothesis is offered despite the fact 
that in the descriptions the words per se were 
all within the vocabulary level of high school 
seniors.) On the other hand, inspection of the 
job descriptions suggests that those of the 
familiar jobs, especially at the professional 
level, involved rather broad, general activities, 
whereas those of the unfamiliar ones tended 
to specify relatively narrow, specialized du- 
ties. In essence, the latter perhaps provided 
less opportunity for inclusion of an appealing 
element to Ss such as those of this study. 
Regarding prestige level, results showed 
that job descriptions were effective in elimi- 
nating biases in favor of professional jobs 
found when titles alone were rated. The means 
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for the first rating were: titles, Professional 
(P) = 3.23, Skilled (S) = 3.57; descriptions, 
P = 3.33, S = 3.33. For the second rating the 
means were: titles, P = 3.27, S= 3.60; de- 
scriptions, P = 3.28, S = 3.44. While the pres- 
tige level effect was not entirely eliminated 
by the presentation of descriptions in the sec- 
ond rating as it was in the first, it is the data 
of the first rating that would appear to have 
the most relevance to any application of these 
findings to interest inventories, since such tests 
are usually taken only once. To summarize 
here briefly the two main findings, it might 
be said that job description was more effec- 
tive in eliminating biases toward prestige level 
than it was in eradicating the effects of fa- 
miliarity. 

Turning attention now to shift scores, it is 
obvious that the degree of intersession varia- 
tion in ratings with the type of material un- 
changed was comparable across occupational 
levels and degree of familiarity. Additionally, 
the finding of equal shifts for the TT and DD 
groups in the preliminary analyses indicates 
that ratings of titles and descriptions were 
equally stable over the interval used here. 
There was clearly less consistency in ratings 
with changed cue form, indicating that Ss 
generally did not rate titles and descriptions 
in a similar fashion. This effect was much 
more pronounced for unfamiliar than for fa- 
miliar jobs. In fact, it did not seem to appear 
for the PF jobs (perhaps not quite as ob- 
viously so for SF jobs), indicating that S rec- 
ognized these even in description form. The 
implication of this particular finding is that 
additional information provided by job de- 
scriptions does not compensate for the effects 
of familiarity very well in terms of mean rat- 
ings. It is clear that information influences 
ratings much more for unfamiliar than for fa- 
miliar jobs, a situation which would indicate 
that descriptions are more valid indicators of 
preferences than titles for unfamiliar jobs. 
On the other hand, either titles or descrip- 
tions are relatively interchangeable as meas- 
ures for jobs familiar to S. 

The present study, while admittedly op- 
erating with a model of extremes (familiarity 
or unfamiliarity, title or description, higher 
or lower prestige level), has some implications 
for interest inventory item construction, as 
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well as for Holland’s (1966) work, which 
makes extensive use of occupational titles. 
Regarding interest inventory items, the argu- 
ment by Kuder (1966) concerning his new 
Occupational Interest Survey that “occupa- 
tional titles should be avoided in order to keep 
obvious vocational significance to a minimum” 
might carry more weight in view of the find- 
ings of this study. As is well known, many of 
the Kuder items, which involve activities, are 
in essence exceedingly brief occupational de- 
scriptions. Different item formats may thus 
partially account for reduced correlations be- 
tween inventories (e.g., Strong versus Kuder) 
exploring similar vocational interest domains. 
Fairly recently, King, Norrell, and Powers 
(1963) studied the relationships between 
“identical twin” scales (those on the Strong 
and Kuder bearing the same name) and “‘fra- 
ternal twin” (similar but not identical) scales 
and found generally that even r’s for “‘iden- 
tical twin” scales were rather low, the average 
coefficient being .370. 

Concerning Holland’s research, since titles 
alone are apparently sensitive to other varia- 
bles, such as prestige and familiarity, varia- 
tions in S’s choices (and consequent personal- 
ity typing) may be influenced by these. For 
example, in the latest revision of his Voca- 
tional Preference Inventory (Holland, 1965) 
some relatively ‘unfamiliar’ job titles are 
used, such as Meteorologist, Quality Control 
Expert, Manufacturer’s Representative, Traf- 
fic Manager, Inventory Controller, etc. His 
titles also vary widely over prestige levels. 
Probably the most prestigious of his six voca- 
tional-personality types is the Intellectual, 
and the least prestigious is the Realistic (Hol- 
land, 1966). Many of the job-title preferences 
of the former closely resemble the PF cate- 
gory in this study; those of the latter resemble 
the SF category. Whereas the basic validity 
of his theory of vocational choice is not ques- 
tioned, it would be valuable to examine the 
160 Vocational Preference Inventory titles for 
significant preference shifts, if any, with job 
descriptions. 
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Need-fulfillment theories of job satisfaction generally assume that individuals 
differ in the outcomes they prefer (need) to obtain from their jobs. Moreover, 
these theories hypothesize that the relationship between the outcomes received 
on the job and satisfaction is dependent upon these preferences (needs). The 
hypothesis this study investigated is that the pattern of preferences for job 
outcomes moderates the relationship between preference for an outcome and 
satisfaction with that outcome. Questionnaires were employed to assess prefer- 
ence and satisfaction on 30 different job outcomes. Ss were 113 industrial re- 
search scientists working in 1 facility. Preference (need) ratings were subjected 
to Q cluster analysis. 2 “need” types were identified. It was found that need 
type moderated the relationship between preference (need) for an outcome 
and satisfaction with that outcome only on those outcomes most closely related 
to the company itself. A hypothesis was proposed that these 2 need types 
reflect the 2 types of research and development personnel that differ primarily 
in orientation toward the company itself. 


The significance of individual differences in 
motivational variables for the prediction of 
job satisfaction has been emphasized by sev- 
eral writers (Morse, 1953; Schaffer, 1953; 
Vroom, 1960, 1964). The rationale commonly 
associated with such predictions is the need- 
fulfillment model, which states that job satis- 
faction is a function of the degree to which 
needs are met (“fulfilled”) by the work en- 
vironment. According to this model, two in- 
dividuals with different sets of needs, even 
though working in the same jobs, with similar 
working conditions and similar compensation, 
would not necessarily exhibit the same level 
of job satisfaction. 

In a more recent statement of the need-ful- 
fillment model, the Theory of Work Adjust- 
ment (Dawis, England, & Lofquist, 1964) 
hypothesizes that job satisfaction is a function 
of the correspondence between the needs of 
the individual and the reinforcer (i.e., need- 
fulfillment) system of the job. In common 
with earlier need-fulfillment models, this the- 
ory places major emphasis on individual dif- 
ferences in needs, but leaves to empirical re- 
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Health, Education and Welfare, Washington, D. C. 

2 Now at the University of Illinois, Urbana. The 
senior author wishes to thank Merle E. Ace for 
critically reading an earlier draft of this manuscript. 


search the task of specifying the nature of 
the relationship between needs and reinforcer 
systems which is predictive of job satisfaction. 

In the present study, a moderator variable 
is hypothesized as affecting the relationship 
between need-reinforcer correspondence and 
job satisfaction. The ‘‘need-type hypothesis” 
proposes that the pattern of needs of the in- 
dividual is an important moderator variable 
in the prediction of job satisfaction. More 
specifically, the hypothesis states that different 
degrees of accuracy in predicting satisfaction 
will be found for groups with different need 
patterns, that is, for different “need types.” 
The need-type hypothesis therefore assumes 
that there exist different need types, that is, 
individuals with distinctive need patterns. 
These need types are seen to differ mainly in 
the way their needs are interrelated. Thus, 
need type may be conceptualized as needs 
organized in complex interaction. 


METHOD 


To test the need-type hypothesis, Ss must be 
working on jobs which have equivalent reinforcer 
systems. Industrial research scientists working in re- 
search and development for one company and at 
one location may be assumed to meet this require- 
ment. A measure of work-relevant needs also is re- 
quired for a test of the hypothesis. The Minnesota 
Importance Questionnaire (Weiss, Dawis, England, 
& Lofquist, 1964b), an instrument developed to 
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measure vocational needs, was employed to meet 
this requirement. 

As the measure of need-reinforcer correspondence, 
the study employed the subtractive model, which 
assumes that the more completely the need is ‘‘ful- 
filled,’ the greater the satisfaction; that is, the 
strength of the need minus the amount of need 
fulfillment (reinforcement) equals the degree of 
satisfaction. Evidence for the moderating effect of 
need type would be found in differences in the need- 
satisfaction relationship for different need-type groups 
or, in terms of analysis of variance, an interaction 
effect for need-type group and need level on the de- 
pendent variable of satisfaction. 

The Ss were 113 industrial scientists working in re- 
search and development for a large company. All 
worked in the same organizational unit. Median age 
was 34 yr. and median tenure with the company 
was 4.3 yr. 

The instruments employed were the 1966 revisions 
of the Minnesota Importance Questionnaire (MIQ) 
and the Minnesota Satisfaction Questionnaire (MSQ). 
The MIQ consisted of 117 Likert-type items and was 
scored on 30 scales. Respondents were asked to rate 
the importance of various aspects of work to their 
idea of an ideal job. For each item, the response 
choices were “not important,” “only slightly im- 
portant,’ “important,” “very important,’ and “ex- 
tremely important.” The response choices were 
scored 1 through 5, respectively. Validity evidence 
for the MIQ as a measure of needs is reported in 
the Minnesota Studies in Vocational Rehabilitation 
series (Weiss, Dawis, England, & Lofquist, 1964a, 
1964b, 1965). 

The MSQ consisted of 150 Likert-type items and 
was scored on 30 scales representing the same rein- 
forcement dimensions utilized for the MIQ.* Respon- 
dents rated the degree of satisfaction they felt with 
various aspects of their job. For each item, the re- 
sponse choices were “not satisfied,’ “only slightly 
satisfied,” “satisfied,” “very satisfied,” and “extremely 
satisfied.” These were scored 1 through 5, respectively. 
The MIQ and the MSQ were administered to the Ss 
in the work setting during working hours. To en- 
courage frankness, the identity of the respondents 
was kept anonymous. 

The MIQ scale scores were transformed into T 
scores with a mean of 50 and a standard deviation of 
10 to equate scale means and variances. Q correla- 
tions between each individual and all other indi- 
viduals were computed for the 113 individuals over 
the 30 T-scored MIQ scales. To identify homogeneous 
(need type) groups the 113X113 Q-correlation 
matrix was cluster analyzed using Tryon’s procedure 
(Fruchter, 1954). 


3A table giving reliability coefficients of the MIQ 
and MSQ scales has been deposited with the Ameri- 
can Documentation Institute. Order Document No. 
9950 from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washington, 
D. C. 20540. Remit in advance $1.25 for photocopies 
or $1.25 for microfilm and make checks payable to: 
Chief, Photoduplication Service, Library of Congress. 
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Once the need-type groups had been identified, 30 
two-way analyses of variance for unequal frequencies 
were performed. The first factor in each ANOVA 
was need group. The same two categories were 
used for all ANOVAs. The second factor in the 
ANOVAs was the MIQ scale corresponding to the 
same dimension as the dependent variable, the MSQ 
scale. For all ANOVAs the MIQ scale-score distribu- 
tion was dichotomized by cutting at the median 
score. To determine if the moderating effects of 
need group held in the continuous case, product- 
moment correlations were calculated between the 
MIQ scale scores and the corresponding MSQ scale 
scores for each need group separately. Finally, demo- 
graphic and MIQ variables were analyzed to de- 
termine further the nature of the need-type groups. 


RESULTS 


Cluster analysis of the 113 X 113 Q-cor- 
relation matrix resulted in 12 clusters which 
included all 113 Ss. With the £ coefficient set 
at 1.75, only the first cluster was of sufficient 
size to warrant use in further analysis. The 
first cluster contained 32 individuals, the 
second 12, the third 18, the fourth 13, and 
the remaining clusters less than 10 individuals 
each. Therefore, the first cluster of 32 in- 
dividuals was selected for use as a homogene- 
ous (need-type) group. It was called the “in” 
cluster. All 81 Ss not included in the in cluster 
represented a heterogeneous need group which 
was called the “out” cluster. 

Of the 30 two-way analyses of variance, 8 
yielded significant interactions between need 
group and need (MIQ), with satisfaction 
(MSQ) as the dependent variable. Table 1 
shows the results of these eight ANOVAs. 
Only on the Security scale was there a sig- 
nificant need-group mean difference in satis- 
faction score. 

The three largest interaction F ratios were 
found on the dimensions of Working Condi- 
tions, Security, and Company Goals. Other 
scales with significant interactions were Au- 
thority, Company Image, Company Policies 
and Practices, Supervision—Human Relations, 
and Social Status. 

The interaction effect is apparent from a 
comparison of the means of the four cells. In 
the case of the in cluster, the high-need group 
showed higher satisfaction than the low- 
need group on all eight dimensions. In con- 
trast, for the out cluster, with one exception, 
the high-need group showed lower satisfaction 
than the low-need group on these dimensions. 
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TABLE 1 


SATISFACTION MEANS FOR THE E1GHT DIMENSIONS 
WITH SIGNIFICANT INTERACTIONS 














“Tn” cluster | “Out’’ cluster 
Scale eo 
Low | High | Low | High 
need | need | need | need 
Authority 14.36 | 15.78 | 15.35 | 14.30* 
Company goals 12.13 | 14.19 | 13.29 | 11.16** 
Company image 13.57 | 16.78 | 14.86 | 15.68* 
Company policies and | 13.90 | 15.83 | 14.64 | 13.57* 
practices 
Security 14.39 | 17.16 | 15.02 | 14.11** 
Social status 14.36 | 15.14 | 15.57 | 14.15* 
Supervision—human 151A eal OO) |) LO On lo.” 
relations 
Working conditions 1502 O26 Tes /o deel or = 








Note.—N = 113. ne 
* Interaction between need group and need level significant 
at p < .0S. 


** Interaction significant at p < .01. 


Correlations between MIQ scores (on a 
continuous scale) and MSQ scores on the 
corresponding dimension, for the eight dimen- 
sions with significant interactions, are shown 
in Table 2. Correlations for the total sample 
ranged from —.20 to .34. These two extreme 
values were the only correlation coefficients 
differing significantly from zero. The correla- 
tions for the in cluster ranged from .11 to .60 
with four of the eight coefficients differing 
significantly from zero. In contrast, the cor- 
relations for the out cluster ranged from —.34 
to .23, with four of these being significant. 
The significant correlations were positive for 
the in cluster and, with one exception, nega- 
tive for the out cluster. Fisher’s Z transforma- 
tion was employed to test the differences in 
correlations between the in cluster and the 
out cluster. Significant differences were found 
on seven of the eight dimensions. Only the 
correlation difference for the Supervision—Hu- 
man Relations dimension was found to be non- 
significant. The largest difference between cor- 
relations was on the Security dimension, which 
was .60 for the in cluster and —.16 for the 
out cluster. 

An analysis of demographic variables for 
the two clusters revealed no differences be- 
tween the groups in age, education, tenure on 
the job, tenure with the company, time in 
the profession, job classification, or work- 
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group membership. Similarly, no differences 
between the means or variances of the in 
cluster and the out cluster were found on any 
of the MIQ scales. 


DISCUSSION 


The hypothesis that need type moderates 
the prediction of job satisfaction finds sup- 
port in this study. For the present group of 
industrial scientists working in research and 
development for one company, there appears 
to be at least two need groups for which the 
relationship between need level and satisfac- 
tion on several dimensions is quite different. 
The improvement in the accuracy of pre- 
dicting satisfaction contributed by the need- 
type variable can be seen in Table 2. The 
correlations between corresponding need and 
satisfaction dimensions for the entire sample 
masked the correlations within each of the 
need groups. For instance, the correlation for 
the entire sample on the dimension of Security 
was .03 which masked correlations of .60 for 
the in cluster and —.16 for the out cluster. 
Similarly on Working Conditions, the overall 
correlation was —.01, but for the in cluster 
and out cluster, respectively, the correlations 
were .48 and —.20. Thus the moderating ef- 
fect of the need-type variable in this study is 
evident. 

It is of some interest to inquire into the 
nature of the two need groups. The in 
cluster and the out cluster identified in this 


TABLE 2 


CORRELATION BETWEEN NEED LEVEL AND SATISFAC- 
TION ON THE EIGHT DIMENSIONS WITH 
SIGNIFICANT INTERACTIONS 





Need group 
Total 
Scale group Zr 
(N & 113) “In” “Out” 
cluster cluster 
QN = 32). CV = 84) 
Authority —.01 223) —.23* 2.15* 
Company goals —.20* Al —.34%* | 2.13* 
Company image 344% SO .23* 2.04* 
Company policies —.06 43* —.20 3.05** 
and practices 
Security .03 .60** —.16 3.937% 
Social status —.16 -20 —.27* 2.21* 
Supervision—- .00 .25 —.09 1.59 
human relations 
Working —.01 48k —.20 3,34** 
conditions 





a Z values on Fisher's test for the difference between ‘‘in”’ 
cluster and ‘‘out"’ cluster correlation coefficients. 
— 505. 


i SS 
ED < 01. 
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study differed in need-satisfaction relation- 
ships on the dimensions of Authority, Com- 
pany Goals, Company Image, Company 
Policies and Practices, Supervision—Human 
Relations, Security, Social Status, and Work- 
ing Conditions. These are, for the most part, 
the dimensions most closely associated with 
satisfaction with the company itself. Among 
the 30 dimensions included in this study, the 
only “company” related dimension on which 
a significant interaction effect was not found 
was that of Company Prestige. Yet, even on 
this dimension the correlations between need 
and satisfaction were in a similar direction, 
being .15 for the in cluster and —.19 for the 
out cluster. 

These findings suggest that the two need 
groups might differ in their orientation toward 
the company. The findings might be inter- 
preted from the viewpoint of the “profes- 
sionally oriented” and “company oriented” 
dichotomy for research and development per- 
sonnel proposed by Schultz (1964). According 
to Schultz, the professionally oriented view 
the company merely as a means to achieve 
their broader professional goals and rewards, 
whereas the company oriented view the com- 
pany as the primary source of their achieve- 
ments and rewards. The in cluster in this 
study might correspond to the company- 
oriented type, viewing the company as the 
major source of need fulfillment. Thus, on 
the “company” dimensions their satisfaction 
increases and decreases with need level. In 
contrast, the out cluster might correspond to 
the professionally oriented type, or, at least 
it is not a company-oriented type. Its mem- 
bers apparently do not perceive the company 
as providing fulfillment of their needs. Thus, 
the higher their need level, the lower their 
satisfaction. These data seemingly support the 
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hypothesis of two types of research and de- 
velopment personnel who differ primarily in 
their orientation toward the company itself. 
Further study is necessary to test this hy- 
pothesis and to determine if this dichotomy 
can be found in other professional groups em- 
ployed in industrial firms. Further research 
also is necessary to determine the generality 
of the finding reported here that need type 
operates as a moderator of the relationship 
between need and satisfaction. 
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Comparative influences of fittingness and meaningfulness of the brand name and 
the utility of the product represented by the brand name on brand awareness 
were studied. An operational definition of fittingness and a method of selecting 
fitting brand names were proposed. 24 male and 24 female Ss 1st rated brand 
names from advertisements and then recalled the brand names. These adver- 
tisements included 3 levels of product utility, 2 levels of meaningfulness, and 
2 levels of fittingness following a factorial design. Brand awareness was better 
for (a) highly meaningful and (b) fitting brand names. Meaningfulness influ- 
enced brand awareness when brand names were nonfitting, but not when they 
were fitting. Effects of cultural factors on brand awareness were noticed when 
brand recall for 3 product-utility categories was analyzed. 


Recently, Kanungo and Dutta (1966) re- 
ported that brand awareness is better if the 
brand name is highly meaningful than if it is 
of low meaningfulness, and that brand aware- 
ness of high-utility products is superior to that 
of low-utility products. In addition, they sug- 
gested that the technique of measuring mean- 
ingfulness (Kanungo & Panda, 1964a, 1964b; 
Noble, 1961) of verbal labels could be fruit- 
fully used by the advertisers for the purpose 
of brand naming. As an extension of this 
earlier study conducted in India (Kanungo 
& Dutta, 1966), the present investigation at- 
tempts to test the generality of the above 
findings using subjects (Ss) belonging to a dif- 
ferent cultural milieu. 

The present study also examines the effects 
of the fittingness or the appropriateness of the 
brand name for the product on brand aware- 
ness. Fittingness of a brand name seems to be 
an important characteristic that might influ- 
ence the strength of brand-product associa- 
tion. To the present author’s knowledge, no 
study has been reported on the effects of fit- 
tingness variable on brand awareness. The 
main obstacle to the systematic investigation 
of the effects of this variable has been the 
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lack of any operational definition and an ap- 
propriate measure of fittingness of brand 
names. Many advertisers often try to select 
or reject brand names on the basis of their 
fittingness to their product, but such selections 
are mostly based upon their intuition or sub- 
jective feelings due to the absence of any 
operational measure. The present study, there- 
fore, tries to offer an operational definition 
and a measure of the fittingness characteristic 
of the brand name, so that systematic study 
of the effect of the variable on brand aware- 
ness may be possible. 

It is proposed that a brand name may be 
considered fitting or appropriate for the prod- 
uct if it can readily evoke associations to the 
product in the minds of the Ss. Thus, accord- 
ing to the present author, a brand name is 
fitting for the product if it resembles (in form, 
sound, or meaning) a word that belongs to 
one of the most frequent associations to the 
product. For example, if the product lipstick 
evokes the word “color” as one of the most 
frequent associations, then the brand name 
“Kolory” (apparently a nonsense word, but 
having resemblance in form and sound to the 
word “color”) may be considered a fitting 
brand name for lipsticks. However, the same 
nonsense word will be a nonfitting brand name 
for the product smoking pipe if the word 
“color” is only a rare association to the prod- 
uct smoking pipe. This implies that associa- 
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tion norms for different products may serve 
as a basis for selecting fitting and nonfitting 
brand names. Since a fitting brand name 
would resemble a common associate of the 
product, one might assume that the initial 
strength of the brand-product association 
would be greater in the case of fitting than in 
the case of nonfitting brand name, Thus with 
single or repeated exposures brand awareness 
would be better in the former than in the 
latter case. 


METHOD 
Selection of Brand Names 


Consideration of the fittingness, the meaningful- 
ness, and the product-utility variable manipulated in 
the present study influenced the selection of brand 
names. First, to manipulate the product-utility vari- 
able, there was a need to choose products that dif- 
fered in their utility for the consumers. Thus three 
different categories of products were selected, namely 
products used by males, by females, and by both 
sexes. Four commonly used products from each of 
these categories were chosen, making a total of 12 
products. 

Second, in order to choose brand names that dif- 
fered in their fittingness for the products, the follow- 
ing procedure was employed. Each of the 12 product 
names was printed on a separate sheet of paper and 
presented to a group of 19 male and 19 female col- 
lege undergraduate students. For each product, they 
were asked to give as many relevant associations as 
they could within a 30-sec. period. These Ss were 
instructed not to give any existing brand names as 
their associative responses. All the responses given 
by the group were found to be meaningful words. 
The most frequent (occurring 50% or more) and 
rarest (occurring only once) responses to each prod- 
uct were then determined from the responses given 
by the group. From these response words, two fitting 
and two nonfitting brand names for each product 
were constructed. One fitting and one nonfitting 
brand name were highly similar to a frequent and 
a rare response, respectively, and the other fitting 
and nonfitting brand names only partially resembled 
in sound or form a frequent and a rare response, re- 
spectively. The reason for making half the brand 
names more similar to the associative responses than 
the other half was to ensure that the former appear 
more meaningful than the latter. 

Third, to make sure that half of the brand names 
had higher meaningfulness value than the other half, 
another 25 undergraduate students were presented 
with these 48 brand names. These Ss were asked to 
rate each brand name on a 5-point meaningful— 
meaningless scale. The mean meaningfulness value 
(m') for each of the brand names was calculated 
from these ratings. The products, their frequent and 
rare associates, and the brand names along with their 
m’ are presented in Table 1. 
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Design and Materials 


It will be noticed from Table 1 that for each of 
the 12 products, there were four brand names: high 
meaningful-fitting (HM-F), low meaningful-fitting 
(LM-F), high meaningful-nonfitting (HM-NF), and 
low meaningful-nonfitting (LM-NF). Using each 
of these brand names, four different advertisements 
for each product were prepared. Twelve copies of 
each of these 48 advertisements were printed on a 
14 cm. % 22 cm. art paper. In each printed layout, 
a brand name following the word “use” appeared at 
the top, and below which a picture of the product 
was presented. Below the picture a short phrase 
containing the product name was printed. For ex- 
ample, in one of the nylon advertisements, at the 
top of the picture would appear “USE LEGS” and 
at the bottom of the picture would appear “for your 
Nylons.” Farther down the page, the name and ad- 
dress of the advertiser (a fictitious manufacturing 
concern) were given. 

Finally 48 booklets were compiled. Each booklet 
contained one advertisement of each of the 12 prod- 
ucts in such a manner as to include a HM-F, a LM— 
F, a HM-NF, and a LM-NF brand name for each 
of the three product categories: male-use, female-use, 
and used-by-both products. The sequence of the 12 
advertisements in each booklet was randomized to en- 
ure varied order of presentation to Ss. 


Subjects 


Twenty-four adult male and 24 female under- 
graduate students drawn from two Canadian uni- 
versities served as Ss. Their ages ranged from 18 to 
30 yr. The male Ss were drawn from St. Mary’s 
University, Halifax, and the female Ss were drawn 
from Mount St. Vincent University, Rockingham. 
These Ss had never participated in any kind of 
psychological experiment before. 


Procedure 


Each § was given a separate booklet and was 
asked to rate each brand name, in each advertisement 
in the booklet, on each of two 6-point scales: mean- 
ingfulness and appropriateness. The S was given 12 
answer sheets, one sheet for each advertisement in 
the booklet, to record his ratings. The two scales were 
printed on the answer sheets with the following 
verbal labels for the 6-points: very high, high, mod- 
erate, low, very low, and none. The S was instructed 
to indicate his or her rating by underlining the appro- 
priate verbal label in a scale. The order of presenta- 
tion of the two scales was counterbalanced for each 
S. To ensure that Ss paid proper attention to the 
brand names in the booklet, each S was instructed to 
write down the brand name he or she was rating at 
the top of the answer sheet in a blank provided for 
it. The Ss were made to believe that the manufac- 
turer intended to launch an extensive advertising 
campaign with a view to introducing these new prod- 
ucts into the markets. Hence the advertiser wanted 
to know the reaction in terms of the two scales of 
various individuals toward these advertisements. The 
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TABLE 1 
Propucts, Source Worps, AND BRAND NAMES WITH THEIR MEANINGFULNESS (m’) VALUE 
High meaningful Low meaningful 
Product name - 
Brand name m' Brand name m’ 
1. Nylons 
r “Legs” (Leg) 4.36 “Leget” (Leg) 1.28 
NF “Feel’’ (Feel) 4.12 “Supet” (Superior) 1.20 
2. Girdle 
F “Shapely”’ (Shape) 4.48 “Tyten” (Tight) 1.24 
NF “Stately” (Stately) 4.28 “Nyten” (Night) 1.28 
3. Lipstick 
EF “Colory” (Color) 2.36 “‘Peenkis” (Pink & Kiss) 1.32 
NF “Shiney” (Shiny) SOL “Crimist’’? (Cream) 1.40 
4, Eye shadow 
F “Blue-Lash”’ (Blue & Lashes) 3.36 “Loblu”’ (Blue) 1.36 
NF “Dark-Silk” (Dark & Silk) 3.88 “TLolid”’ (Lid) 1.28 
5. After-shave lotion 
F “Cooling” (Cool) 4.12 “Smelo’’ (Smell) 1.40 
NF “Healing” (Heal) 4.32 ““Cozic” (Cozy) 1.20 
6. Smoking pipe 
F “Manly” (Man) 4.32 “Tobax” (Tobacco) 1.36 
NF “Clean” (Clean) 4.48 “Nicot”’ (Nicotine) 1.56 
7. Briefcase 
EF “Executive” (Executive) 4.60 “Bizinet” (Business) Lez 
NF “Important” (Important) 4.32 “Promets” (Promotion) 1.48 
8. Shirts (men’s) 
F “Tie-Match” (Tie) 2.48 “Dreso” (Dress) 1.40 
NF “Party-Fit” (Party) 2252 “Mone” (Money) 1.36 
9. Writing pads 
F “Letters” (Letter) 4.32 “Leterit”’ (Letter) 1S 
NF “Economy” (Economy) 4.44 “Thicrap” (Thick) 1.24 
10. Pen 
F “Writer” (Write) 4.36 “TIncko” (Ink) L.3Z 
NF “Family” (Family) 4.68 “Metto” (Met) 1.40 
11. Adhesive tape 
F “Sticky” (Sticky) 4.24 “Bandes” (Bandaid) 1.48 
NF “Sporty” (Sport) 4.20 “Tandes’’ (Land) 1.40 
12. Folders 
F “Paper-Safe” (Paper) 2.88 “Filex” (File) 132 
NF “Camp-Sign” (Camp & Sign) 2.52 “Yirex’’ (Fire) 1.40 








Note.—Items 1-4 are female-use products, Items 5—8 are male-use products, and Items 9-12 are products used by both sexes. 
The bracketed word(s) next to each brand name is the associative response word(s) from which the brand name was derived. 


= fitting; NF = nonfitting. 


purpose of asking Ss to rate the brand names was 
twofold. First, through such ratings, Ss were given 
incidental exposure to the advertisements, so that 
later test of brand awareness would be possible. 
Second, Ss’ ratings would directly demonstrate if Ss 
perceived the meaningfulness and appropriateness 
characteristic of the brand names in a manner in- 
tended by the experimental manipulations. 

Each S$ was given approximately 1-1.5 min. to 
rate each brand name on the two scales. After S 
had finished rating all brand names from the booklet, 
the answer sheets and the booklet were collected. 
The S was then given a recall sheet on which all 
the 12 products were listed in alphabetical order and 
was asked to recall as many brand names as possi- 


ble and to write them beside the name of the product 
with which they were associated. 


RESULTS 
Ratings of Brand Names 


The first part of S’s task in this experiment 
was to rate each brand name for its meaning- 
fulness and appropriateness. There were 48 
brand names, half of which were HM and 
the other half LM brand names. Likewise 24 
of the 48 brand names were F and the rest 
NF brand names (see Table 1). The adver- 
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tisement booklets were distributed in such a 
manner that each of the 48 brand names was 
rated by 12 Ss (6 male and 6 female Ss) on 
6-point meaningfulness and appropriateness 
scales. The 6 points in each scale were as- 
signed ordinal weights of 5 (very high) to 0 
(none) for the purpose of deriving the mean 
meaningfulness and appropriateness value for 
each brand name. The S’s ratings indicated 
that the 24 HM brand names have signifi- 
cantly higher meaningfulness values (mean 
m'’ = 2.74) than those of the 24 LM brand 
names (mean m’ = 1.89, ¢= 3.40, p< .01). 
The appropriateness ratings also indicate that 
the F brand names were considered more 
appropriate (mean appropriateness value = 
3.07) than the NF brand names (mean ap- 
propriateness value= 1.72, += 4.65, p< 
01). This suggests that Ss did perceive the 
HM and LM brand names selected by the 
experimenter as having high and low mean- 
ingfulness, respectively. Likewise, they also 
perceived F and NF brand names, respec- 
tively, as more and less appropriate for the 
products. 


Effects of Fittingness and Meaningfulness 


In order to determine the effects of fitting- 
ness and meaningfulness variables on brand 
awareness, a 2 X 2 X 2 analysis of variance 
(Lindquist, 1953, Type VI design) was per- 


TABLE 2 


ANALYSIS OF VARIANCE OF RECALL SCORES 























Source of variation df MS FP 
Between Ss 47 
S’s sex (S) 1 eld << 
Error (b) 23 2.65 
Within Ss 144 
Meaningfulness (M) 1 17.52 23.67* 
Fittingness (I) 1 24.08 66.89* 
MXF 1 6.07 10.84* 
Mx<S 1 .96 1.30 
Bx Ss 1 Poh El 
MXFXS 1 35 at 
Error (w) 138 55 
Error: (w) 46 74 
Error2 (w) 46 36 
Error; (w) 46 56 
Total 191 

p< .01. 
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TABLE 3 


MEAN RECALL SCORES FOR FITTING AND 
MEANINGFUL BRAND NAMES 














Fitting Nonfitting t 
High meaningful 2.81 2.46 23 
Low meaningful 2.56 1.50 eOjas 
t 1.67 6.40** 
Note.—Means are based on the recall of all 48 Ss. 
*p < .05: 
¥ED < .O1. 


formed on the recall of brand names. The 
three classifications were: high and low mean- 
ingful brand names, fitting and nonfitting 
brand names, and male and female Ss. The 
two main effects of meaningfulness and fitting- 
ness and their interaction were highly sig- 
nificant (fs = 23.67, 66.89, and 10.84, re- 
spectively). These results are presented in 
Table 2. 

The mean recall of HM-F, LM-F, HM- 
NF, and LM-—NF brand names by all 48 Ss 
presented in Table 3 suggests that brand 
awareness is better for fitting than for non- 
fitting brand names and for high meaningful 
than for low meaningful brand names. Fur- 
thermore, these data suggest that meaningful- 
ness variable does not influence brand aware- 
ness when the brand names are fitting ones 
(the difference between HM-—F and LM-F 
recall yields a nonsignificant ¢ = 1.67). How- 
ever, when the brand names are of nonfitting 
type, the higher the meaningfulness the better 
is the brand awareness (the difference between 
HM-NF and LM-NF yields a ¢ = 6.40, p< 
01). This is the reason for the significant 
interaction between meaningfulness and fit- 
tingness. The male and female Ss did not dif- 
fer with respect to their recall of brand names 
(see Table 2). However, within each group 
the HM brand names were recalled signifi- 
cantly better than the LM brand names. For 
male Ss the recall means of HM and LM 
brand names were 5.27 and 3.87, respectively 
(t = 6.67, p< .01), and for the female Ss 
the corresponding means were 5.16 and 4.25, 
respectively (¢ = 4.33, p< .01). Similarly 
for both male and female Ss taken separately 
F brand names were recalled better than NF 
brand names. For male Ss the recall means 
of F and NF brand names were 5.21 and 3.83, 
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respectively (t =6.57, p< .01), and for fe- 
male Ss the corresponding means were 5.33 
and 4.08, respectively (¢ = 5.95, p < .01). 


Effect of Utility of the Product 


A 2 X 3 analysis of variance was performed 
on the recall scores of male and female Ss 
for the brand names of the three categories: 
male-use, female-use, and used-by-both prod- 
ucts. Out of the two main effects, the male 
and female Ss, and three product categories, 
only the latter approached significance (F = 
24,0) = 2 Oz Owe O5)ee Tne mean 
recall scores of the male Ss were 3.29, 3.12, 
and 2.87, and the corresponding recall scores 
of the female Ss were 3.17, 3.25, and 3.00 for 
male-use, female-use, and used-by-both prod- 
ucts, respectively. These results indicate that 
male and female Ss did not differ in their re- 
call patterns. The brand names for male-use 
and female-use products were recalled equally 
well by both the groups. There was a trend 
in both the groups to recall fewer brand names 
for the used-by-both products than those for 
the other two categories. 


DISCUSSION 


The above results clearly confirm the earlier 
findings that brand names having higher 
meaningfulness values are retained better than 
those having low meaningfulness value (Ka- 
nungo & Dutta, 1966). In addition, the re- 
sults lend strong support to the hypothesis 
that a fitting brand name derived from the 
association norms of the product is retained 
better than a nonfitting brand name. The fit- 
tingness variable seems to influence brand 
recall not only when brand names are low 
meaningful (¢ = 7.07, p < .01) but also when 
they are high meaningful (¢ = 2.33, p < .05). 
The meaningfulness variable, however, influ- 
ences brand recall only when brand names 
are nonfitting (f= 6.40, p< .01) but not 
when they are fitting (¢= 1.67, p> .05). 
Such interaction effects may interest the ad- 
vertisers who want to choose between two 
HM or LM brand names. However, further 
replications are needed before the advertiser 
can place his confidence in such interaction 
effects. 

In practice, many advertisers choose brand 
names that they consider intuitively as fitting 
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to their products. Very often, such intuitive 
decisions are based upon one or a few indi- 
viduals’ remote associative responses to the 
products. As suggested in this study, a better 
method of choosing fitting brand names would 
be to use the conventional laboratory word- 
association techniques (Cofer, 1958) with a 
sample of potential consumers. Generally, the 
association techniques have been used for the 
purpose of studying language and verbal learn- 
ing processes (Deese, 1959; Noble, 1952) and 
for clinical diagnoses (Jung, 1910; Kent & 
Rosanoff, 1910). The present study extends 
its usefulness to the selection of brand names 
by advertisers. 

The pattern of recall for brand names of the 
three product-utility categories does not con- 
form to the pattern of results reported by 
Kanungo and Dutta (1966). The authors re- 
ported that their male Ss recalled fewer brand 
names of female-use products and their fe- 
male Ss recalled fewer brand names of male- 
use products than what was expected of them. 
These findings supported their hypothesis that 
brand awareness of high-utility products is 
superior to that of low-utility products. In the 
present study, however, both male and fe- 
male Ss recall male-use and female-use prod- 
ucts equally well. These differences in the 
results of the two studies are presumably due 
to the difference in the culture patterns of 
Ss. The Ss used by Kanungo and Dutta 
(1966) were drawn from three Indian educa- 
tional institutions where males and females 
live in a more or less segregated fashion. The 
cultural standards of India impose greater 
restrictions on free mixing among young un- 
married male and female students compared 
to the North American cultural standards. 
Thus, for example, a female student in India 
would have less knowledge and concern about 
male-use products such as cigarettes or shay- 
ing blades. Likewise a male student in India 
would be less exposed to and interested in 
female-use products such as perfume or nail 
polish. On the other hand, the male and fe- 
male Ss used in the present study share each 
others interests to a greater extent due to 
greater exposure to each others interests in 
this culture. Thus it is not surprising that 
both male and female Ss in the present study 
may be showing equal interest in both male- 
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use and female-use products. Hence they re- 
call the brand names of male-use and female- 
use products equally well. These results there- 
fore suggest the possible role of cultural fac- 
tors in brand awareness, which need to be ex- 
plored more fully in the future. 

Finally, it may be recalled that a trend was 
noticed in both the male and female Ss to 
recall fewer brand names for the used-by-both 
products as compared to the brand names for 
either male- or female-use products. If it is 
assumed that the products in the used-by-both 
category, such as writing pads or adhesive 
tape, have less personal significance for both 
male and female Ss compared to the products 
in the other two categories, such as after 
shave lotion, nylons, or girdles, then the trend 
shown by the results are consistent with the 
hypothesis that brand awareness of high-util- 
ity products is better than that of low-utility 
products. The four products belonging to the 
used-by-both category may have been con- 
sidered somewhat less important and therefore 
of lower utility than the products belonging 
to the other two categories. Unfortunately the 
products were not rated on a utility scale by 
the Ss. Such ratings might have substantiated 
the above explanation. 
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AGE, CIGARETTE SMOKING, AND TESTS OF PHYSICAL FITNESS 
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University of Hawaii 


It was hypothesized that smoking would decrease Ss’ performance on physical 
fitness tests calling for strenuous activity, but not for tests calling for mild 
activity. 88 Ss were given 5 physical fitness tests as part of their work require- 
ment and then asked to report the number of packages of cigarettes smoked 
per day. A multiple linear regression analysis of the results, using age as a Cco- 
variate and amount of smoking as the independent variable, supported the 
hypothesis. The smoking conditions resulted in significant differences for the 
most strenuous test (1-mi. Run) and the 2 moderately strenuous tests (Crawl- 
ing and Dodge & Jump), with performance scores decreasing as reported 
smoking scores increased. The least physically demanding tests showed no sig- 
nificant differences between smoking conditions. Age was an important factor 


in accounting for Ss’ performance on all of the tests. 


The present study was undertaken in order 
to investigate the relationship between vary- 
ing amounts of cigarette smoking and several 
tests of physical fitness. If cigarette smoking 
affects an individual’s performance on tests 
of physical fitness, the effect is probably 
mediated by the decreased efficiency of the 
circulatory and respiratory systems. It would 
appear probable that smoking reduces the 
oxygen capacity of the lungs; therefore, dur- 
ing physical activity the muscles may be de- 
prived of sufficient oxygen, leading to feelings 
of fatigue and lowered performance. More 
strenuous activity would be expected to place 
a greater demand upon the circulo-respiratory 
system for oxygen than would mild activity. 
Thus, if smoking decreases the efficiency of 
a person’s circulo-respiratory system, one 
would expect that smoking would decrease 
one’s performance on tasks in which the 
circulo-respiratory system is of major im- 
portance, that is, tests in which vigorous 
physical activity is required. Conversely, one 
would expect that smoking would not affect 
one’s performance on tests in which the 
circulo-respiratory system plays a minor role, 
that is, tests in which there is minimal physi- 
cal activity. 

In a study of the relationship between 
smoking and physical fitness, it was hypoth- 
esized that smoking would decrease Ss’ per- 
formance on tests calling for strenuous ex- 
ercise, and that greater smoking would result 


1 Now at the University of Maryland, Far East 
Division. 


in a greater decrease in performance. For fit- 
ness tests calling for mild physical activity, 
smoking was expected to have a negligible 
effect upon Ss’ performance. Thus, for the 
One-Mile Run test, which was the most stren- 
uous test, smoking was expected to result in 
a greater decrease in performance than for 
any of the other tests. The Dodge & Jump 
test and the Crawling test were considered to 
be moderately strenuous, thus smoking was 
expected to result in decreased performance, 
but to a lesser degree than for the One-Mile 
Run test. The Climbing test was considered 
to be a mildly active test, thus smoking was 
expected to result in a slight or no decrease 
in performance. The Throwing test called for 
a minimal amount of activity, thus smoking 
was not expected to decrease performance. 


MrTHOD 


Subjects. The Ss were 88 military personnel rang- 
ing in age from 19 to 39 yr. The physical fitness 
tests were administered to Ss as part of their normal 
work requirement. During the 6 wk. preceding the 
testing, Ss performed moderate calisthenics for 2 hr. 
per week. The number of calisthenics was the same 
for all Ss, except that the older Ss (30-39 yr.) per- 
formed 10% fewer exercises than the younger Ss 
(19-29 yr.). 

Tests. The five physical fitness tests were: (A) 
Crawling test—S was required to crawl for a distance 
of 120 ft., sliding along the ground at all times. The 
S’s score was the amount of time required to crawl 
the distance. (B) Climbing test—S was required to 
swing from rung to rung on a horizontal ladder 
without touching the ground. The S’s score was the 
number of rungs traversed during a 1-min. period. 
(C) Dodge & Jump test—S had to run across a 
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TABLE 1 
SUMMARY OF REGRESSION ANALYSIS OF SMOKING FOR FitNEss TESTS 
r2 
Fitness test M SD af F 
Smoking x ; 
and age ABE 
Crawling 31.59 sec. 6.28 .660 596 5/81 oOne 
Climbing 45.18 rung 13.40 348 309 5/81 97 
Dodge & Jump 25.83 sec. 1.87 564 388 5/81 6.50** 
Throwing 23.93 pt. 8.07 .526 473 5/81 1.84 
One-Mile Run 519.41 sec. 86.01 644 Sif 5/81 13.09 
Age 25.83 yr. Or 








AN 


* pb 02. 
Ky < 001 


49-yd. obstacle course and return to the starting 
position. The S had to avoid the obstacles and jump 
over a ditch during the running. The S’s score was 
the amount of time taken to complete the course. 
(D) Throwing test—S had to throw five grenades 
from a kneeling position, in which he attempted to 
hit the center of a target area 90 ft. away. The S’s 
score was based upon the combined scores of the five 
throws. Performance on this test is primarily a func- 
tion of psychomotor skills, rather than demanding 
physical endurance. (E) One-Mile Run test—S was 
required to run around a 1-mi. grass track. The S’s 
score was the amount of time taken to run the dis- 
tance. This task is a physically demanding test, in 
which physical endurance is of primary importance. 

Procedure. The Ss were separated into four groups 
of 22 Ss each, assigned according to the alphabetical 
order of their last names. All four groups were given 
the Crawling test first and the One-Mile test last, 
with the other three tests given in a counterbalanced 
order. The order of the tests for the four groups 
was: ABCDE, ACBDE, ACDBE, and ADCBE. The 
Ss were instructed to complete the four timed tests 
as quickly as possible and to strive for accuracy in 
the Throwing test. 

After completing the tests, Ss were required to 
write down the number of packages of cigarettes 
smoked per day, according to one of six choices: 
(a) none, (b) less than 4 per day, (c) 4 to less than 
1, (d) 1 to less than 14, (e) 14 to less than 2, and 
(f) 2 or more. The number of Ss falling within 
each category were: a2=19, b=14, c=13, d=14, 
e= 14, and f= 14. 


RESULTS AND DIscussION 


The age of the Ss appeared to be correlated 
with their performance scores; therefore, 
Pearson coefficients between age and perform- 
ance were computed for the five physical fit- 
ness tests. The coefficients between age and 
the Crawling, Climbing, Dodge & Jump, 
Throwing, and the One-Mile Run tests were 











1/2, —.050, 025, —.688, and .598 (p< .01, 
df = 86), respectively. 

The results were analyzed by means of a 
multiple-regression analysis? (Bottenberg & 
Ward, 1963). A test for an interaction effect 
between age and the six smoking conditions 
was made for each of the physical fitness 
tests, with no significant differences occurring 
(p> .05). Since age was significantly cor- 
related with the fitness tests, and age and 
smoking showed no significant interactions, 
age was used as a covariate with amount of 
smoking as the independent variable. The 
analysis showed that there were significant 
differences between the smoking conditions 
for three of the five physical fitness tests (see 
Table 1). The smoking variable showed sig- 
nificant differences for the Crawling test (Ff 
SO UC} = O/ Oly p.ae02) athe mDodge. 7c 
Jumpetest (F = 6.56)92) =95/81) > — 001); 
and the One-Mile Run test (F = 13.09, df = 
5/81, p< .001), with no significant differ- 
ences found for the Climbing test and Throw- 
ing test (p > .05). 

In order to determine which of the smoking 
conditions were significantly different from 
each other, an analysis was made between 
each of the smoking conditions for the Crawl- 
ing, Dodge & Jump, and One-Mile Run tests; 
thus, 45 comparisons were made using age as 
a covariate. A comparison of the six smoking 
conditions for the Crawling test showed that 
11 of the 15 comparisons gave significant dif- 

2 Appreciation is expressed to Frank Ahern for the 


analysis and to the Computing Center of the Univer- 
sity of Hawaii. 


298 


ferences (p < .01) in the predicted direction, 
while the other four comparisons showed no 
significant differences (p > .05). The Dodge 
& Jump test showed 13 significant differences 
between the smoking conditions, with the two 
nonsignificant differences occurring between 
conditions $-1 and 1-14 packages (p > .05) 
and 1-14 and 14-2 packages (p > .05). The 
significant differences were in the predicted 
direction, with greater smoking resulting in 
greater decrease in performance. The One- 
Mile Run test showed significant differences 
between all 15 comparisons of the smoking 
conditions (pf < .01 for 14 of the comparisons, 
and p< .02 for the other). Thus, for the 
Crawling, Dodge & Jump, and One-Mile Run 
tests, an increase in self-reported smoking was 
associated with a decrease in performance. 
The analysis of the results supported the 
hypotheses. The smoking conditions had a 
highly significant effect upon the most strenu- 
ous of the physical fitness tests, the One-Mile 
Run test, with smoking accounting for ap- 
proximately 29% of the variance. Smoking 
also showed significant effects for the two 
moderately strenuous tests, with the Dodge & 
Jump and Crawling tests accounting for ap- 
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proximately 18% and 6% of the variance, re- 
spectively. Smoking showed no significant ef- 
fects for the two least physically demanding 
tests, the Climbing and Throwing tests. There- 
fore, the results support the position that 
smoking results in decreased performance of 
physical fitness tests which require strenuous 
activity, and with a decrease in the required 
endurance of the test there is a corresponding 
decrease in the effect of smoking. 

The age factor was of greater importance 
than smoking in accounting for the differences 
between Ss’ scores on the physical fitness tests 
(see Table 1). For the Crawling, Climbing, 
Dodge & Jump, Throwing, and One-Mile 
Run tests, the age factor accounted for ap- 
proximately 60%, 31%, 39%, 47%, and 36% 
of the variance (p < .001), respectively. 
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INCREASES IN EXERCISE COMPENSATION 
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113 American graduate business students recommended annual salary increases 
for 10 fictitious engineers, each with a different reason for being given a raise. 
The mean recommendation was 9.3%. Recommenders were more generous if the 
students were of lower intelligence and achievement, and if they held strong 
social and service rather than theoretical and economic values. They also were 
more willing to spend company money for other social and personnel purposes. 


Several recent reviews point to increasing 
interest in psychological aspects of pay 
(Adams & Jacobsen, 1964; Haire, Ghiselli, & 
Porter, 1963; Weick, 1966). A modest amount 
of available research has probed the effects 
of pay differentials on attitudes and perform- 
ance of those receiving the money, but little 
has been done to ascertain what motivates 
those who decide on what salaries will be paid 
when they respond to requests for salary in- 
creases and how their own ability and values 
affect their monetary decisions. These latter 
issues are the concern of this report. 


METHOD 
Subjects 


Forty-four percent of the 113 male graduate busi- 
ness students who participated in this study held 
undergraduate engineering degrees, the other 56% 
had earned undergraduate degrees in the liberal arts 
or in business administration. They averaged close 
to “B” in undergraduate grades and earned a mean 
of 566 on the Admissions Test for Graduate Schools 
of Business (ATGSB). 


Procedure 


During the fourth week of a course in behavioral 
science, each participant was involved in a series of 
small group exercises (Bass, 1965, 1967). One of 
these, Exercise Compensation (First ed.), required 
that each participant, by himself, make a set of 10 
salary recommendations prior to a group discussion 
about the matter. He was given the instructions 
which in essence were as follows: 


You will have to recommend increases for 10 
engineers in your company, all of whom are un- 
married, 22-year-old college graduates who have 
just completed their first year with the company 
and are now to be considered for an annual raise. 
Keep in mind that you may be setting precedents, 
that you need to keep down salary costs, that you 
need to consider equity, that you want to retain 


all the engineers if possible. Your company has no 

job evaluation schedule but tries to match com- 

munity and industry norms and to be competitive. 
The 10 engineers are as follows: 


Al, an average performer (50th percentile) ; 

Bill (90th percentile in merit) ; 

Charlie (10th percentile in merit). 

All the remainder are like Al in merit, but in 
addition: 

Dan is working on an insecure government con- 
tract. Because of his specialty, transfer is dif- 
ficult. 

Ed is on a “dead-end” job lacking advancement 
potential. 

Frank has a dirty, unpleasant, hazardous, uncom- 
fortable job. 

Garry has a low-prestige job in a low-prestige 
subsidiary. 

Henry has to work closely with a crew of uncon- 
genial, unfriendly, competitive co-workers. 

Irwin has a boring, dull, and monotonous job. 

Jim has a definite job offer from a competitor sub- 
stantially better than his current salary. 


Other Measures 


Of the 113 participants, during other weeks of the 
course, 112 completed the Allport-Vernon Study of 
Values and ranked their own life goals on the first 
edition of Exercise Life Goals (Bass, 1965). The 
goals were defined as follows: 


(a) Leadership: To become a community leader, to 
become influential in public affairs, to become in- 
fluential among co-workers. 

(b) Comfort: To have the time and means to en- 
joy life, to become happy and content, to have 
financial security, to be well-liked. 

(c) Creativity: To produce original and unique 
plans and ideas, to devise new approaches, to be 
expressive. 

(d) Expertness: To become an authority on a spe- 
cial subject in any field, to persevere to reach a 
hoped-for expert level of skill and accomplishment. 

(e) Technology: To make contributions to tech- 
nical or scientific knowledge, to invent apparatus or 
equipment. 
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TABLE 1 
RELATION BETWEEN TESTED INTELLIGENCE ACHIEVEMENT AND COMPENSATION DECISIONS 
Mean % 
ATGSB I 0 
p Undergraduate | increase in SD of 
Recommendations for races? pees recone 
Ls Qs Ts dations® ations 
Al (average) —.08 —.18* —.17* —,19* 8.02 2.62 
Bill (best) —.08 —.14 —.15 SS iks 11.96 3.78 
Charlie (poor) —.17* —.17* — .23* .00 5.55 2.38 
Dan (no security) — .09 —.11 —.15 = 40%! 9.83 3.24 
Ed (no opportunity) —.12 —.21* —.22* —.09 9.05 3.00 
Frank (poor conditions) —.01 —.24** —.16* —.04 10.18 3.08 
Garry (no prestige) —.10 —.21* —.20* —.14 9.49 3.16 
Henry (unfriendly co-workers) | —.05 —— eile lise —.13 9.72 3.10 
Irwin (uninteresting work) 00 —.19* —.12 —.14 9.83 3.26 
Jim (competitive offer) —.02 — .16* —.11 —.08 9.34 3.32 
Note.—ATGSB = Admissions Test for Graduate Schools of Business. L = Linguistic; Q = Quantitative; T = Total. 
aN = 101. 
bN =99, 
CU SS) 
* > <= .05. 
hp <=. 00: 


(f) Prestige: To become well-known, to obtain 
awards and recognition. 

(g) Service: To contribute to the satisfaction of 
others, to be helpful to others who are in difficulty. 

(h) Wealth: To earn a great deal of money, to 
build a large financial estate. 

(2) Independence: To be free to do what one 
likes, to be one’s own boss, to control others rather 
than be controlled by them. 


Exercise Objectives (Bass, 1965) also was com- 
pleted by the 113 participants. In this activity, after 
reading a company’s last annual profit and loss state- 
ment, each participant made five budgeting decisions. 
In one of these he had the option to spend or not to 
spend $225,000 to correct a safety hazard likely to 
produce a serious injury. In another, he could spend 
or avoid spending $150,000 to modify a plant that 
was polluting a stream. In still others, he had to 
spend money on managers, products, and to settle 
a strike, but in these cases his problem was on which 
way to spend the money to handle each case. These 
latter decisions were not expected to relate to salary 
recommendations and in fact they did not do so. 


RESULTS 
Mean Recommendations 


The mean increase awarded the 10 engi- 
neers by the 113 graduate business students 
(Table 1, column 5) was 9.3%. As is common 
with most other groups of managers and stu- 
dents tested here and abroad, the largest 
source of variance in recommendations was 
merit. Bill, of high merit, was assigned 1.5 
times the increase awarded Al, the average 


man, while Charlie, the man of low merit, 
was given approximately .7 as much as aver- 
age Al. The hypothesis has been advanced 
(Bass, 1967) that half of the increase given 
Al, the average man, will be denied to Charlie. 
This half will be added to Bill’s increase so 
that Bill gets 1.5 times as much as Al, and 
Charlie is left with only .5 of the increase 
given Al. But the hypothesis did not hold 
completely in Charlie’s case. Here, the ratio 
of mean recommendations that occurred for 
Bill: Al:Charlie was 3:2:1.5, rather than 
Sie lie 

As seen in Table 1, column 5, next to 
meritorious performance the participants were 
most willing to pay for poor working condi- 
tions (Frank) and were relatively least re- 
sponsive to Ed who lacked opportunities, al- 
though in comparison to what they awarded 
average Al, they were willing to give a 
premium for everyone from Dan to Jim who 
had special problems. 


Correlational Analyses 


The main purpose of this report is to see 
what personal factors influenced the recom- 
mendations. For this, correlational analyses 
will be presented in four sections. First, the 
extent that salary recommendations are as- 
sociated with the intelligence and achievement 


SALARY INCREASES 


of the recommender will be examined. Next, 
will be seen how the self-reported values of 
the recommender may enter the picture. 
Third, a look will be taken at how the recom- 
mender’s own stated life goals correlated with 
his recommendations for raises in salary. 
Finally, the author will note how generosity 
in recommending raises also may relate to 
willingness to spend company money on other 
personnel and organizational needs. 


Ability and Compensation Decisions 


Table 1 shows the product-moment cor- 
relations between the salary increases each 
participant was willing to award each of the 
10 engineers and each participant’s tested in- 
telligence and undergraduate record (columns 
1-4). It can be seen that differences among 
the participants’ decisions were unrelated to 
verbal aptitude as measured by the ATGSB, 
except possibly the salary decisions about 
Charlie, who was lacking in merit. However, 
a modest association was found between 
quantitative aptitude and the tendency to 
recommend smaller raises in salary. Partici- 
pants with greater tested quantitative apti- 
tude awarded smaller salary increases, par- 
ticularly (p < .01) when the increases were 
being justified by poor working conditions 
(ry = —.24). When verbal and quantitative 
scores were combined and total intelligence 
test scores were correlated with compensation 
decisions, the effect was most pronounced in 
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dealing with the salary increase for Charlie, 
the engineer low in merit. Those with greater 
tested intelligence were significantly likely to 
recommend a smaller raise for Charlie (r = 
Selon 

Perplexing was the significant correlation 
(p< .01) of —.25 between undergraduate 
quality-point averages and the raise awarded 
Bill, suggesting that achieving students recom- 
mended smaller salary increases for the most 
meritorious engineer. They awarded Al, the 
average engineer, a smaller raise also. There 
was little other evidence of the influence of 
undergraduate achievement on the other com- 
pensation decisions. 

To sum up, some tendency was seen for 
brighter and more academically successful 
participants to recommend smaller raises (or 
for the duller and less successful to recom- 
mend bigger salary increases). Quantitative 
aptitude was more of a factor than verbal 
aptitude. 


Values, Life Goals, and Compensation Decision 


The correlations among the Allport-Vernon 
Study of Values scale scores and the salary 
recommendations are shown in Table 2. It can 
be seen that holding certain values predis- 
posed recommenders to be generous, while 
strong values in other directions did the re- 
verse. Those with strong social values tended 
to recommend bigger raises in general to all 
engineers, but the effects were less marked 


TABLE 2 


RELATION BETWEEN ALLPORT-VERNON STUDY OF VALUES AND COMPENSATION DECISIONS 








Recommendations for 
Theoretical | Economic 
Al (average) — .23** — .38* 
Bill (best) —.13 —.18* 
Charlie (poor) —.20* — .38* 
Dan (no security) —.14 —.20* 
Ed (no opportunity) — .23** — .27** 
Frank (poor conditions) —.16* —.26** 
Garry (no prestige) —.29** ihre 
Henry (unfriendly co-workers) —.24** —.29** 
Irwin (uninteresting work) — .30** — .30** 
Jim (competitive offer) — .20* —.24** 








Note.—N = 112. 
*p <.05. 
ED <01. 





Allport-Vernon scores 











Aesthetic Social Political Religious 
.10 4 — .03 .16* 
—.09 220% 07 mili 
.08 oor —.06 li 
—.10 24% edd .10 
—.01 aa 04 14 
—.05 nLOe .09 al 9% 
.03 oe 04 os 
04 ne .08 n2 
.05 athe .09 alla 
02 LT .07 ilps 
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TABLE 3 
RELATION BETWEEN SELF-RANKED LIFE GOALS AND COMPENSATION DECISIONS 
Self-ranked life goals 
Recommendations for : Inde- 
ce Raa ta aaa Prestige | Service | Wealth | pend- 
| ship fort tivity ness nology are 
Al (average) .10 —.07 —.11 —.10 —.09 02 E200 —.03 01 
Bill (best) ike rae tem AK) —.13 — .03 09 2th — .06 —.10 
Charlie (poor) 09 eect —.09 —.00 —.12 —.00 ale —.03 -02 
Dan (no security) .10 —.08 | —.14 =A —.19* 06 Zhe —.00 .04 
Ed (no opportunity) oh? .09 nls 07 19* | —.01 One 02 02 
Frank (poor conditions) al —.07 —.11 —.07 —.13 —.00 pole —.04 —.04 
Garry (no prestige) 11 eae) aS aL OF .03 Dim 04 == 0% 
Henry (unfriendly | .09 —.06 | —.06 —.12 —.18* 03 SD ieee 02 06 
co-workers) 
Irwin (uninteresting 07 —.05 — .06 —.14 —.20* 02 e2iie —.00 .04 
work) me 
Jim (competitive offer) nor =—.03 —.15* | —.16* | —.21* .03 sain 02 .02 














Note.—Signs reversed. N = 113. 
Xp) <= 056 
RH <.01. 


when dealing with Bill (most meritorious), 
Frank (poor working conditions), and Jim 
(competitive offer). Evidently, lack of se- 
curity, opportunity, prestige, congenial co- 
workers, and interesting work weighed heavier 
as an excuse to grant bigger raises for recom- 
menders scoring high in social values. 

The correlations also were uniformly posi- 
tive between religious value scores and the 
size of salary recommendations, but they 


TABLE 4 


RELATION BETWEEN WILLINGNESS TO SPEND MoNnrEyY 
ON SAFETY AND STREAM CLEANING AND 
COMPENSATION DECISIONS 

















Spend on 
Engineer ee stream 
a fety 
ae clean-up 
Al (average) .07 .09 
Bill (best) 10 14 
Charlie (poor) .09 aibih 
Dan (no security) B28 el 
Ed (no opportunity) e20% mis 
Frank (poor conditions) .18* alia 
Garry (no prestige) sql siGll 
Henry (unfriendly co-workers) PN a13 
Irwin (uninteresting work) pox nGn 
Jim (competitive offer) Poe 13 
| 





Note.—N = 113. 
*p <.05. 
#kK bp < 01. 

















were generally lower. Strong social values 
played a greater role than strong religious 
values. At the other extreme, recommenders 
holding strong economic and theoretical values 
according to their self-reports were likely to 
offer smaller raises to the engineers. Those 
holding strong economic values were particu- 
larly unlikely to be generous to Al (average 
merit) and Charlie (low merit). Those hold- 
ing strong theoretical values were particu- 
larly unlikely to be generous to George (no 
prestige) and Irwin (unfriendly co-workers). 

Table 3 shows how self-ranked life goals 
were related to compensation recommenda- 
tions. Here one goal stands out. Recom- 
menders with relatively greater concern for 
service as an important life goal tended to 
be more generous—almost uniformly so. 

To a lesser extent, those concerned about 
technical success were likely to be less gener- 
ous than the average recommender. 

Different reasons for being generous may 
or may not be involved. For those high in 
economic value, it may be that the same 
amount of money is worth more and is con- 
sidered to be a larger raise than would be 
seen by those low in economic value. For 
those high in concern for theory, understand- 
ing, and technical success, less need may be 
seen to grant large increases in compensation 
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for effort. For those with strong concerns for 
social service and religion, pure generosity 
may be operative in their salary recommenda- 
tions. 


Willingness to Spend Money 


How general is generosity? Two of the 
five budgeting decisions on Exercise Objec- 
tives involved whether or not to spend money 
(not how to spend it) primarily for human or 
social rather than immediate economic re- 
turns. As can be seen in Table 4, there was a 
slight positive association between the size of 
salary increases recommended and a willing- 
ness to spend money on fixing a plant to end 
pollution of a stream. There seemed to be an 
even greater tendency to provide extra money 
for raises for all kinds of special reasons other 
than merit (7’s ranged from .18 to .23) and 
a willingness to spend money on safety equip- 
ment to avoid the possibilities of a serious 
accident. 


CONCLUSIONS 


Despite the observation that “Pay, in one 
form or another, is certainly one of the main- 
springs of motivation in our society” (Haire 
et al., 1963), little data are available on what 
bases decisions about pay are made. Many 
people would like to believe that highly ob- 
jective analyses produce highly objective job 
evaluations and merit rating and manpower 
plans which yield highly objective standards 
for establishing pay and pay raises. In fact, 
the author suggests that much subjectivity of 
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judgment is involved at every phase in these 
schemes and that it would be useful to de- 
velop some understanding of what may affect 
these judgments. In the present study, one 
sees that while merit overshadowed other 
factors in consideration by the recommenders 
of salary increases, at the same time, as might 
have been anticipated, the abilities and values 
of the recommenders played some role in 
what values they were willing to recommend. 

Higher salary increases were recommended 
by those lower in intelligence and achieve- 
ment, by those who held strong social and 
religious rather than theoretical or economic 
values, who were more interested in service 
rather than technical achievement and who 
also were more willing to spend their com- 
pany’s money for other personnel and social 
purposes. 
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Simulator sickness was hypothesized to be caused by the conflict between the 
visual presentation of apparent motion and the lack of any corresponding body 
sensation of motion. The hypothesis was tested by correlating individual differ- 
ences in scores on the Rod and Frame Test (RFT; which measures accuracy of 
adjustment of a rod to true vertical under conditions of visual-kinesthetic con- 
flict) and degree of simulator sickness. The data for Series 3 of the RFT and 
the indexes of sickness were best represented by hyperbolic functions yielding 
correlations of .40-.52. Implications for simulation technology and for a general 
conflict of cue theory are discussed with emphasis on supporting evidence 


from several areas of investigation. 


Since World War II, simulators have been 
developed which give the visual illusion of 
motion without any actual physical motion. 
Unfortunately, certain Ss became ill while 
operating these devices. The illness phe- 
nomenon was first intensively studied by Mil- 
ler and Goodson (1958, 1960), who labeled it 
motion sickness because the symptoms re- 
sembled those experienced by some people in 
moving vehicles. There was, however, no rea- 
son to prematurely so label this phenomenon 
motion sickness since motion is not involved 
in fixed-base simulators. As Tyler and Bard 
(1949) have pointed out, the primary cause 
of motion sickness is probably motion, and 
the failure to appreciate this fact can lead to 
confusion in conceptualization. In this paper 
the term “simulator sickness,” rather than 
motion sickness, will be used to denote the 
symptoms which occur in fixed-base simu- 
lators incorporating a moving visual scene. 

A number of hypotheses have been in- 
formally advanced to explain simulator sick- 
ness: distortion of vertical objects, rapid 
change in brightness, too much detail, poor 
resolution, excessive lag between simulator 
controls and corresponding shift in visual dis- 
play, high-frequency vibrations which disrupt 
accommodation, distance between the visual 
display and the observer such that accom- 
modation is different from that usually ex- 
perienced, and conflict between the apparent 
motion seen on the visual display and lack of 
any corresponding motion of the simulator. 


This last hypothesis was investigated in the 
present study. 

Several other conflict situations have been 
found to produce sickness. Wood (1895) de- 
scribed an amusement park device with a large 
immobile swing inside a movable room. When 
the room moved many individuals experienced 
considerable discomfort. Crampton and Young 
(1953) induced nausea in Ss seated in a fixed 
chair in the center of a rotating room. Dis- 
comfort lasted for some time after the ex- 
perience, for one S up to 2 days. This wide 
range of sickness time (from 0 to 48 hr.) is 
noteworthy, since all Ss were subjected to the 
same experience. 

While conflict between visual and body 
cues may be the dominant cause of simulator 
sickness, other factors such as type of simu- 
lator, fidelity of simulation, S’s experience, 
and S’s involvement are also important. Fixed- 
base simulators may have two types of visual 
displays: outside-in (O-I), such as a child’s 
remote control car, or inside-out (I-O), in 
which the operator views the scene as he 
would from inside a real vehicle. While sick- 
ness has been quite common with an I-O 
display, none has been reported for O-I simu- 
lators. Low involvement resulting from being 
unable to put oneself psychologically into the 
vehicle is probably the reason that O-I dis- 
plays do not cause sickness. A comparison of 
I-O and O-I displays (Matheny, Dougherty, 
& Willis, 1963) revealed that performance 
improved with an I-O display (but not O-I) 
when motion cues were added. Since the op- 
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erator “expects” motion in an I-O situation, 
cue conflict which leads to sickness and/or 
decrement in performance can occur when 
such expected motion is not present. 

Concerning S’s experience, Miller and Good- 
son (1958) found that 60% of the experienced 
helicopter pilots but only 12% of the student 
pilots tested reported simulator sickness. 
There was probably no cue conflict for the 
student pilots since they had not learned the 
particular body inputs which occur during 
helicopter operation. Fitts (1951) suggested 
that visual control was important when an 
individual first learns a perceptual-motor 
task, and proprioceptive feedback or “feel” 
becomes more important as experience in- 
creases. Fleishman and Rich (1963), for ex- 
ample, using predictors of a perceptual-motor 
task, found that a visual test correlated sig- 
nificantly with the task in early but not later 
trials. Conversely, a kinesthetic test cor- 
related on the later but not the early trials. 
This indicates that initially visual cues guide 
performance, but with experience individuals 
sensitive to proprioceptive cues can become 
most proficient. 

The importance of body cues was also 
demonstrated by a factor analysis of 12 driv- 
ing tests (Herbert, 1963). One of the main 
factors was labeled proprioception because of 
the high loading of a nonvisual driving task. 
Experience with the necessary proprioceptive 
feedback, then, appears to be necessary before 
cue conflict can result. 

Fidelity of simulation would be the degree 
to which the simulated conditions approach 
conditions of the real world. If the simulator 
were of low fidelity no conflict or sickness 
would result since Ss cannot become involved 
in the very unrealistic task. 

The Ss will also not become involved in 
the task if they have a “play set.” Their be- 
havior will not be pertinent to good driving 
performance with, for example, an attempt to 
crash the simulated vehicle. Low involvement 
yields no conflicting cues and therefore no 
simulator sickness. This conclusion is con- 
sistent with the finding that people do not 
become ill in amusement park simulators even 
though all the other necessary conditions may 
be present. 
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Recently Barrett and Nelson (1965d, 1966b) 
evaluated an automobile simulator which had 
all the aforementioned parameters: high 
fidelity, I-O display, and experienced and in- 
volved Ss. They found symptoms which were 
quite similar to those reported in previous 
simulator research, including cold sweating, 
upset stomach, vertigo, dizziness, nausea, 
feeling of faintness and disorientation. About 
half of the Ss became too ill to continue after 
only 5-10 min. Two Ss became so ill as to 
regurgitate. 

A research program was initiated to test 
the cue conflict and conflict sensitivity hy- 
pothesis, as measured with a Rod and Frame 
Test (RFT) apparatus following the field- 
dependence conceptualization of Witkin, 
Lewis, Hertzman, Machover, Meissner, and 
Wapner (1954). Field-independent Ss were 
deemed more sensitive to body cues than field- 
dependent Ss. On this basis it was predicted 
that the field-independent Ss would experi- 
ence more discomfort in the simulated situa- 
tion than field-dependent Ss. 


MrtTHOD 
Automobile Simulator 


An unprogrammed automobile simulator was the 
basic research tool. A terrain model, an 87:1 (HO 
gauge) scale representation of several flat roads, sup- 
plied the visual scene. Mounted above the terrain 
model was a television camera with motions in di- 
rect response to the movements of the brake, ac- 
celerator, and steering wheel of the automobile. Thus, 
S had complete control over the part of the ter- 
rain model that the camera traversed. 

A projected image visual display was used with 
half of the Ss and a virtual image display with the 
remainder. Both visual displays gave the driver ap- 
proximately a 50° horizontal angle view of the ter- 
rain model and a center resolution of approximately 
500 lines. There was no significant difference between 
the displays in the percentage who left the simulator 
because of discomfort. Therefore, the two displays 
were considered to be functionally equivalent. With 
each visual display the driver sat in the automobile 
and performed the usual control movements as- 
sociated with driving a car. A more detailed descrip- 
tion of the simulator and associated visual displays 
has been reported by Barrett and Nelson (1965c, 
1966a). 

Perceptual style was measured with a standard 
RFT (Witkin et al., 1954). The apparatus consisted 
of a luminous frame (40 in. sq.) pivoted at its 
center so that it could be tilted left or right. Pivoted 
at the same center, but moving independently of 
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TABLE 1 


RELATIONSHIP BETWEEN PERCEPTUAL 
STYLE AND DISCOMFORT 


| Rod and frame measures 
Measure of | 








discomfort | 

| 1/S; | 1/S» | 1/S3 | 1/OI 
Dicsntort || te ets 40* 23 
Illness after ee poi 2's .44* 
No. trials —,.22 —.11 | —.45** —.25 
Stayed left —.18 —.14 | —.47** —.26 





Note.—Si =Series 1; Se = Series 2; Ss = Series 3; OI = Orien- 
tation Index. 


the frame, was a luminous rod (39 in.). The S was 
seated 8 ft. from the rod and frame in a chair which 
could be placed in three positions: erect, tilted 28° 
to the left, or tilted 28° to the right. Testing was 
done in a completely darkened room and S wore 
goggles with dark lenses so that he could see only 
the rod and frame. 


Subjects 


A random sample of 50 male Ss between the ages 
of 30 and 45 were selected from approximately 1,200 
employees in an engineering division of an aerospace 
corporation, Approximately 6 mo. after the comple- 
tion of the simulator evaluation, 46 Ss (23 from each 
display) were able to be recontacted and tested with 
the RFT. 


Procedure for Automobile Simulator 
Investigation 


Each § drove three orientation trials around the 
terrain model followed by a pretrial run for a study 
of driving at requested speed. After making the 
tenth and final speed judgment, S was exposed to an 
emergency situation where he had to stop for a 
suddenly emerging pedestrian dummy. The pro- 
cedures have been described in greater detail by 
Barrett and Nelson (1965a, 1966b). During the evalu- 
ation of the simulator, S was observed by the ex- 
perimenter. If S complained of discomfort he was 
told that he was free to leave at any time. 


RFT Procedure 


Approximately 6 mo. after the data emergency be- 
havior study, the perceptual style of Ss was measured 
following the standard procedure (Witkin et al., 
1954). Series 1 (Si) of the RFT consisted of eight 
trials in which S and the frame were tilted 28° in 
the same direction; Series 2 (Sz) were eight trials in 
which S and frame were tilted 28° in opposite di- 
rections; Series 3 (Ss) consisted of eight trials with 
the frame tilted 28° to the right or left while S re- 
mained upright. The S’s task was to position the rod 
to what he considered to be true vertical by asking 
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the experimenter to move the top of the rod right 
or left. The S’s score was the number of degrees in 
error in each series. In addition, an Orientation In- 
dex (OI) was computed from the standard scores of 
the three series. 


RESULTS 


Four measures of discomfort were com- 
pared to RFT scores. The first measure was 
S’s rating of discomfort, using a 0-10 graphic 
rating scale. The scale was part of a 10-item 
questionnaire concerning the simulator which 
was administered 6 mo. after the simulator 
study. A second questionnaire measure was 
S’s estimate of the length of time after leaving 
the simulator that the discomfort persisted— 
termed “illness after.” Responses to this ques- 
tion ranged from zero illness to 48 hr. Third, 
the number of trials S was able to remain in 
the simulator was used as an index of dis- 
comfort. The range was from 1/4 trial to 
completion (14 trials). Fourth, Ss were di- 
vided into two categories: those who com- 
pleted all trials and those who did not. Twenty- 
three of 46 Ss were able to complete the 
simulator study. 

Linearity was approximated by reciprocal 
transformations of the perceptual style mea- 
sures. Correlations are shown in Table 1! 
where it can be seen that the only consistently 
significant relationships were between S3 and 
the four measures of sickness. 

Table 2 shows an apparent threshold phe- 
nomena. The Ss were classified according to 
an adult standardization sample for S3 (Wit- 
kin et al., 1954) with those who were either 
1 standard deviation above or below the mean 
labeled extreme field dependent or extreme 


TABLE 2 


RELATIONSHIP BETWEEN LEAVING SIMULATOR AND 
PERCEPTUAL STYLE CLASSIFICATION 





ON SERIES 3 
Hanene Hida | “Field fos a 
Subject tee inde- | depend- a 
epend- 
pendent pendent ont ent 
Left simulator 12 6 2 3 
(N = 23) 
Remained in 21 2 
simulator 
(CNig==925)) 





PERCEPTUAL STYLE AND SIMULATOR SICKNESS 


field independent. All the extremely field-in- 
dependent Ss left the simulator. 

In order to determine if other aspects of the 
RFT test would significantly add to the re- 
lationship, multiple-regression equations were 
calculated using x, x?, 1/x, in x-data trans- 
formations of S;, Se, Ss, and OI. No measures 
were found to add significantly to the variance 
accounted for by Ss. 

Mention should be made of the possible rea- 
son for Ss being the only measure which was 
related to simulator discomfort. Witkin et al. 
(1954) statistically analyzed 10 perception 
subtests and found three general groupings. 
Ss was not in the same grouping of perception 
tests as were S; and So, indicating that the 
RFT taps at least two perceptual factors and 
that only one of them was related to simu- 
lator sickness. 


DISCUSSION 


If the conflict of cue hypothesis is correct, 
an obvious remedy to the simulator sickness 
problem is to introduce a certain degree of 
physical motion into the simulated system. 
An interesting question is the degree of mo- 
tion required to give the necessary body cues. 
Simple random vibration may be enough to 
eliminate the cue conflict, a possibility having 
considerable practical and economic import 
for the simulation art. 

Besides the specific cue conflict when mo- 
tion is lacking, the results may have implica- 
tions for motion sickness research, in which 
case motion is present with inappropriate or 
missing visual cues. This is supported by the 
finding that those who experience motion 
sickness were also likely to experience sick- 
ness in a conflict situation (Crampton & 
Young, 1953). It has been reported (Clark, 
1963) that some pilots become disoriented 
while flying in extreme haziness or cloud 
covering. They may be receiving adequate 
motion cues but not the corresponding visual 
inputs. In much the same manner, while flying 
under instrument conditions (again motion 
but no vision), some pilots become dis- 
oriented and mistrust their instruments. 

Simulation of zero gravity in aircraft pro- 
duced illness for 50-70% of the Ss tested by 
Loftus (1963). It is possible that cue con- 
flict experienced in this unusual situation in- 
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duced the illness. The finding that labyrin- 
thine-defective Ss (fewer body cues) showed 
no signs of zero gravity sickness, while 64% 
of the normal Ss did, supports this hypothesis 
(Kellogg, Kennedy, & Graybiel, 1964). 

Wendt (1951) concluded that the vestibu- 
lar apparatus per se causes motion sickness 
since people with no vestibular sensitivity do 
not become sick. An alternate conclusion 
might be that, since labyrinthine-cue-related 
body sensitivity is low for deaf people, it is 
less likely that they will experience any con- 
flict in cues. 

It is possible that both cue conflict and 
vestibular stimulation are important, but under 
different conditions. Walsh (1962) oscillated 
Ss in the horizontal plane. At 1 cps the Ss 
correctly felt they were traveling in a given 
direction. When the oscillations were 1/3 cps 
or slower, the sensations of moving were in 
anticipation of the motion, there being a 
phase advance. Sensations of motion in one 
direction were frequently aroused when the 
person was still traveling rapidly in the op- 
posite direction, thus conflict. It is possible 
that motion sickness may have different causes 
depending upon the rate of motion. At slow 
rates the conflict of cues may apply; with 
greater accelerative forces the associated 
symptoms might be due to excessive stimula- 
tion of the vestibular mechanism. 

The fact that pilots and drivers rarely be- 
come sick and passengers often do (Tyler & 
Bard, 1949) can be explained by the conflict 
of cue hypotheses. Since the driver receives 
direct feedback from the vehicular controls, it 
is understandable that he does not experience 
conflict, while the passenger has no such 
referent and may become ill. 

The question remains as to why certain 
field-dependent people became ill in the simu- 
lator situation. Other variables such as the 
physical condition of Ss may have had some 
influence. Another explanation can be found 
from early perceptual style research. Witkin 
et al. (1954) stated that some Ss experienced 
great difficulty in making a judgment on the 
Body Adjustment Test. They appeared to be 
influenced by the visual scene but also aware 
of body position. Being unable to consistently 
utilize body position, their responses were 
quite variable. They eventually become dis- 
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oriented, with some experiencing physical dis- 
comfort akin to simulator sickness. The field- 
dependent Ss who function in this manner 
may also experience simulator sickness. To 
probe this possibility the 10 most-field-de- 
pendent Ss were compared as to variability of 
responses. Of the five sick Ss, four were ex- 
tremely variable (range = 20°) and four of 
the five nonsick Ss were very consistent in 
their responses (range = 5°). While the re- 
sults were suggestive, a large subsample would 
be required for statistical confirmation. 
Although cue sensitivity in terms of per- 
ceptual style appears to explain the extreme 
discomfort that some people experience, Wit- 
kin, Dyk, Faterson, Goodenough, and Karp 
(1962) recently discussed perceptual style in 
terms of being able to extract an item from 
an embedded context. The results of the study 
could be explained in these terms also. In a 
simple laboratory study of kinesthetic sensi- 
tivity where cues are isolated there may be 
no differences between field-dependent and 
field-independent Ss. However, in the com- 
plex simulator (an embedding context) the 
field-independent person may be more aware 
of the cues which are in conflict (i.e., can 
disembed them), and thus he becomes ill. 
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EVALUATION OF A NEW HYBRID IN 
PROGRAMMED INSTRUCTION 


D. H. SCHUSTER 1 


Iowa State University 


A new format of programmed instruction similar to the branching style of 
Crowder was developed for the subject matter of electronic test equipment. All 
of the answers were explained on the page following a question. This hybrid 
was evaluated and compared with the conventional scrambled format in a 
field tryout with electronic technicians. There was no significant difference in 
achievement between the 2 formats, whether additional participation by the 
use of preperforated punch cards was considered or not. There was a tre- 
mendous opinion difference in favor of the linearized version afterward, re- 
gardless of whether the student had actually taken the linear format or not. 


The aim of this research was to try out with 
electronic technicians a new format in pro- 
grammed instruction and compare it with a 
‘conventional scrambled book or Crowderian 
format. A secondary question was to evaluate 
the effectiveness of additional participation in 
the program by means of overt responses. 

One type and format of programmed in- 
struction, due to Crowder (1960), is the fol- 
lowing: A page of instructional material is 
ended by a question with several multiple- 
choice answers. Only one of these choices is 
correct. The choices are answered and dis- 
cussed on separate pages distributed at ran- 
dom throughout a given chapter. If a student 
selects a wrong answer, the wrongness of that 
particular answer is discussed in view of the 
instructional material and the student is 
usually returned to the original instructional 
page to reread it and to try again. If a student 
selects the correct answer, he is told that this 
is the correct answer and an explanation is 
given. Then the student goes on to the next 
page of instructional material with its ter- 
minating question and so on through a chap- 
ter. A book constructed this way is said to be 
“scrambled” or “self-tutoring.” 

From a preliminary study with a scrambled 
book on electronic test equipment, approxi- 
mately half of the students stated that they 


1A note of thanks is due C. J. Draker of Collins 
Radio Co., Cedar Rapids, Iowa, for his help in the 
preparation, training, and data-collection phases of 
this project. Appreciation also goes to Ken Thomp- 
son, United States Naval Training Device Center, for 
his help in preparing materials. 
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would like to see the scrambled book un- 
scrambled or linearized. The writer accord- 
ingly modified the scrambled book program 
such that all of the answers, correct and in- 
correct, were listed and explained on the 
page immediately following the instructional 
page with its ending question. Thus the stu- 
dent progressed through a given chapter 
linearly instead of randomly throughout a 
given chapter. This feature was felt to fa- 
cilitate the student’s participation. The key 
question to ask was whether the linearized 
(unscrambled) version of the same material 
would be as effective in helping the student 
learn the material, as was the scrambled-book 
version with its emphasis on participation 
and concentration throughout a given chapter. 
The student could not half-learn the material 
in a scrambled-book version; he had to par- 
ticipate to learn each page of instructional 
material before he could leave the page. The 
student could skim over the material in the 
linearized version and thus not learn it thor- 
oughly. The hypothesis was as follows: The 
students would show superior learning on the 
scrambled-book version compared to a linear- 
ized version of exactly the same instructional 
material. 


METHOD 


The programmed instructional material was used 
as a self-study course with electronic technicians 
going through the material on their own time at 
home. There were no classes during work hours, but 
the technicians were encouraged to ask questions 
about the same test equipment in the lab at work 
and to familiarize themselves with its use. Pre- and 
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TABLE 1 
List AND DESCRIPTION OF VARIABLES 
Variable | Level 

Fixed | 

Sequence of forms in pretest to posttest | 2 

Book format, linearized or scrambled 2 

Card participation, preperforated IBM cards | 
Random 


On 


used to record answers, or not 2 
Subjects, electronic technicians | 





posttests were developed and given to measure 
achievement gain. In addition to developing the 
linearized version, the scrambled version was re- 
vised on the basis of previous tryouts. 

The experimental design follows (Winer, 1963). 
(See Table 1.) Two levels of three independent 
variables were used in a factorial design with five 
subjects (Ss) per cell. The first independent variable 
was achievement test sequence: this was done to 
control for a known minor difference in test diffi- 
culty. The second independent variable was the 
format, linear versus scrambled; half of the students 
got one version, and the balance the other. The next 
independent variable was student participation, to 
check on the hypothetical superiority of the 
scrambled-book version. The participation condition 
had half of the students using IBM preperforated 
cards for the recording of answers to the book ques- 
tions, whether they were using the linearized or 
scrambled version. The rest of the students had no 
such cards and thus presumably got themselves less 
actively involved in the book. 

“Subjects” was the last independent variable. The 
Ss were all electronic technicians who had been em- 
ployed for a varying number of years at Collins 
Radio at Cedar Rapids, Iowa and had different ex- 
periential and educational backgrounds. The Ss were 
assigned serially to one of the eight treatment condi- 
tions as they became available to the study. It was 
hoped to get six Ss for each of the eight treatment 
cells, but several people were not able to complete 
the book and the posttest satisfactorily. Luckily these 
people with incomplete records did not occur in the 
same treatment group so that in the final analysis 
there were a total of five Ss for each of the eight 
treatment cells. Because of a limited number of 
text books, it was desirable to split the training-test- 
ing phase of the experiment into two sections. Ac- 
cordingly about half of the test Ss were trained and 
tested in the summer of 1966 and the second half in 
the winter of 1967. Data from the two sections were 
pooled. 


RESULTS AND DISCUSSION 


The median time for technicians to com- 
plete the book on their own time was 12 hr., 
with extremes ranging from a minimum of 9 
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to a maximum of 40 hr. The use of card par- 
ticipation did not particularly affect these 
times. The time involved in pretesting was ap- 
proximately 1 hr. on company time with an- 
other hour likewise in posttesting. 

An analysis of variance was made of the 
gain in test scores from pretesting to post- 
testing. The posttest scores themselves could 
have been analyzed but were subject to pos- 
sible sampling biases in pretest technician 
ability in the eight treatment cells. This could 
have been corrected by an analysis of co- 
variance with the two covariables of pretest 
score and format. The use of gain scores ap- 
peared simpler. Consequently, the analysis 
basically rests on the gains in test scores, as 
shown in Tables 2 and 3. 

Table 2 presents the analysis-of-variance 
summary for the test gain scores. The basic 
datum for each S was to subtract his pretest 
score from his posttest score and then to 
analyze these resultant gain scores. The only 
significant treatment in Table 2 was that for 
test sequence. This was expected as the pre- 
liminary studies had shown that Form A was 
harder than Form B. This is reflected in Table 
3 where it is seen that a greater gain occurred 
when Form A was given as pretest and Form 
B as posttest. This, of course, is to be ex- 
pected if Form A were harder than Form B. 
Neither the main effect of the book format 
nor of card participation was significant. Nor 
were any of the interactions significant. This 
lack of significance is also reflected in Table 
3 where the averages are given for the treat- 
ment conditions. For the difference in book 


TABLE 2 


ANALYSIS-OF- VARIANCE SUMMARY FOR 
Test GAIN SCORES 





























| 
Treatment and effect SS) | d/ | MS RB 
fet) 
Test sequence (A) 168.10) 1 | 168.10 | 13.74* 
Book format (B) 14.40} 1 | 14.407) dis 
Card participation (C) 0.90; 1 0.90 | 0.07 
AXB ata) |) th 8.10 | 0.66 
ASE 14.40} 1) 1440] 1.18 
BSGC a, Free 4.90] 1} 4.90] 0.40 
AXBXC 10.00} 1] 10.00] 0.82 
Within er 391.60 | 32 | 12.24); — 
Total * 612.40 | 39 
* p< 01, 
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TABLE 3 


SUMMARY OF GAIN SCORES BY TREATMENT 











Treatment Average* 

Test sequence 

Form A first and B last 200) 

Form B first and A last 125 
Book format 

Linearized 3.90 

Scrambled 2.70 
Card participation 

No cards used to record answers Sal 

IBM cards used to record answers 3.50 








Note.—Form A = linearized; Form B = scrambled. 
aN = 20 data per treatment average. 


format, linearized versus scrambled, not only 
was the scrambled-book format not superior 
to the linearized-book format, as hypothesized, 
but a little bit of the reverse happened. While 
insignificant, the linearized format had a 
slightly greater gain achievement than did 
the scrambled format. The interpretation can 
be only a conjecture at this point. The linear- 
ized format with its facilitation of review and 
easy coverage of all answers was just as ef- 
fective as the scrambled-book format with its 
somewhat greater participation supposedly. 
Clearly participation is not part of the answer 
as the effect of card participation was in- 
significant although slightly in the expected 
direction; that is, students using cards to 
record their answers did have a slightly higher 
achievement gain than students without the 
card participation. Or perhaps greater ir- 
ritation went with the participation in the 
scrambled book. 

Two other studies are relevant at this point. 
Brown (1966) varied the location of the 
answer in four different versions of the same 
programmed instructional package. Typical 
answer locations were “the next line down” 
or “on back of the page.’’ He found that the 
formats with an easy opportunity to cheat re- 
sulted in less achievement than where the 
opportunity to cheat was less frequent or 
harder. Kapel (1966), on the other hand, 
found that cheating did not have this detri- 
mental effect. Cheating, defined in terms of 
opportunity to review material or to search for 
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answers, actually appeared to be superior to 
where searching was not permitted. 

The linearized format used in this study 
had an easy opportunity to cheat in the 
sense of being able to search over the follow- 
ing page over all of the answers and pick out 
just the right one. The person thus could 
fool himself as well as the experimenter. How- 
ever, this in itself may have been helpful. A 
number of students with the scrambled format 
commented that they looked up the wrong 
answer sometimes just to find out why it 
was wrong even though they knew so ahead 
of time. In other words, even students utiliz- 
ing the scrambled format learned from look- 
ing at the wrong answers or “cheating” in 
this narrow sense. 

In summary the two formats were equally 
effective in presenting the material, and the 
reasons for this are not known. Certainly the 
reason does not appear to involve the sup- 
posedly greater participation on the part of 
the students using the scrambled-book format. 
Perhaps the answer does lie in the easy avail- 
ability of all answers in the linearized format 
for review and optimal search purposes. 

There was one methodological flaw in the 
present study, namely that there was no con- 
trol group used to evaluate the test-retest 
effect itself. It is fairly well known that a 
person will improve on an achievement test 
if he takes the same test of factual knowledge 
on a second occasion. This is not the case 
with esoteric subject matter as has been dem- 
onstrated in other studies. The assumption 
was made in this study that the amount of 
improvement in knowledge about electronic 
test equipment would be very slight in the 
5- or 6-wk. period between testing unless S 


TABLE 4 


FORMAT PREFERENCE VERSUS FORMAT 
STUDIED BY TECHNICIANS 








Format preference* 


Format studied 








Linear 2 Scrambled 
Linear 23 2 1 
Scrambled 18 4 3 
Totals 41 6 4 





a x2 = 4.23, p <.05 by grouping ‘‘?"’ and ‘‘Scrambled.”’ 
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had indeed gone through the textbook. If 
the study were to be redone, it would be ad- 
visable to include a control group to evaluate 
the test-retest effect over a comparable time 
period even though it might be hard to obtain 
Ss for this purpose. 

Reader preference for one book format 
versus the other is a final interesting consid- 
eration. As students came in to take the post- 
test, they were asked to fill out a short ques- 
tionnaire. One of the questions was ‘‘Read an 
item or two in the other book format than the 
one you had. Which do you prefer, linear or 
random? Why?” The student tabulations to 
this question are given in Table 4. Ignoring 
the question mark response, these results 
show an overwhelming 41:4 opinion prefer- 
ence in favor of the linear format. The sig- 
nificant chi-square value means that students 
going through the linear book preferred it 
more than did students going through the 
scrambled book. Even though there was no 
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difference in achievement between the two 
book formats, the tremendous opinion prefer- 
ence for the linearized version has a very 
practical consideration in terms of student 
usage. Consequently, the decision was made to 
go ahead and publish the linear edition 
(Schuster, 1968). 
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ABSOLUTE SCALING OF JOB PERFORMANCE: 
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The logic and development of a technique for evaluating job performance 
directly in terms of organizational objectives without reference to normative 
materials are described. The technique involves the establishment of critical 
performance levels, as derived from a Guttman scale, in terms of stated or- 
ganizational objectives and relating performance to these levels. The retest re- 
liability, internal consistency, predictability, and concurrent validity of the 


technique are described. 


Throughout the history of psychological 
and educational measurement, emphasis has 
been placed on the use of norms for interpret- 
ing test scores. It has been commonplace to 
encounter the statement that a given raw 
score on a test is meaningless until it is com- 
pared to scores made by other people. Text- 
books have usually stressed the use of per- 
centiles, standard scores, age norms, etc. 

Several recent developments, however, have 
suggested that there are other, less norma- 
tively oriented methods for deriving meaning 
from test scores. For example, the elaboration 
of scaling theory has drawn attention to the 
internal relationships of the items of the 
score scale itself, somewhat aside from the 
people from whom the original score data 
were obtained. Also, systems technology has 
concentrated on the importance of the func- 
tioning of the system as a whole and has 
evolved concepts of personnel subsystem de- 
sign and evaluation which relate human per- 
formance directly to system performance. In 
this context, Glaser (1962) has contrasted 
the use of “criterion-referenced” measures, 
which tie scores directly to a given criterion 
behavior, with “norm-referenced” measures, 
which are interpreted in terms of the score 
distribution of a representative group of peo- 
ple. With the growing sophistication concern- 
ing the problem of assigning meaning to test 
scores, even current standard textbook treat- 


1The research reported in this paper was per- 
formed under Contract Nonr-4153(00) between the 
Office of Naval Research and Applied Psychological 
Services. The cooperation and encouragement of 
Victor Fields, Sidney Friedman, Charles Hodges, and 
Martin Wiskoff, Bureau of Naval Personnel, are 
gratefully acknowledged. 

2 Now at Case Western Reserve University. 


ments of the issue include mention of such 
matters as types of scales and expectancy 
tables (e.g., Cronbach, 1960, pp. 69-73). 

The purpose of the research reported here 
was to develop a performance appraisal pro- 
cedure which would relate each individual’s 
score to a standard, external to the body of 
personnel, that is, a standard defined by the 
criterion performance required in order for 
the particular element of the industrial or 
military organization to accomplish its or- 
ganizational objectives. 


MeEtTHOD 


The job constellation under study was the family 
of enlisted positions associated with naval air elec- 
tronics (avionics) work, and the method employed 
consisted of the following steps: 


1. Formulation of statements of the specific sys- 
tem objectives of naval air electronics and determina- 
tion of the importance of their respective contribu- 
tions to system requirements; 

2. Establishment of critical points on a Guttman- 
type job-performance scale, the critical points repre- 
senting different levels of satisfying the objectives; 

3. Estimation of the reliability of the instrument, 
derived in Step 2, through fleet administration in six 
operational squadrons; 

4. Investigation of the extent to which the cri- 
terion instrument, developed in Steps 2 and 3, is re- 
lated to various Navy predictor instruments and a 
job-sample performance test. 


Definition and Importance of Objectives 


A preliminary list of the objectives of the naval 
avionics personnel subsystem was prepared following 
a search of a wide variety of documents concerned 
with naval and electronic maintenance activities. 
The search included examination of such sources as 
bulletins of training course offerings, including pre- 
requisites and purposes of the courses, volumes giving 
descriptions of the duties of the various occupational 
specialties, and equipment operation and mainte- 
nance manuals. Throughout such literature there 
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were found both direct statements and inferential 
comments about the avionics personnel subsystem’s 
mission. As a result of this search, four major ob- 
jectives were drawn up which appeared to encom- 
pass the mission of electronics activities carried out 
in the Navy’s air arm. These four objectives con- 
cerned the need for readiness, performance, safety, 
and adaptability. 

This preliminary set of objectives was reviewed 
with 18 individuals in high-level assignments involvy- 
ing electronics maintenance, training, and supply. 
Fourteen of the individuals were attached to the 
Staff of the Commander of Naval Air Forces in the 
Atlantic Fleet; the remaining four were members of 
an operational squadron and included the squadron’s 
avionics officer and his maintenance staff. The mean 
length of the naval service represented in the group 
of 18 individuals was approximately 19 yr. 

Semistructured interviews were conducted with 
these staff and squadron men at the end of which 
they were asked to complete the statement: “In op- 
erational terms, the electronic maintenance objectives 
of the Naval Air Fleet are. .. .” On the basis of the 
information obtained from the interviews and the 
written statements the preliminary set of objectives 
was modified, and the final statement of the objec- 
tives of naval air electronics was formulated as 
follows: 


1. Operation: To obtain optimum system output 
when equipment is operated, that is, output charac- 
terized by precision and variability appropriate to 
mission ; 

2. Performance: To complete any given mission in 
minimum time at appropriate level of accuracy and 
reliability ; 

3. Preparation: To prepare for personnel require- 
ments of present and future equipment, systems, and 
situations through use of training programs, mainte- 
nance of high morale, etc; 

4. Readiness: To efficiently maintain self, equip- 





Items 
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ment, and systems in a state of readiness consistent 
with Fleet requirements ; 

5. Safety: To carry out duties with maximum pro- 
tection for men and equipment consistent with mis- 
sion. 


Estimates of the relative importance of these ob- 
jectives were needed so that they might later be 
combined in the formulation of overall goals and 
the resulting establishment of single critical per- 
formance points in addition to critical performance 
points for each objective separately. The objectives 
were scaled along an importance dimension by the 
constant-sum method described by Comrey (1950), 
one of the fractionation methods which produce ratio 
scale values. The judgments required for the scaling 
analysis were obtained from 57 commissioned officers 
and 1 warrant officer, from 30 different operational 
squadrons, all of whom had current responsibility 
for avionics activities. Scale values were derived in 
accordance with Torgerson’s (1958, pp. 108-112) 
modification of Comrey’s analytical procedure. This 
process resulted in importance scale values which 
ranged between 1.14 and .83. In view of the high 
degree of similarity among these variables, the five 
objectives were taken to be equally important. 


Establishment of Critical Points on 
Generalized Job-Performance Scale 


Prior research (Schultz & Seigel, 1961) had de- 
veloped an 8-item hierarchical group of electronic 
task descriptors which was shown to meet the 
specifications for a Guttman scale. This generalized 
job-performance scale of electronics activities had 
been found to yield a coefficient of reproducibility 
of .94, and an index of consistency (Green, 1956) of 
55. The items of this scale are shown as follows; 
the scoring values associated with the items are con- 
secutive integers from 1 for “Employing safety pre- 
cautions on” to 8 for “Employing electronic prin- 
ciples involved in maintenance of.” 


Employing electronic principles involved in maintenance of .. . 
Troubleshooting/isolating malfunction(s) in... 


Calibrating ... 


Knowing relationship of equipment to other related . . . 
Following block diagrams for... 


Removing... 
Replacing ... 


Employing safety precautions on... 


Critical points on this scale were established by 
adapting the procedures of the psychophysical method 
of limits. Two threshold points were sought for each 
objective: one would be the threshold between ac- 
ceptable and unacceptable performance levels, or the 
minimum point of acceptable performance; the other 
would be the threshold between acceptable and de- 
sirable performance levels, or the minimum point of 
desirable performance. The Ss for this analysis were 





61 noncommissioned officers in 23 operational squad- 
rons who were supervisors in the avionics specialties.® 

The scale was reproduced on each page of a ques- 
tionnaire and was headed by one of the five objec- 
tives. The Ss were instructed that the scale items 


8 Aviation electrician’s mate, aviation electronics 
technician, aviation fire control technician, aviation 
antisubmarine warfare technician, and training de- 
vices man. 
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TABLE 1 
Point JUDGMENTS FOR Minimum ACCEPTABLE AND DESIRABLE PERFORMANCE 
on Eacu NAvAtL AVIONICS OBJECTIVE 
Objective 
Job-performance Perf. : j ‘ Combined 
ace erformance Safety Preparation Readiness Operation Bete 
eve objectives 
M SD M SD M SD M SD M SD 
Desirable 
Descending 6.66 | 1.20 6.52 1.70 (ei 1.38 7.24 1.14 7.37 E22, 
Ascending 3.50 1ST, 3.84 1.62 4.57 1.84 | 4.47 1.80 5.01 2.09 
M 5.08 5.18 5.84 5.86 6.19 5.63 
Minimum acceptable 
Descending ASH) || ey || Beal | alee || ZEW |) ale |) Sel |p al |) Geet tak oil 
Ascending 2.24 1.16 2.63 1290 3.39 1e/S oe 1.73 SHS) 1.92 
M Croll 3.42 4.07 4.19 4.63 3.92 




















Note.—N = 61 senior noncommissioned officers in avionics specialties. 


were to be interpreted as completing the blank in 
the sentence: “A technician at this level is capable 
of most of the equipment typically in- 
volved in his rating and is also capable of perform- 
ing all the tasks listed below this point on the scale.” 
They were asked to think of journeyman avionics 
personnel in a typical operational naval air squadron. 
When the method of limits is employed to estab- 
lish threshold points, stimuli are presented in both 
ascending and descending series, and the final thresh- 
old value is taken as the midpoint between the initial 
threshold values arrived at in the two series. The pro- 
cedure in this study was to take Ss, page by page, 
through the questionnaire. As they considered each 
objective, they were asked to indicate: (a) the place 
on the scale which described the lowest level man 
they could make some use of in meeting the objective, 
and (0) the place which described the lowest level 
man who would be a desirable journeyman in the 
sense that he would accomplish his assigned tasks so 
as to fully achieve the objective. This constituted the 
ascending series. Then, on additional pages, they were 
asked to indicate: (a) the place on the scale which 
described the highest level man they could not make 
some use of in meeting the objective, and (b) the 
place which described the highest level man that they 
could make some use of but who would not qualify 
as being a desirable journeyman in meeting the ob- 
jective. This constituted the descending series. 
Employing the scoring values associated with the 
items that the Ss had checked, the mean of the 
ascending series was computed over all Ss for the 
indicated points of minimally acceptable and de- 
sirable performance on each objective, as was the 
mean of the descending series. Then, the mean of 
these respective means was computed, and its value 
was taken as the critical point on the job-per- 
formance scale. This resulted in two critical scores 
for each naval avionics objective: the mean of the 
ascending and descending judgments of the point of 


minimally acceptable performance became the mini- 
mum acceptable critical point, and the mean of the 
two sets of judgments of the minimum point of de- 
sirable performances became the desirable critical 
point. These critical points, with the intermediate 
means and standard deviations upon which they are 
based, are shown in Table 1. 

For each performance level, a mean over all five 
objectives was also computed and is reported in the 
column of Table 1 headed “Combined objectives.” 
In this calculation each objective was unit weighted 
as a result of the determination of essentially equal 
importance of these objectives reported above. 
These values, therefore, represent critical points for 
minimally acceptable and desirable proficiency for the 
overall objectives of the avionics sector of the Navy. 

The overall minimum acceptance judgment was at 
about Point 4 on the Guttman scale (following block 
diagrams for), while the overall desirable judgment 
approached Point 6 (calibrating), a difference greater 
than most of the standard deviations. At both the 
minimum acceptable and desirable levels, the opera- 
tion objective was felt to call for the highest com- 
petence and the performance objective the least. 

An interesting feature of the data in Table 1 is 
that the difference between the means resulting from 
the descending and ascending series is in every case 
substantial and statistically significant. Either the 
critical points are not precise points but, rather, more 
like broad intervals on the basic scale, or else the 
questions, which were quite different in the two cases, 
asked of these judges do not reflect merely two ap- 
proaches to the same point. 


Reliability 


The job-performance scale, prepared in the format 
shown in Figure 1, was employed in six operational 
squadrons of the Atlantic Fleet by immediate line 
supervisors in evaluating a total of 64 journeyman 
avionics technicians. Eight weeks later these tech- 
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TECHNICAL PROFICIENCY CHECKOUT FORM 


The man rated on this form should be a striker or petty officer, third class, in one of the following 
ratings: AE, AT, AQ, TD, AX. 


Listed below are eight tasks that a man in one of these ratings may be called upon to perform. 

You are to indicate whether the man being rated is checked out on each task, i.e., is capable of 
doing the task "on his own" without direct supervision. Indicate your decision by placing an "X" 
either in the "CHECKED OUT" or "NOT CHECKED OUT" box to the right of each task descrip- 


tion. 


TASK DESCRIPTIONS 


NOT 
CHECKED OUT CHECKED OUT 


1. Capable of employing safety precautions on most of this 
squadron's equipment with which his rating is concerned. CJ CJ 


2. Capable of replacing most of this squadron's equipment 


with which his rating is concerned. 


3, Capable of removing most of this squadron's equipment 


with which his rating is concerned. 


4, Capable of following block diagrams for most of this 


squadron's equipment with which his rating is concerned. 


5.. Capable of knowing relationship of equipment to other 
related equipment with which his rating is concerned. 


6. Capable of calibrating most of this squadron's equipment 


with which his rating is concerned. 


ips Slate lee Sy | 
fel lS Bee. ie sa) 


7. Capable of trouble shooting/isolating malfunction(s) in 


most of this squadron's equipment with which his rating is 


concerned. 


O 
L) 


8. Capable of employing electronic principles involved in 





maintenance of most of this squadron's equipment with which 


his rating is concerned. 


O O 


MAKE CERTAIN THERE IS AN ‘X” IN A BOX NEXT TO EACH TASK DESCRIPTION 


Name of supervisor 


Rating 





Name of man rated 
Squadron Location 


Months supervisor has known man 


Rating 


Date 


Fic. 1. Appraisal form based upon job-performance scale. 


nicians were reevaluated by the same supervisors and 
the product-moment correlation coefficient was com- 
puted between the scores the technicians received on 
the two evaluations. This correlation coefficient was 
.97, indicating a satisfactory level of stability for the 


job-performance measuring instrument over an 8-wk. 
period. 

An instrument which possesses the characteristics 
of a Guttman scale is internally consistent. However, 
to determine the effect upon the internal consistency 


ABSOLUTE SCALING OF JoB PERFORMANCE 


of this instrument of application to a different sam- 
ple than the one on which it was originally developed, 
the data from the sample employed for stability 
estimation were also subjected to reproducibility and 
consistency (Green, 1956) analyses. Table 2 presents 
the results of these analyses, showing the respective 
reproducibility and consistency values in association 
with those obtained in the original development of 
the scale. It is seen that the scalability of the items 
did not deteriorate upon re-use. 


Relationships to Predictors and Performance 
Test 


Copies of the Technical Proficiency Checkout 
Form and a biographical information form were sent 
to each of 13 squadrons of the Atlantic Fleet. These 
were accompanied by a request from the Bureau of 
Naval Personnel that the Technical Proficiency 
Checkout Form be completed by immediate line 
supervisors of the journeyman avionics technicians 
in the unit and that the biographical information be 
provided for each man rated. The biographical in- 
formation consisted of scores on the tests of the 
Navy’s Basic Test Battery, final grades in the par- 
ticular Basic Specialty Training School attended, 
and final Basic Specialty Training School class stand- 
ing. These are the predictor variables commonly 
employed by the Navy. Class standings were cor- 
rected for class size by normalizing to standard 
scores with a mean of 50 and a standard deviation 
of 10. Triserial correlation coefficients were then 
computed between Technical Proficiency Checkout 
Form scores, trichotomized in accordance with the 
derived cut points for the combined objectives and 
each of the predictor variables. Table 3 presents 
these correlation coefficients. The Ns vary for the 
various coefficients because of incomplete data for 
some cases. 

In order to assess the relationship of the Technical 
Proficiency Checkout Form to another measure of 
job performance, a job-sample test of knowledge 
and skill in electronics was constructed. The first 
part of the test required the technician to identify 
the safety violations shown in nine pictures. Part 2 
consisted of 11 questions concerning seven block 
diagrams of electrical or electronics equipment. In 
the third part, the examinee was asked to trouble- 
shoot four malfunctioning electrical circuits in a 


TABLE 2 


REPRODUCIBILITY AND CONSISTENCY OF 
JoB-PERFORMANCE SCALE 














Reproduci- Index of 
Boule bility consistency 
Original development? 94 zoo) 
First administration> 97 by 
Second administration» 98 .70 
aN = 181. 
bN = 64, 


ot 


TABLE 3 


TRISERIAL CORRELATION COEFFICIENTS BETWEEN 
PREDICTOR VARIABLES AND TECHNICAL 
PROFICIENCY CHECKOUT ForRM 











Predictor N r 

Basic test battery 

General Classification Test (GCT) 471 | .14 

Arithmetic (ARI) 470 | .19 

Total, GCT and ARI 470 | .19 

Mechanical Aptitude (MECH) 470 | .22 

Clerical Aptitude (CLER) 469 | .05 
Final grade, specialty training school 408 | .15 
Final class standing, specialty training 398 | .15 


school 





specially designed piece of apparatus. In the total 
score on the test, the three parts were weighted 
equally. 

The job-sample test was administered to 50 jour- 
neyman naval avionics technicians from 10 opera- 
tional squadrons, for whom Technical Proficiency 
Checkout Form appraisals were also obtained. The 
proportion of these avionics technicians falling in 
each of the three performance levels, as measured by 
the Technical Proficiency Checkout Form, was “de- 
sirable,” 64%; “below desirable but above minimally 
acceptable,” 30%; “not acceptable,” 6%. The triserial 
correlation between the test scores and the Checkout 
Form scores, as related to the combined objectives, 
was .40. 

These results indicate that the Technical Pro- 
ficiency Checkout Form is not very predictable from 
test and training school performance but that it is 
moderately related to another type of job-per- 
formance measure. Such a picture is common in the 
study of job-performance criteria. 


DISCUSSION 


This research was designed to expand the 
interpretive base of job-performance scales to 
an absolute level by tying them directly to 
the expressed needs of the Navy. The purpose 
was to establish the empirical groundwork for 
converting any appraisal on a checklist form 
into a meaningful statement regarding the ef- 
fectiveness of that proficiency level for ac- 
complishing the overall mission of the avionics 
technician in the Navy. Having established 
such a relationship, the score of any individual 
technician could then be interpreted in terms 
of the expected absolute level of output of 
that individual, that is, the absolute level of 
his usefulness to the fleet. The checklist, 
called the Technical Proficiency Checkout 
Form, had a satisfactory degree of retest re- 
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liability, and it retained the Guttman scal- 
ability properties across two independent sam- 
ples (original development administration and 
two administrations in this research). While 
the list was not very predictable from basic 
Navy predictor variables, it was moderately 
correlated with a job-sample performance 
test. In terms of the general goals of this re- 
search, it seems that the establishment of a 
reliable, valid, behaviorally based evaluative 
technique, that is not normatively oriented, 
has been at least partially achieved. It is also 
believed that the methods employed are ap- 
plicable to performance evaluation in other 
contexts and job families. 
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An account is presented of an investigation devised to extend comparison of 
operator performances using discretely and continuously varying controls to 
acquisition tasks, and to compare the ease with which Ss mastered the 2 con- 
trols. 2 groups, of 12 Ss each, learned 1 control to a stable level, then trans- 
ferred to the other, which was also practiced to a stable level. It was found 
that the discretely variable control was harder to master (p < .05) and harder 
to change over to (p< .01) when used to the other, although it was confirmed 
that there was no significant difference between the stable values. 


An investigation by Hunt (1966) indicated 
that there was little to choose between con- 
tinuously and discretely variable controls in 
a compensatory task. The present experiment 
was undertaken to see whether this finding 
held equally well for acquisition tasks, and 
to investigate the relative ease with which 
the two types of control were mastered. 


METHOD 


Apparatus and task. The apparatus used was the 
Applied Psychology Research Unit control simu- 
lator. The S’s task was to move a spot on a cathode 
ray tube a distance of 5 cm. from an initial rest 
position to a target, represented by a circle 3 mm. in 
diameter, and to keep it within that target for 2 sec., 
continuously. The S’s score was the time taken to 
achieve this. 

The S exercised control using a thumb joystick: 
the position of the stick determined the acceleration 
of the spot. With the continuous control stick, 
maximum acceleration—given by full deflection either 
side of center—was .7 cm/sec; the sense of the 
acceleration was the same as the direction of the de- 
flection of the stick from its (lightly spring-loaded) 
central position; the acceleration varied linearly with 
deflection. The stepped stick had the same external 
geometry as the continuous one, but to give an out- 
put the stick had to be in one of five spring-loaded 
indent positions. These positions were at the center 
of travel, at either extreme, and halfway to either 
extreme. (The total travel was the same in both 
sticks.) The output from these step positions was 0 
in the center, +.35 cm/sec” at the halfway points, 
and +.7 cm/sec” at the extreme points. 

Procedure. Two groups of 12 R.N. ratings acted 
as Ss for this experiment. The first group was trained 


1 The authors would like to acknowledge the help 
they received from F. W. Roberts in preparing and 
maintaining the apparatus, 


sl 


for 6 days, having 10 trials per day, using the 
stepped stick. On the seventh day, they trans- 
ferred to the continuous stick, with which they were 
trained for 2 full days, again having 10 trials per 
day. The second group followed the same procedure 
with the order of sticks reversed; that is, 6 days 
with the continuous stick, followed by 2 days with 
the stepped stick. 


RESULTS 


For each group of Ss two measures of stable 
performance and two measures of transfer 
were obtained. The stable measures were: 


(a) The overall group mean of the last 
five scores before transfer. 

(6) The overall group mean of the last five 
scores of all—that is, after transfer. 


The transfer measures used (see Hammer- 
ton, 1967) were: 


(a) a=100(n—~17)/n, where n is the 
group mean number of trials needed to reach 
a stable level with the stick first used, and r 
is the number of posttransfer trials required 
to reach a stable performance level with the 
second stick. 

Chea OO Cie oh eeswherew iis: the 
group mean of the first posttransfer score, 
and LF is the group mean of the last pretrans- 
fer score. 


The quantity m, defined as above, was also 
used as a measure of speed of initial learning. 
It was found for Group 1: stable group 
mean score with stepped stick = 12.13 sec.; 
posttransfer stable group mean score with con- 
tinuous stick = 10.95 sec.; transfer savings 
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measure ad; = 93%; first-shot transfer mea- CoNCLUSIONS 
sure d, = —18%; initial learning measure m — Fyynt’s (1966) finding that there is little to 
= 28 trials. choose between the two varieties of control in 


It was found for Group 2: stable group steady operation is confirmed. It is evident, 
mean score with continuous stick = 10.38 however, that it is more difficult to master the 
0 posttransfer stable group mean score stepped stick, and more difficult to take it up 
with stepped stick = 11.04 sec.; transfer sav- When ysed to the other. Consequently, it would 
ings measure ad: = 807% ; first-shot transfer seem that the adoption of stepped as op- 
measure d2 = —2267; initial learning mea- posed to continuous controls is not to be 


sure Ng = 20 trials. recommended. 
t tests indicate that none of the stable 

values differs significantly one from another REFERENCES 

at the .05 level. Applying U tests to the in- Hammerron, M. Measures for the efficiency of simu- 

dividual a, m, and d values, however, indicates lators as training devices. Ergonomics, 1967, 10, 

hat: 63-65. 

ae Hunt, D. P. Effects of discrete transformations of 
dy < a, (p < .05, one-tailed) controller output on human tracking performance. 


do <d, (p< .01, two-tailed) Journal of Applied Psychology, 1966, 50, 35-40. 
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VEHICLE KINEMATICS AND GAP ACCEPTANCE’ 


PAUL M. HURST, KENNETH PERCHONOK,? anp EDMOND L. SEGUIN 3 


Institute For Research, State College, Pennsylvania 


Data from traffic photographs under field conditions were analyzed to isolate 
those variables determining gap acceptability in a merging situation. 28 alterna- 
tive expressions were computed, involving positional and velocity information 
for lead, following, and subject vehicles. Regarding each of these indices as a 
correlate of gap acceptability, correlation ratios were computed between ac- 
ceptance-rejection behavior and the value of each index at the “decision point.” 
Values of » ranged from .187 to .733. Each of these values, except the lowest, 
was significant at p<.01. In general, the highest correlations were obtained 
with expressions involving time or speed relationships between subject and fol- 
lowing vehicles. Implications are presented for the use of an index and its cor- 
responding 7 values in the assessment of comparative degrees of “confusion” as 


a function of roadway and/or traffic-control parameters. 


Knowledge of factors influencing driver be- 
havior is necessary for control and prediction 
of traffic characteristics. In recent years, at- 
tention has been given to the effects of inter- 
relationships between vehicles upon the behav- 
ior of the drivers concerned. One such setting 
is the interaction between two vehicles form- 
ing a gap and a third which may or may not 
accept that gap. In this situation, the driver 
of the potentially merging vehicle may be 
thought of as judging the relationships be- 
tween the three vehicles along a single dimen- 
sion so as to make a decision to accept or re- 
ject the gap. 

This paper is addressed to the problem of 
determining those combinations of vehicle kin- 
ematics—vehicle separation, relative speeds, 
and acceleration potentials—which best cor- 
relate with observed gap acceptance/rejection 
behavior. The setting was one in which lane 
changing was necessitated by blockage of the 
medial lane on an urban expressway. 

In order to determine the predictive power 
of a given combination of kinematic factors, 


1This investigation was supported by Public 
Health Service Research Grant No. AC-66, from the 
Division of Accident Prevention. The authors wish 
to thank Edward Gervais and Frank DeRose, Jr., 
and their associates on the John C. Lodge Freeway 
Traffic Surveillance and Control Research Project, 
Detroit, without whose cooperation this study could 
not have been done. 

2Now at Cornell Aeronautical Laboratories, Buf- 
falo, New York. 

3 Now at HRB-Singer, Inc., State College, Penn- 
sylvania. 


called gap index (g), reliance was given pri- 
marily to », the correlation ratio. Two qual- 
ities desired in a gap index are (a) that dif- 
ferent values of the index yield different re- 
sponses, and (0) that different drivers respond 
similarly to the same value of the index. The 
degree to which these criteria are met reflects 
the degree to which drivers apparently utilize 
the gap index under study to make their de- 
cisions with respect to merging. The value of 
n is not dependent upon the particular nature 
of the functional relationship (e.g., linearity), 
but rather measures the total degree to which 
the stimulus controls the response. When 
squared, 7 measures the proportion of accep- 
tance/rejection behavior accounted for by the 
gap index. 


PROCEDURE 


A three-lane section of the John C. Lodge Freeway 
in Detroit was the site of the data collection. From 
the roof of a building adjacent to the freeway a fixed 
observation area, approximately 350 ft. in length, 
was photographed with a 35 mm. camera at the rate 
of eight frames per second. Alternate frames were 
used in the analysis, producing a four per second data 
rate. During the observation periods a maintenance 
vehicle had been moved into the downstream end 
of the observation area, closing the medial lane and 
thus forcing vehicles in this lane to merge into the 
adjacent Jane. The photographs of resulting vehicle 
behavior were reduced to usable quantitative form 
by processing through the Pictorial Data Transducer 
system developed by the Institute For Research 
(Hicks, 1964). This system is a film-scanning device 
coupled with an IBM 024 card punch which enables 
an operator, by positioning a pair of cross-hairs, to 
transform photographic information into digital posi- 
tion data in an X—Y coordinate system. The punched- 
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Fic. 1. Diagram of a gap occurrence at the decision 
point. (S = subject vehicle; F = following vehicle in 
adjacent lane; L=lead vehicle in adjacent lane; 
MV = maintenance vehicle blocking medial lane; Ls 
= length of subject vehicle; X» = position of follow- 
ing vehicle; Xz=— position of lead vehicle; Xs= 
position of subject vehicle.) 


card output is then processed through a battery of 
special-purpose programs using IBM 1620 and 7074 
computers. The computer output consists of point- 
by-point positional and velocity information for all 
vehicles in the observation area. For a more complete 
description of this system, see Hicks (1964). 


Measures 


From the film segments chosen for study, a total 
of 107 gap occurrences were analyzed, including 68 
acceptances and 39 rejections. This number does not 
reflect the actual occurrence rate, since to be included 
in the analysis an “occurrence” had to take place en- 
tirely within the observation area. That is, the lead 
vehicle had to pass the subject vehicle, and the sub- 
ject vehicle either enter the gap or be passed by the 
following vehicle, with all these events occurring 
within the 350-ft. interval. This restriction was neces- 
sary in order (a) to insure that any gap occurrence 
labeled as “rejected” was observed over the entire 
time interval during which acceptance could have 
occurred, and (b) to permit measurement of all ap- 
propriate kinematics at a common point, the “de- 
cision point.” To control motivation, only gaps pass- 
ing the subject vehicle were considered; gaps which 
the subject overtook and passed were not counted as 
occurrences. 

The following measures were computed at the “de- 
cision point,” defined as the instant (to the nearest 
% sec.) that the lead vehicle defining a gap passed a 
subject vehicle in the closed Jane, when the configura- 
tion was shown in Figure 1: 

MV = maintenance vehicle blocking medial lane. 

S = subject vehicle. 
L = lead vehicle in adjacent lane. 
F = following vehicle in adjacent lane. 
Xs = position of subject vehicle (front, longitudi- 
nal coordinate). 
Xz, = position of lead vehicle (rear, longitudinal 
coordinate). 
Xy= position of following vehicle (front, longi- 
tudinal coordinate). 
Ls = length of subject vehicle. 

In addition, the longitudinal component of velocity 
at the decision point was obtained for each of the 
three vehicles in question: 
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s = Velocity of subject vehicle. 

Vi = Velocity of lead vehicle. 

Vr = Velocity of following vehicle. 

From the foregoing measures, it was possible to 
compute the following terms which were utilized in 
the 27 alternative expressions listed below. Computed 
decelerations are based upon the assumptions of a 
reaction time of } sec. and a constant frictional co- 
efficient of .6. 

Hy = Xi — Xv — Ls = Distance headway antici- 
pated for following vehicle. 

Sp = .75 Vy + Vx*/38.4 = Stopping distance required 
for following vehicle. 

Tr = .75 Vr/19.2 = Stopping time required for fol- 
lowing vehicle. 

Sp-s = .75 Ve + [(Vr* — Vs?) /38.4] = Distance re- 
quired for following vehicle to match velocity with 
subject vehicle if the latter does not accelerate after 
the decision point. 

Ty-s = .75 + (Ve — Vs) /19.2 = Time required for 
following vehicle to match velocity with subject 
vehicle if the latter does not accelerate after the 
decision point. 

These terms were utilized in 27 alternative gap in- 
dices; a twenty-eighth index, TH, the time headway 
between the subject of following vehicles was also 
studied. The indices are listed below, with an accom- 
panying value of the correlation ratio, 7, expressing 
the correlation of each index with gap acceptance/ 
rejection behavior. The same 107 gap occurrences 
form the basis for each calculation. 

It should be emphasized that each of these alterna- 
tive indices, with the exception of TH, uses only the 
information potentially available to the driver at the 
point in time arbitrarily selected as the “decision 
point.” For example, Hr/Vy is really a measure of 
projected time headway based on the assumption of 
constant Vr. Thus, Hr/Vr is the “time headway” 
projection actually available to the driver at his de- 
cision point. 


RESULTS 


As can be seen from Table 1, the values of 
n ranged from .187 to .733. The data were 
analyzed for statistical significance using an 
arcsine transformation for the proportions of 
acceptances in each interval. From these 
data, it is possible to compute a between-cell 
sum of squares with an approximate chi- 
square distribution. The transformation was 
necessary because the dependent variable, ac- 
ceptance/rejection, is dichotomous. 

The y= .187 value was not significant; 
each of the other values was significant at 
pool 


Discussion 


Although some of these indices of gap ac- 
ceptability have been studied by others (cf., 
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TABLE 1 


CORRELATION RATIOS FOR VARIOUS INDICES OF g 
with Gap ACCEPTANCE/REJECTION 

















Expression for g n 
1. Vg 261 
Oe Vat .187 
3. Vr 479 
4. Hy 443 
5. Sr 479 
6. Tr 479 
7. SRS 473 
8. Tr_s 491 
9. Hy/Ve pO 
10. Hr/(Vr — Vs) 592 
11. He/Sy LOS 
12. {Hx/Vr}/Tr 542 
13. {Hx/(Ve — Vg)}/Tr .620 
14. Hy/Syr_s 585 
1S: {Hy/Vr}Tr-s 25/3 
16. {Hy/(Ve — Vs)}/Tr_s 693 
17. Vg — Vu .470 
18. Hy — Sy 526 
19. {Hy/Vr} — Tr 494 
20. {Hr/(Vr — Vs)} — Tr 632 
21. Hr — Sy_s 578 
22. {Hr/Vr} — Tr-s 368) 
23. {Hr/(Ve — Vs)} — Tr_s 605 
24, |Vs — 1/2(Vr + V1)| 500 
25. (Vs — Vr)/Hr 601 
26. Hr/|Vg — 1/2(Vr + Vx) | S71 
27. |Vg — 1/2(Ve + Vz) |/He 533 
28. TH .606 





e.g., Glickstein, Findley, & Levy, 1961), 
direct comparison of results across studies is 
perilous. The major factor tending to distort 
such a comparison is the influence, upon 7, 
of the range of dispersion of the stimulus 
variable. Since 7 does not specify the func- 
tional relationship between stimulus and re- 
sponse values, it is impossible to adjust it for 
range effects as is often done with the Pear- 
son 7. Yet such effects probably do operate in 
a strong if indeterminate manner. 

Thus, the proper comparison of 7’s is pos- 
sible only when the same sample of behavior 
is involved in each computation. In the data 
from the present study, a fixed sample of oc- 
currences and acceptances, defined by points 
on photographic film, formed the basis for 
comparing all of the alternative predictor ex- 
pressions. Were this sample to be augmented 
by a number of very large gaps which were 
generally accepted, and by a number of very 
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small gaps which were generally rejected, the 
effect would tend to increase the » obtained 
for any measure having a nonchance relation- 
ship with acceptance/rejection behavior. It 
would still be appropriate, however, to make 
comparisons within the new sample between 
7S obtained for various alternative measures 
of gap size. 

Restricting consideration, therefore, to with- 
in-sample comparisons, several interesting ef- 
fects are to be found. First, note that physical 
gap size or vehicle separation, Hp, was a 
rather poor predictor of acceptance/rejection 
(7 = .443, the third from the lowest value ob- 
tained). Time headway, TH, was better (7 = 
.606, the fifth in rank of magnitude). Expres- 
sions involving the dynamics of the lead 
vehicle (L) generally yielded low »’s. The 
four highest values include the terms Hy and 
Vy. The two highest of these, No. 22 and No. 
16, involve a Ty_g, term which corresponds to 
time required for F to match velocity with S. 
In No. 22, Tyg is subtracted from the time 
headway of F with respect to L, whereas No. 
16 represents the ratio between collision course 
time and Tyg. This collision course time re- 
fers to the time required for F to reach S if 
S were to enter the gap at the decision point, 
in a position immediately behind that cur- 
rently occupied by L. A certain strategy on 
the part of S’s driver is suggested here: he is, 
perhaps, concerned with the ability of F to 
accommodate him by decelerating before F’s 
headway is exhausted. This could be a con- 
sideration regardless of whether S’s driver in- 
tended to contribute to the velocity matching 
by accelerating his own vehicle. 

Recalling the admonition against direct (as 
opposed to relative) comparisons across stud- 
ies, it is nevertheless tempting to compare re- 
sults with those of Glickstein et al. (1961). 
Although their data concern on-ramp merging 
behavior, and are presented in terms of con- 
stant-size class intervals (as opposed to the 
constant-frequency class intervals used here), 
the comparison may be of heuristic value. 

The data from Glickstein et al. are sub- 
divided according to merging by moving vehi- 
cles and merging by stopped vehicles; 97% of 
our data were from moving vehicles. Glick- 
stein et al. used only one measure of gap size, 
“size of gap presented in seconds,” which cor- 


324 


responds to our measure of time headway, 
TH. From the data presented by Glickstein 
et al. in their Table 4, we computed values 
of » for time headway versus acceptance/re- 
jection, and obtained a value of .463 for 
moving vehicles and a value of .663 for 
stopped vehicles. The .463 value is somewhat 
lower than the value of .606, obtained for 7H 
from our primarily moving-vehicle data. The 
.663 value for stopped vehicles is comparable 
to some of our better predictive efforts with 
moving vehicles. A reasonable interpretation 
would seem to be that the driver of a stopped 
vehicle is primarily concerned with the time 
size of the gap as defined by the lead and 
following vehicles, and his behavior can be 
fairly well predicted from this datum alone. 
The driver of a moving vehicle, however, is 
concerned with the relationship between his 
speed and certain time or speed characteristics 
of the gap, so his behavior cannot be well 
predicted on the basis of time headway alone. 

Broad application of any g index must, of 
course, await cross-validational studies per- 
formed under a wide variety of conditions. 
If it is possible to obtain such an index (or 
indices) of sufficiently wide generality, we can 
proceed to compare the degree of stimulus 
control of gap acceptance across differences 
in traffic situations: differences in roadway 
features, traffic control procedures, etc. Sup- 
pose, for instance, that Situation A yields a 
high » between g and gap acceptance versus 
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rejection, and Situation B yields a much lower 
value of 7. This would mean that in Situation 
A drivers were highly consistent in rejecting 
gaps of lower g and accepting gaps of higher 
g. In Situation B, they were less consistent; 
gaps of lower g were accepted relatively more 
often, in comparison with gaps of higher g. 
This would permit the inference that drivers 
were less confused in Situation A. Consistency, 
as implied by 7, is measurable independently 
of “threshold” shifts, that is, the tendency to 
accept or reject more gaps in general. If it 
is defensible to assume that the driver popu- 
lations which were sampled in the two situa- 
tions are similar, then it can be concluded that 
something about Situation B causes drivers to 
be more confused than in Situation A, and 
preceed to derive and test hypotheses as to 
the source of this confusion. Sometimes the 
source may be obvious. In other instances it 
will be more subtle, but regardless of subtlety 
it should contribute to the understanding and 
control of driver behavior. 
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Complex thought processes used by a skilled psychologist dealing with per- 
sonnel selection and placement are analyzed and programmed in a computer 
simulation model. Heuristic methods used to limit the total possible set of 
decision branches in the model are defined and illustrated. Research results 
indicate a strong relationship between human and machine output with a 94% 
level of agreement between simulated psychological inferences and human 
decisions with identical ultimate employment recommendations in 22 of 24 test 
cases. Implications of research findings on psychological instruction, experimen- 
tation, information retrieval, test validation, and general decision making are 


discussed. 


Recent developments in the study of man 
and machine systems have resulted in signifi- 
cant interdisciplinary achievements. The areas 
of psychology, industrial management, and 
operations research have been increasingly in- 
tegrated through quantitative and scientific 
methods which aid in the definition, explana- 
tion, and prediction of human behavior. One 
of the most useful of these methods is simula- 
tion, a research methodology which facilitates 
the design and testing of models of physical 
and human systems. Through the use of 
simulation, the models can be programmed to 
yield results which closely approximate real- 
world phenomena. 

Simulation models are of varying types. 
Some are applied to the study of physical sys- 
tems such as oil refineries and railroad switch- 
ing yards. Others deal with man-machine sys- 
tems such as the Rand Air Defense Simula- 
tion in which military personnel and hard- 
ware exhibit integrated reactions to simulated 
changes in the air environment. Yet another 
type involves human cognition and can be 
used to replicate the manner in which man 
makes decisions in solving complicated prob- 
lems. The purpose of this article is to present 
the methodology, results, and, most important, 
the implications of research dealing with this 
latter form of simulation. 


1 This article is based on a paper presented at The 
College on Management Psychology Session of The 
Institute of Management Science Fourteenth Inter- 
national Meeting, Mexico City, August 26, 1967. 


Considerable work in the development of 
cognitive simulations has been conducted by 
university and privately sponsored research 
groups. Colby (1964) at Stanford, Maruyama 
(1966) at Berkeley, Kleinmuntz (1963) at 
Carnegie-Mellon, Swenson (1962) at the Mayo 
Clinic, and Finney (1967) at The University 
of Kentucky have been instrumental in de- 
signing computerized simulations which de- 
scribe individual psychological characteristics 
based on some type of test scores. 

The present research represents an exten- 
sion of work previously accomplished. Spe- 
cifically, this study investigated in depth and 
subsequently simulated the thought processes 
of a psychologist involved in the analysis and 
interpretation of test results, job requirements, 
and personal characteristics of potential cleri- 
cal employees. In this type of decision making, 
the psychologist is faced with ill-structured 
problems not amenable to solution by algorith- 
mic techniques in which the researcher is 
able to express an objective function and 
constraints in well-defined mathematical terms. 
Instead, it is necessary to rely upon the con- 
struction of simulated decision networks which 
replicate as closely as possible the psycholo- 
gist’s thinking processes. 


MetTHOop 


As shown in Figure 1, the experimental model ex- 
tended the number and types of variables included 
in previous studies by adding job specifications and 
personal characteristics of applicants. These addi- 
tional variables made the task of simulation more 
difficult since inferences derived from test scores had 
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Fic. 1. Evolution of model complexity. 


to be viewed in light of specific job requirements and 
personal characteristics of the individual such as her 
age, experience, and length of time in her present 
position. 

It was hypothesized that a simulation model could 
be developed which would replicate the thought 
processes of a skilled psychologist as he performed the 
decision task. The Protocol Method was used to 
gather data (ie., the analyst’s verbalization of 
thoughts were recorded as he evaluated a series of 
cases dealing with actual job applicants). The ap- 
plicants were all female and were applying for vari- 
ous types of clerical positions requiring varying 
levels of skill. These positions were billing clerk, 
statistical clerk, clerk typist, receptionist, administra- 
tive assistant, and executive secretary. The protocol 
was tape-recorded, transcribed, and analyzed in depth 
prior to the development of a computer flow chart 
defining the decision processes. Finally the model was 
tested by utilizing a set of cases which were analyzed 
by the human and processed by the computer. 


RESULTS 


In all, 24 cases were evaluated by both hu- 
man and machine methods. In 22 cases the 
machine selected the same employment recom- 
mendation, which was one of the four clas- 
sifications: hire, reject, hire as a fair risk, or 
check background further. In one case the 
machine was not programmed to handle the 
input data and consequently the human ana- 
lyst was called in, and, in the other case of 
nonagreement, the human recommended hir- 
ing as a fair risk while the model suggested a 
further background check of the applicant. 

Accuracy of the simulated interpretative 
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statements generated by the computer was 
judged to be 94% by a skilled analyst not in- 
volved in the collection of the. protocol. Fur- 
ther, statistical tests of hiring recommenda- 
tions showed that the probability of obtaining 
such results by chance was so small that the 
hypothesis was accepted, and it was con- 
cluded that the model did simulate very 
closely the human analyst’s psychological in- 
ferences. 


Limitations 


The preceding paragraphs have very briefly 
presented the methodology and results of the 
research.” It should be noted that no attempt 
was made to improve upon the decision-mak- 
ing process of the analyst and it would have 
been preferable to test a larger sample of 
cases had they been available. It might be 
true that a more valid and reliable model 
would have resulted if the combined decision- 
making capabilities of a group of analysts had 
been used. However, rather than spending 
time in a discussion of the limitations of the 
study, more significant issues involve the use 
of heuristics in development of the model and 
the implications of results on general psycho- 
logical decision making. 


Heuristic Reduction of the Decision Space 


In developing the computer model, the 
matter of relevancy of information became all 
important. Consider the clerical selection bat- 
tery used in this experiment. It is comprised 
of the Otis Mental Ability Test, the Short 
Employment Tests, the Washburne S—A In- 
ventory, and the Gordon Personality Inven- 
tory and Profile. This battery includes 20 
different test scores, some of which can range 
from 1 to 99 such as the Gordon scores while 
others range from excellent to maladjusted 
(7-point scale) on the Washburne Test. In 
all, there are 1,151 possible psychological test 
scores, in addition to job and personal data, 
which can be obtained for a single applicant. 
Thus, it is plain to see that the number of 
combinations of relationships and elements 
that can be chosen quickly exceeds manage- 
able proportions. In fact, using the variables 


2 For a complete discussion of research design refer 
to Smith and Greenlaw (1967). 
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in this study, it would require 107° outcomes 
to provide a model encompassing all possible 
events. This figure approaches the estimated 
number of atoms in the known universe (10*°) 
representing a programming task of impos- 
sible magnitude. To avoid this situation, it 
is necessary to restrict the boundaries of the 
system and consider only those relationships 
which are most relevant to the solution of the 
decision problem. 

To simplify development of the experi- 
mental model, the experience of the decision 
maker was organized into a graphic represen- 
tation of thought processes. The principles 
used in the simplification of the model (i.e., 
to limit the boundaries of relevance) were 
actually applied by the psychologist and 
merely transformed into more quantitative 
terms by the researcher. These principles are 
often referred to in the literature as “heuris- 
tics” taken from the Greek Heveiskein which 
means to limit search. Heuristics provide gen- 
eral decision rules or “rules of thumb” which 
assist researchers in the development of work- 
able models of systems which are within the 
boundaries of human and machine capabilities. 
Heuristic solutions to problems are not neces- 
sarily the best or optimum solutions since cer- 
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PRINT STATEMENTS 

1. Has general tendency torestrict 
social contacts. 

2. Applicant is very retiring and sub- 
missive, not suited for job as 
specified. Recommend rejection. 

3. Based upon specified job require- 
ment, recommend hiring. 

4. Applicant appears to be an introvert, 
very retiring and submissive 


Fic. 2. Partial heuristic decision map for sociability. 
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PRINT STATEMENTS: 

. Applicant is considerably below 

average inintelligenco 

2. And has very low clerical ability 

3. Recommend rejection 

4 But experience may compensate 
for this negative factor, 
Recommend checking with former 
employer. 

5. Applicant is methodical and siow 
but intelligent. She will probably 
do wall what she attempts. 

6. Otis scores unusual; refer to 
human interpreter. 


Fic. 3. Partial heuristic decision map for intelligence 
and ability. 


tain alternatives are excluded from considera- 
tion which hopefully are irrelevant but, in 
fact, may not be. Clarkson’s (1962) Port- 
folio Selection and Tonge’s (1961) Assembly 
Line Models were both heuristic in nature as 
the authors turned their approach from a 
search for the optimum solution to one that, 
with high probability, would be suitable for 
their purposes. 

The same heuristic approach was adopted 
in the present research. Instead of providing 
for each possible value of a particular vari- 
able, relevant ranges for most variables were 
defined. This type of system definition in- 
volved the use of heuristics very similar to 
those used by the analyst in his interpreta- 
tions. An example will help to clarify this 
point. The sociability score on the Gordon 
Test could range between 1 and 99. The 
analyst, however, treated only particular 
ranges for certain job types. For the statistical 
clerk, a sociability score of 95 or greater 
coupled with a low cautiousness score led to 
a negative response since the individual could 
conceivably be a “social butterfly” and in- 
capable of maintaining the prolonged con- 
centration required for the statistical work. 
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Figures 2 and 3 provide examples of heu- 
ristics as well as the general methodology used 
in mapping the thought processes. Note that, 
in Figure 2, the sociability score has been 
subdivided into three broad ranges at the 
lower end of its scale, and within one of 
these ranges (i.e., 6 through 10) it is com- 
pared with certain job specifications. This is 
not the only place in the program where the 
sociability score is considered, but it is evi- 
dent here that a low score coupled with work 
requiring high levels of verbal proficiency and 
decision making lead to a negative recom- 
mendation.* 

The psychologist placed his initial and 
greatest emphasis on scores which appeared at 
the extremes of the rating scale. That is, he 
first recognized scores above 90 or below 25 
on the Gordon Tests and this information also 
provided a useful heuristic for the model 
builder. 

If certain combinations of variables were 
relevant but not provided for in the model, 
the computer was instructed to branch to the 
next case and leave the exceptional case for 
human interpretation. In this manner, heu- 
ristics were used to reduce the number of 
combinations of variables within the system 
to those most relevant for solving the major- 
ity of cases. 

Another type of heuristic was employed 
which added efficiency to the search process. 
It was discovered during the analysis of proto- 
col that the psychologist tended to divide his 
problem into four general components paral- 
leling the four tests used in the battery. He 
chose mental ability as his first area of in- 
vestigation and seemed to weigh the Otis 
scores heavier than other factors for most 
jobs. Next, he referred to the clerical aptitude 
results followed by the emotional stability 
scores, and finally the personality profiles. 
This does not mean, however, that he fol- 
lowed an unbroken sequence from one test 
to another. If, for example, he found a girl 
who performed remarkably well on the Otis 
Test, he would investigate the Gordon score 
on “original thinking” as well as the type of 
job for which she was applying. The intent 

3 Proficiency levels required in specific jobs were 


expressed on a numerical scale ranging from 1 (lowest 
level) to 3 (highest level). 
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here was to prevent very intelligent girls from 
being placed in positions where their ability 
and interest could lead to poor performance 
on repetitive or otherwise unchallenging jobs. 

The broad approach to sequencing, how- 
ever, did lead the researcher to design separate 
computer subroutines for each major area as 
a starting point in the development of the 
integrated system. Naturally, there was much 
subsequent interaction among the subsystems, 
but the heuristic technique of fractionating 
the total problem further simplified the work 
and, more importantly, permitted the simula- 
tion to follow a path through the network 
whereby the most heavily weighted variables 
were given highest priority. Items which 
tended to eliminate candidates most often 
were considered almost immediately, and time 
was saved since many unacceptable applicants 
were discovered relatively early in the net- 
work. 


Implications of Research Results 


One of the most important implications of 
the results of this research is the knowledge 
that it is possible to program fairly complex 
psychological decision tasks. This means that 
time normally spent by highly skilled analysts 
in decision making which is fairly routine for 
them can be made available for more signifi- 
cant problem solving and research. Second, 
the probability that boredom might influence 
results may be lessened as the psychological 
evaluator concentrates on those cases of un- 
usual nature which require diverse and flexible 
skills found only in the human analyst. 

With computerized processing of routine 
data comes a marked reduction in the cost per 
interpretation when, of course, there is suf- 
ficient volume to introduce economies of scale. 
Where the psychologist normally took about 
20 minutes to give a complete analysis, the 
machine performed the same type of task in 
about 3 seconds.* In this manner, expert 
analyses may be provided to those organiza- 
tions which previously could not afford the 

4 The decision model designed for the experiment 
was programmed to branch to the next case whenever 
an applicant was rejected. The psychologist, however, 
continued with a complete analysis even though a 
combination of factors led him to formulate a nega- 


tive recommendation somewhere during the course of 
his evaluation. 


SIMULATION OF DECISION PROCESSES 


services of skilled psychologists. This may be 
particularly true in personnel offices and 
guidance centers which sometimes offer sub- 
marginal services in the field of psychological 
analysis. 

Simulation models of this type can be useful 
in the field of computer-assisted instruction. 
A laboratory approach in basic psychology 
courses might be established whereby students 
would verify their beginning and intermediate 
attempts at analyses against the results of an 
expert. With the information storage and 
retrieval capabilities of today’s computers, 
teaching efforts might be made more efficient 
through the use of feedback data. Professors 
could evaluate the most common errors made 
by their students during the laboratory anal- 
yses and thereby concentrate upon these areas 
of weakness in subsequent lectures. 

Computerized simulations of psychological 
decision processes offer other advantages from 
an information-utilization standpoint. It is 
now possible to study the effects of policy 
changes on end results without disrupting the 
present system. For example, in the personnel 
model discussed in this paper, one could de- 
termine the effect on mental ability standards 
caused by a labor shortage which required a 
business to increase the acceptance ratio by 
20% (i.e., how much would one have to lower 
mental ability scores to gain a 20% increase 
in the number of applicants accepted). In 
this type of model it would be fairly simple 
to determine the effects of changing the values 
of individual parameters on final decisions as 
well as to determine which factors were re- 
sponsible for the greatest percentage of re- 
jected applicants. If any of these rejection 
factors were amenable to correction through 
training, it would be possible to estimate the 
trade offs between the costs of additional per- 
sonnel search versus the costs of training 
persons not quite acceptable under present 
standards. 

From the view of a science of decision mak- 
ing, this experiment lends some credence to 
the theory that human thinking can be defined 
as a complex network of simple binary choices. 
It is difficult to determine whether the ana- 
lyst’s decisions were or were not based on 
simultaneous consideration of multivariate 
criteria. But, by sequencing his thought proc- 
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esses in a series of binary alternatives, it was 
possible to replicate his final decision even 
though it was based on relatively complex 
interrelationships among variables. 

Research of this type gives deeper insight 
into the manner in which people resolve prob- 
lems. The methodology allows the researcher 
to map an equivalent thought process at a 
particular point in time and could permit the 
study of the effects of aging and experience 
on decision-making capabilities of individuals 
and groups. 


Suggestions for Further Research 


Work should be done to determine whether 
there is a significant difference in job per- 
formance between those personnel selected by 
human methods and those chosen through the 
use of a model. Of course, the problem still 
exists that a representative portion of the ex- 
perimental group is lost since profit-oriented 
organizations usually do not hire applicants 
who have been rejected by skilled interpreters. 

The practice of obtaining “objective” evalu- 
ations of employee performance still permeates 
all forms of organizations. Even with the de- 
velopment of modern information systems, 
managers of all types are faced with the un- 
pleasant task of judging the performance of 
their workers. Halo and leniency effects pre- 
dominate. Perhaps it would be possible to de- 
sign a simulation model whereby the records 
of each employee could be subjected to a kind 
of skilled, nonbiased, objective evaluation 
without fear of political pressure or subjec- 
tive interpretations. 

Results of the study indicate that psycho- 
logical and other factors can be integrated in 
a workable decision model. This fact leads 
one to believe that computer-assisted psy- 
chological clinics for low-income classes might 
be feasible. Admittedly this idea connotes a 
world of cold impersonalism but there is the 
possibility that many people would willingly 
avail themselves of expert services even 
though these services were not obtained 
through direct human contact. Further ex- 
perimentation along these lines would require 
the development of a model encompassing psy- 
chological test results and environmental fac- 
tors. The model could then be used to screen 
large numbers of people, some of whom might 
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exhibit symptoms of present or potential 
mental illness. A computer-assisted screening 
center would be used to diagnose cases in 
early stages for referral to a specialist, thus 
providing minimum-cost psychological  ser- 
vices to broader segments of our society. 
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EFFECT OF PRICING ON PERCEPTION 
OPeEkOowCT: OUALLLY. 


J. DOUGLAS McCONNELL 1 


Stanford University 


An experiment was undertaken to test whether the price of a product or brand 
was a determinant of S’s perception of the quality of the product. 60 beer- 
drinking adult Ss made 24 selections from 3 brands of beer over a period of 
8 wk. Ss were free to choose whichever brand they wished. Brand names 
were consonant letters of the alphabet, and the product was identical for all 
3 brands. The only real variable was price. Analyses indicated that S’s percep- 
tion of the quality of the beer was significantly affected by the cue supplied, 


i.e,, price. 


The theory that price differentials will re- 
sult in consumers perceiving product-quality 
differences, where products are in fact homo- 
geneous, has been suggested by Scitovsky 
(1944-45), Leavitt (1954), and Tull, Boring, 
and Gonsior (1964). The experiments by 
Leavitt and Tull, however, were pencil-and- 
paper tests where Ss did not use the brands 
of products presented in the choice situation. 
Their findings, therefore, must be treated 
with reservations. The specific hypothesis 
tested in the study reported here was that, in 
the absence of more directly observable cues, 
consumers accept price as a product charac- 
teristic correlating highly with product qual- 
ity and will alter their perception of a product 
on the basis of the cues supplied. In other 
words, it was believed that Ss would perceive 
the higher priced brands as being of higher 
quality even though the actual product was 
identical for all brands. 

Observation of consumer behavior in the 
market place provides direct support for this 
hypothesis. A $5,000 automobile is perceived 
as a better quality car than one costing 
$3,500; a $3.00 pair of stockings is perceived 
as being superior in quality to an $.85 pair, 
yet both may “run” within 5 min. of being 
worn. An $80 gown is superior to a $45 gown. 
The phrase “you get what you pay for” is 
accepted as a truism in the American culture 
and those of most highly industrialized na- 
tions. 

1 Now at Stanford Research Institute, Menlo Park, 
California. The article is based on research under- 


taken as part of the author’s unpublished doctoral 
dissertation, Stanford University, May 1967. 


In view of this, the hypothesis appeared to 
be soundly based. It was believed that if a 
“Dprice-quality” cognition existed the Ss, being 
unable to change the “price-brand” stimulus, 
would change their perception of the product 
to maintain consonance between their cogni- 
tions and perception. 


METHOD 
Subjects 


Sixty adult married students, or wives of students, 
who classified themselves as beer drinkers, were 
chosen at random from Stanford University Married 
Student Housing and served as Ss. Median age was 
27 yr. with a range of 21-55 yr. Family income 
ranged from $2,500 to $15,000, with a median value 
of $5,000. Forty-four Ss were male and 16 female. 


Product 


The product used in the experiment was beer 
which was obtained from a California brewer with 
the approval of the California Department of Al- 
coholic Beverage Control and the United States In- 
ternal Revenue Service. 

An order was placed with one brewer for 60 cases 
of lager-type beer, bottled in unmarked, nonreturn- 
able “glass cans,” and sealed with a plain seal. It was 
specified that all beer come from the same batch 
number so that the product would be physically as 
homogeneous as possible. 

It was decided to restrict Ss’ range of choice in the 
experiment to three brands of beer. To identify these 
three brands, cases of beer were drawn from in- 
ventory as they were required and labeled at random 
with white rectangular self-adhesive labels lettered 
L, M, or P in red-felt marker ink. The letters, fol- 
lowing Tucker (1964), were selected for ease of 
identification and memory and because each is a 
consonant falling in the middle of the alphabet and 
occurs with approximately the same frequency in 
usage of the English language. Nonsense syllables, 
such as suggested by Davidson, Suppes, and Siegel 
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TABLE 1 


DATA FOR CONTINGENCY TABLE ANALYSIS 





; Price levels of beer 
Ratings from 


Likert-type 





ae High Medium Low 

Undrinkable 0 4 1 4 
Poor 1 8 21 20 
Fair 2 26 22 23 
Good 3 15 12 9 
Very pleasant 4 7 4 4 

Total score 133 117 109 

Mean score 2.23 1.93 1.80 





(1957), were considered but rejected on the grounds 
that if they were “association free” they would be 
difficult to learn—an undesirable attribute for any 
brand name. It was decided to risk the possibility 
that the letters chosen might have meanings for 
some Ss to facilitate learning of the brand names. 
The price was shown to the customer in two ways. 
One, a card behind each test bottle told the price of 
that brand per six-pack since most consumers see it 
in the stores that way. As Ss were not paying for the 
bottle of beer they selected at each trial, the situa- 
tion was considered to be very artificial and not com- 
parable to the purchase decision situation that 
usually confronts a shopper. To minimize the effect 
of these free or “windfall” goods in the selection 
process, the less expensive brands had to represent 
a financial saving compared with the high priced 
brand, equivalent to the six-pack price differentials. 
This was achieved by taping the per bottle price 
differential of 5¢ on the low priced brand and 2¢ on 
the medium priced brand. Thus, an S selecting the 
low priced brand would be in a position similar to a 
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consumer who purchases a low priced brand and has 
more change in his pocket than he would have if he 
bought a high priced brand. 


Procedure 


The Ss were called on in their own apartments at 
prearranged times. A number were called on be- 
tween 8:30 A.M. and 9:15 A™M., while the greater 
majority were visited between 5:00 p.m. and 7:30 
P.M. 

On each visit Ss selected one bottle from the three 
brands (L, M, and P) which were set out on a 
wooden tray and backed by a card stating the price 
per six-pack. For each trial the positions on the 
tray were changed so that any selections based on 
position on the tray would be completely randomized. 
No brand occupied the same position on the tray 
for any two successive trials, and each brand oc- 
cupied each position with equal frequency over the 
24 trials. The 24 trials were conducted at the rate of 
three per week for 8 wk. The Ss undertook to drink 
each bottle selected at any time convenient to them 
before making their next selection. After their final 
selection in Trial 24, Ss were presented with a ques- 
tionnaire form to be completed. The Ss were asked 
on this form to rate each of the three brands of beer 
on a 5-point Likert-type scale. They were also asked 
to select from a panel of words, commonly used to 
describe beer, the three words that best described 
each brand. The words in the panel are shown in 
Table 3 and have generally favorable and unfavor- 
able connotations with respect to beer. 


RESULTS 


The data were analyzed in three ways. A 
simple contingency table analysis (Mood & 
Graybill, 1963) was made to test the relation- 
ship between the ratings obtained and the 
three price levels of the brands of beer. The 


TABLE 2 


SuMMARY OF RaTING SCORES BY BRAND AND VALUES FOR ? Usinc METHOD 
FOR CORRELATED OBSERVATIONS 














Brand M 
Numerical ($1.30) 
value per 
rating point 
No. responses | Score value 

0 4 0 

1 8 8 

Di 26 12 

3 15 45 

4 ey 28 
Total score 133 
Mean score 2.23 








No. responses 


Brand L 
($1.20) 


Brana P 
($0.99) 


Score value | No. responses | Score value 


1 0 4 0 


21 ait 20 20 
22 44 23 46 
12 36 9 27 
4 16 4 16 
117 109 

1295 1.80 





Note.—According to the Student’s #, mean differences between Brand M versus Brand P = 2.46, p < .01; Brand M versus 


Brand L = 1.64, p < .06; Brand L versus Brand P = .93, ns. 


PRICE AND PERCEPTION OF QUALITY 


TABLE 3 


PANEL OF Worps USED By SUBJECTS TO 
DESCRIBE THE THREE BRANDS OF BEER 
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TABLE 5 


SUMMARY OF ANALYSIS OF VARIANCE FOR FAVORABLE 
AND UNFAVORABLE Worps ATTRIBUTED TO 























BRANDS 
Favorable Unfavorable = = 
(implied high quality) (implied low quality) Source of variation SS df MS F 

Tangy Flat Total 1800.20 | 179 
Rich flavored Biting Between brands 
Smooth Acidy Mig 64.53 DWS2-265))\'3:29" 
Full bodied Watery Within 1735.65 | 177 | 9.801 
Light Bitter 2 : ‘ ee $F Ox, 
Dry Sour *p = 05, F = 3.07, df = 2/120. 





data for this analysis form Table 1. The rat- 
ings of “undrinkable,” “poor,” “fair,” “good,” 
and “very pleasant” were assumed to be 
monotonically increasing and were given 
values of 0, 1, 2, 3, and 4. 

The contingency table analysis produced 
highly significant findings, the y* distribution 
value being significant at p< .005 level, 
thereby indicating the existence of a price- 
quality relationship. 

The second test took the mean ratings ob- 
tained for each of the three price levels and 
tested each pair of means for significant dif- 
ferences, using the Student’s ¢ distribution and 
method for correlated observations (Winer, 
1962). This method was used because the 
ratings for each brand could not be assumed 
to be independent as each S rated each brand. 
The results of this analysis are given in Table 
2. A significant quality difference was per- 


TABLE 4 


Basis OF DETERMINING NUMERICAL SCORE FROM 
Worps CHOSEN BY SUBJECTS TO 
DESCRIBE BRANDS 





No. responses 





Numerical 
score 
Favorable Unfavorable 
words words 
3 0 8 
2 0 7 
2 1 6 
1 0 5 
i 1 4 
0 1 3 
1 y 2 
0 2 1 
0 3 0 





ceived between the high and low priced brands. 
The interesting aspect of this analysis, how- 
ever, was that the middle priced brand was 
not perceived as being of significantly higher 
quality than the low priced brand. The “per- 
ceptual distance” between the high and middle 
priced brands was greater than between the 
middle and low priced brands, although the 
actual price differential would have led one 
to suspect that the reverse should have been 
the case. 

The third test of the price-quality relation- 
ship used Ss’ selections from the panel of 
words with favorable and unfavorable con- 
notations listed in Table 3. 

To examine Ss’ selections statistically, a 
numerical score was assigned to each S’s se- 
lection on the basis of the number of favorable 
and unfavorable words chosen for each brand. 
This is shown in Table 4. 

This test permitted Ss to be 
“blocks,” and for price to be 
“treatments,” and a one-way 
variance (Winer, 1962) to be 
The results were significant at 
level as indicated in Table 5. 

Apart from this analysis of Ss’ word choices, 
the overall direction of selections is readily ap- 


regarded as 
regarded as 
analysis of 
carried out. 
the p< .05 


TABLE 6 


RELATIONSHIP BETWEEN PRICE OF BEER 
AND DeEscRIPTIVE WorRDS 








Type of Brand of beer and price per six-pack 
words 
chosen 
M ($1.30) L ($1.20) P ($0.99) 
Favorable 93 73 a 
Unfavorable Tid 82 101 





Note,—Totals chosen are number of words. 
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a 80 
5 70 UNFAVORABLE WORDS 
= 
we 60 
Oo 
oe 50 
wi 
a 
x 40 
=) 
Zz. 30 
20 
10 
op L M 
LOW PRICE MEDIUM PRICE HIGH PRICE 


BRANDS OF BEER 


Fic. 1. Plotting of favorable and unfavorable descrip- 
tive word responses by brand and price. 


parent from a cursory examination of Table 
6 and Figure 1. 


DiIscussION 


Price was used by Ss as a cue to product 
quality. With a physically homogeneous prod- 
uct and unknown brand names (which had so 
little meaning that many Ss never used them), 
the highest priced brand was perceived as 
being a “better quality” product than the 
other two brands. Brand L, the middle priced 
brand, was perceived as being marginally 
better than the lowest priced brand. In fact, 
if perception and reality had correlated per- 
fectly, each S would have tried each brand 
once and then chosen Brand P (the lowest in 
price) for the remaining trials. In this way Ss 
would have maximized their utility, receiving 
the same beer and 5¢ at each trial instead of 
something less. The Ss, however, selected 
Brand M (high price) for over 41% of all 
trials and Brand L (middle price) for 25% 
of all trials. Price, in the absence of other 
cues, was a powerful determinant of how the 
brands were perceived. 
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Perceived quality did not vary linearly with 
price. The greater price disparity existed be- 
tween Brand L and Brand P, a total difference 
of 3¢ a bottle. The price difference between 
Brand M and Brand L was only 2¢ a bottle. 
However, the perception of quality was such 
that Brand L and Brand P were perceived as 
much closer than Brand L and Brand M. 
This study found no causal explanation for 
this phenomenon. A first-order Markov analy- 
sis of brand switching revealed that Brand L 
lost to both Brands M and P with the greater 
movement to Brand M. This analysis tended 
to confirm the significance of the ratings ob- 
tained for quality. 

The study strongly supports the earlier 
work that price is an independent variable on 
which perception of a brand or product de- 
pends. The extent to which this may be gen- 
eralized beyond the United States culture and 
over a wide range of products is unknown. 
Further research is clearly indicated. 
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CLAYTON P. ALDERFER 1 
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This study reports the results of a field experiment carried out in an industrial 
organization designed to test the impact on fixed-alternative questionnaire 
answers of having some respondents be interviewed by a series of open-ended 
questions covering the same topics prior to taking the questionnaire. Support 
was provided for 1 of the 2 hypotheses tested. More negative data were pro- 
vided about peer relations by those who were interviewed before doing the 
questionnaire than by those who were not. This finding held for both low- 
and high-skill employees. There was no difference in the data about superiors 
under the 2 methodological conditions. The results were interpreted in the 
context of other research in this area, and 2 continua for explaining field 
method effects on questionnaire validity were proposed. 


The place of the interview in social science 
research has been a subject of some con- 
troversy. Some investigators have cautioned 
against the use of the technique because of the 
possibility that an interviewer might be able 
to lead a respondent to bias his answers to- 
ward those the researcher wanted (Dunnette, 
1962). Others have criticized the interview on 
the ground that it provides data only on a 
respondent’s perception of events (Becker & 
Greer, 1957). Nevertheless, a fairly extensive 
literature has developed around styles and 
types of interview (Kahn & Cannell, 1957; 
Richardson, Dohrenwend, & Klein, 1965; 
Whyte, 1960). Another methodological de- 
bate and inquiry has concerned the use and 
misuse of questionnaires. Public criticism has 
been voiced against ‘“‘survey sickness,” the 
agonizing pursuit of questionnaire data to 
answer trivial questions in the misguided 
quest of science (Hechinger, 1966). There 
have also been a number of studies directed 
toward empirically determining some of the 
conditions which facilitate obtaining valid 
data by the use of questionnaires. 

One variable studied has been the effect of 
anonymity of respondents. Although a num- 
ber of studies have been reported, the results 
are not entirely consistent. Pelz (1959), for 
example, compared the questionnaire responses 
of Ss who were completely anonymous to 

1The writer would like to thank Richard A. 
Dunnington and Chris Argyris, Fritz Steele, and Wil- 


liam F. Whyte for their comments on an earlier ver- 
sion of this paper. 


those who were identified but confidential. 
The basic conclusion was that there were no 
differences between the two methods of ad- 
ministering questionnaires in terms of the at- 
titude scores obtained. The investigator, how- 
ever, cautioned that these results would prob- 
ably have been different if the study were 
done under different conditions, for example, 
if careful preparation had not been carried 
out, if there were fear of layoff, or if there 
were widespread hostility toward manage- 
ment. Klein, Maher, and Dunnington (1967), 
on the other hand, found that questionnaire 
responses were more positive for identified 
than for anonymous respondents. The answers 
were also more positive when a plant man- 
ager told the identified Ss to use an identify- 
ing code than when a member of the corpora- 
tion personnel research group made the same 
request. Specific questions also had a greater 
tendency to be positively distorted, namely, 
ones about pay and evaluation by one’s 
superior. 

The Klein, Maher, and Dunnington study 
was carried out entirely by company staff. 
Dunnette and Heneman (1956) compared the 
responses of employees to whom a question- 
naire was administered by their personnel 
manager with those to whom the same ques- 
tionnaire was administered by an independent 
university researcher. They found significant 
positive distortion on general morale items 
and on supervisory-subordinate issues by the 
respondents to whom the questionnaire was 
administered by the insider. 
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Still another way to look at these methodo- 
logical issues is to ask about the effects of 
simultaneously using more than one method 
to investigate a problem area. For example, 
do respondents answer a fixed-alternative 
questionnaire differently if they have been 
interviewed prior to taking the questionnaire? 
The present study reports a field experiment 
designed to answer that question. 


Theoretical Issues 


One point of view about the research process 
suggests that it contains elements of mutual- 
ity (Argyris, in press; Maslow, 1966). If a 
researcher wants valid data, it helps if he is 
perceived as a trustworthy person. If he wants 
complete data, it helps if he makes himself 
accessible to observation, too. Of course, this 
is not the only way to conceive of the re- 
search process. One might also suggest that 
respondents do not want to know the re- 
searcher and prefer that he remain distant, 
uninvolved, and presumably neutral. How 
might one test between these two points of 
view using interview and questionnaire meth- 
odology? 

Suppose the researcher conducted an open- 
ended interview with each respondent before 
asking him to complete a questionnaire. The 
interview might serve a number of functions: 
(a) the respondent would become familiar 
with the topics of the study, (b) he would be 
able to answer the questions in his own terms 
rather than be forced into fixed-alternative 
answers on the questionnaire, (c) he would 
be able to ask questions of the researcher 
without being observed by other members of 
the organization, and (d) he would have con- 
siderable time to “size up” the investigator 
and to find ways to test his trustworthiness. 

If an interview provided a respondent with 
a mechanism whereby he could trust the re- 
searcher more, how would this affect question- 
naire responses? The impact of trust generally 
is to provide a climate of psychological safety, 
thereby allowing a person to talk more freely 
about potentially threatening issues. Some 
previous research provided a basis for a priori 
predictions about which research topics would 
be more likely to be distorted if adequate re- 
searcher-respondent trust were not present. 
Argyris (1962) has found that interpersonal 
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issues, such as dealing with emotions and 
facing conflict, tend to be difficult for many 
organization members in their day-to-day ac- 
tivities. If these issues present problems for 
people in their regular dealings with each 
other, then one would suggest that they would 
also be quite threatening in single encounters 
with an outsider whose trustworthiness might 
be open to question. In addition to the gen- 
eral threat posed by interpersonal issues, 
there is also the fact that some of the major 
issues in the client-researcher relationship are 
questions of trust, confidence, and respect. 
So one would expect the research encounter 
itself to evoke concerns about interpersonal 
issues. Consequently, the attitudes which 
would be expected to be more fully recounted 
if additional trust could be developed in the 
researcher-client relationship would be those 
dealing with interpersonal relationships. If 
the attitudes are more fully described, their 
overall sentiment would be more negative 
because the factors people would withhold 
due to lack of trust would be the negative 
aspects. The following hypotheses, therefore, 
were subject to test in a field experiment. 

Hypothesis 1: Respondents who were in- 
terviewed by a series of open-ended questions 
covering the same subjects as those of the 
questionnaire would give more negative ac- 
counts of satisfaction with respect from supe- 
riors than respondents who were not inter- 
viewed before taking the questionnaire. 

Hypothesis 2: Respondents who were in- 
terviewed by a series of open-ended questions 
covering the same subjects as those of the 
questionnaire would give more negative ac- 
counts of satisfaction with respect from peers 
than respondents who were not interviewed 
before taking the questionnaire. 


MeEtTHOopD 
Procedure 


The study was carried out over a period of 3 mo. 
in a manufacturing organization employing approxi- 
mately 1,700 people. Three shifts were in opera- 
tion, and data were gathered on each of the shifts. 
A week preceding the time when an S was asked to 
appear to participate a notice was sent to him 
through his first-line supervisor. The memorandum 
indicated the time and place for the person to ap- 
pear and indicated whether he would be inter- 
viewed or asked to complete a questionnaire. 


COMPARISON OF QUESTIONNAIRE RESPONSES 


When a person came for an interview he was 
greeted by the investigator and informed about the 
purposes and topics of the study. While this explana- 
tion proceeded, a small tape recorder was setting on 
a nearby table. The case was open so that it could 
be clearly seen that it was not turned on. When the 
initial introduction had been completed, the inter- 
viewer asked for the respondent’s permission to tape 
the interview. The person was assured that no one 
in the company would ever be allowed to listen 
to the tapes. 

All of the interviews were conducted by the 
writer. Almost without exception the questions were 
asked in the same order. Some sample questions 
were, “What do you like to get out of a job?” or 
“Would you describe your boss for me?” There were 
15 standard questions. Additional probing questions 
were used to clarify what a respondent meant when 
it seemed unclear to the investigator.? 

Questionnaires, for people not interviewed, were 
distributed during meetings attended by 10-20 per- 
sons at one time. The meetings were held between 
shifts and were attended by people who were leaving 
after having completed their work for the day. No 
more than three people from a given department 
were asked to attend a questionnaire administration 
at one time. No more than two people from a given 
shift in a given department were interviewed in 1 day. 

Persons who were interviewed were given a ques- 
tionnaire and a stamped envelope addressed to the 
investigator after the interview had been completed. 
The questionnaire had a code number on the back. 
The S was told that this number existed so the re- 
searcher could keep all the information provided by 
each person together. The confidentiality of the study 
was reiterated at this point. 

Persons who attended questionnaire meetings 
either completed the instruments in the meetings or, 
if there was not sufficient time, also received a 
stamped envelope addressed to the researcher. Ques- 
tionnaires which were completed in the meetings had 
no identifying labels; neither names nor code num- 
bers were provided. Where the questionnaires were to 
be mailed back, then identifying numbers were used. 
The Ss were informed that the investigator wanted 
to be able to remind them in case they forgot to send 
back their questionnaires. Confidentiality was reaf- 
firmed. Approximately 60% of the questionnaires 
were completed in the meetings. 


Subjects 


The respondents in this study were all nonsuper- 
visory employees. Two groups of Ss are distinguished 
according to skill level. People holding the low-skill 
jobs had duties which included: custodial work, 
highly repetitive assembly-line operations, machine 
tending in continuous process operations, inspection, 
and stock accounting. The high-skill group included: 
the trade groups such as mechanics, machinists, and 


2A complete account of the interview questions 
and introductory procedures is given elsewhere (Al- 
derfer, 1966). 
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electricians; quality control technicians who de- 
veloped new inspection standards and worked away 
from the production lines; and advanced machine 
operators who tended highly complex machines and 
carried out selected staff functions as well. 

The Ss were randomly selected from the list of 
employees in the organization. Approximately 90% 
of the people who were selected participated in the 
study. Roughly one-third of each skill group were 
randomly assigned to be interviewed before taking 
the questionnaire. Forty-four low-skill and 39 high- 
skill employees were interviewed, while 88 low-skill 
and 75 high-skill took only the questionnaire. 


Dependent Variables 


The data used to test the experimental hypotheses 
were taken from the questionnaires. Sample items 
from the four scales are presented below. They ap- 
peared in a Likert scale format. Agree or disagree 
after the item indicates the direction scored for satis- 
faction. A full listing of the items in each scale is 
given elsewhere; the total number of items in each 
scale is indicated (Alderfer, 1967). 

Pay satisfaction: (7 items). 

Compared to the rates for similar work here, my 
pay is good. (agree) 

I find it hard to do some of the things I enjoy 
most on the pay I receive. (disagree) 

Respect by superiors satisfaction: (8 items) . 

My boss takes account of my wishes and desires. 
(agree) 

My boss will play one person against another. 
(disagree) 

Respect by peers satisfaction: (8 items). 


I can count on my co-workers to give me a hand 
when I need it. (agree) 


My co-workers are uncooperative unless it’s to 
their advantage. (disagree) 

Use skills and abilities satisfaction: (12 items). 

My job requires that a person use a wide range of 
abilities. (agree) 

In my job I have same thing to do over and over. 
(disagree) 

RESULTS 


Table 1 presents the means and standard 
deviations for the two skill groups under the 
different experimental conditions on each of 
the four attitude scales. The data do not pro- 
vide support for Hypothesis 1, for either skill 
group. There are no statistically significant 
differences on the questionnaire measure of 
respect by superiors satisfaction between the 
two experimental conditions in either skill 
group. 

There is support for Hypothesis 2, how- 
ever. The low-skill employees who were in- 


TABLE 1 


QUESTIONNAIRE MEASURES OF NEED SATISFACTION FOR 
QUESTIONNAIRE-ONLY AND INTERVIEW- 
AND-QUESTIONNAIRE GROUPS AT 
Two SKILL LEVELS 




















Group N x SD Z 
Pay satisfaction 
Low-skill questionnaire-only 88 | —.03 | 1.01 E 
Low-skill interview-and- —.65 
questionnaire 44 | —.16] 1.13 
High-skill questionnaire-only 75 mae 97 "| 
High-skill interview-and- 72 
questionnaire 39 eis 91 
Respect by superiors satisfaction 
Low-skill questionnaire-only 88 oS .79 x 
Low-skill interview-and- 72 
questionnaire 44 .38 90 
High-skill questionnaire-only Toil ald 20 - 
High-skill interview-and- —.70 
questionnaire 39 | —.31} 1.10 
Respect by peers satisfaction ; 
Low-skill questionnaire-only 88 -10} 1.05 
Low-skill interview-and- —1.95* 
questionnaire 44 | —.33 | 1.28 
High-skill questionnaire-only eS Ri 74 
High-skill interview-and- —2. 74% 
questionnaire 39 | —.29 | 1.06 
Use skills and abilities satisfaction 
Low-skill questionnaire-only 88 | —.43 | 1.08 
Low-skill interview-and- Ll 
questionnaire 44] —.45 .93 
High-skill questionnaire-only ie) .20| .80 
High-skill interview-and- —.50 
questionnaire 39 cL Se 


Note.—The attitude scale scores were standardized with a 
mean of 0 and standard deviation of 1 for the whole organiza- 
tion. Negative signs on the means refer tc the dissatisfaction 
end of the scale; positive signs to the satisfaction end. 

*p < .026, one-tailed test. 

** » < .005, one-tailed test. 


terviewed prior to completing a questionnaire 
gave significantly less satisfied accounts of 
respect by peers than the low-skill employees 
who were not interviewed. A similar finding 
holds for the high-skill group. The high-skill 
respondents who were interviewed prior to 
taking a questionnaire showed significantly 
more negative respect by peers satisfaction 
attitudes than the high-skilled employees who 
were not interviewed. 

Data on the other two attitude scales, pay 
satisfaction and use skills and abilities satis- 
faction, show no significant differences for 
either job group as a function of the experi- 
mental conditions. 

Perhaps it should be mentioned that the 
four dependent variables are essentially uncor- 
related with each other. Their median cor- 
relation is zero; the extreme values are .02 
and —.04. Therefore, tests on the separate 
scales are meaningful, and positive or negative 
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findings on one scale are essentially indepen- 
dent of results on other scales. 

Also, the reliabilities of the four scales are 
reasonably high. Spearman-Brown estimates 
for the pay, superiors, peers, and abilities 
scales are, respectively, .88, .84, .79, and .88. 
Therefore, it is unlikely that failure to detect 
differences between experimental conditions 
can be attributed to lack of reliability in the 
dependent measures. 


DiIscUSSION 


The experimental findings provide support 
for only one of the hypotheses. This support, 
however, was reasonably strong in that it held 
for both the low- and high-skill groups. Why 
was there not support for the respect by 
superiors satisfaction variable as well as for 
the respect by peers satisfaction measure? 
Two explanations are suggested. 

The first implies that the experimental con- 
ditions were not strong enough. That is, while 
having an interview prior to doing a question- 
naire raised the respondents’ trust enough so 
they revealed more negative attitudes about 
their co-workers, the procedure was not enough 
to have a similar effect on attitudes about 
superior-subordinate relations. This explana- 
tion assumes that discussion of superior-sub- 
ordinate relations is more threatening than 
discussion of peer relationships and therefore 
requires a higher level of researcher-client 
confidence for the respondents to feel free to 
admit to negative attitudes about their bosses. 

The second explanation says that as far 
as superior-subordinate relationships were con- 
cerned, the additional trust made no differ- 
ence because S was not very threatened. This 
is to imply that the respondents said all there 
was to say on the questionnaires, and the in- 
crement of trust which interviews might have 
added had no payoff in more valid data about 
superior-subordinate relations. Following the 
Dunnette and Heneman (1956) findings, one 
might reason that the investigator being an 
outsider may have been all that was necessary 
to obtain complete data on this topic. In 
terms of the data of this study, this explana- 
tion assumes that discussion of superior- 
subordinate relations was Jess threatening than 
discussion of peer relationships. 


COMPARISON OF QUESTIONNAIRE RESPONSES 


If the second explanation is valid, then com- 
bining the data from this study with those 
obtained by Dunnette and Heneman (1956) 
and by Klein, Maher, and Dunnington (1967) 
suggests two generalizations about the impact 
of field-research methodology on the validity 
of questionnaire responses. 

It should be emphasized, however, that 
these generalizations are intended to be quite 
tentative. They represent one way of bring- 
ing some order out of the results of the 
present and related studies on this subject. 
Since they are not empirically confirmed, they 
may best be seen as hypotheses suggestive 
of profitable directions for future research. 

The first generalization concerns a con- 
tinuum of research methods ranging from 
those which seem to lead to more response 
distortion to those which lead to less bias on 
questionnaires. The following order from most 
to least distortion is suggested: 


(a) Identified questionnaires with admin- 
istration in part by line management; 

(6) Unidentified questionnaires with ad- 
ministration in part by line management; 

(c) Identified questionnaires with adminis- 
tration by internal (personnel) staff; 

(d) Unidentified questionnaires with ad- 
ministration by internal (personnel) staff; 

(e) Identified questionnaires with admin- 
istration by external (university or research 
institute) researchers; 

(f) Unidentified questionnaires with ad- 
ministration by external (university or re- 
search institute) researchers; and 

(g) Respondents interviewed by external 
researcher prior to filling out questionnaires. 


The reader will note that although this 
order is presented as a single dimension, more 
than one variable enters the definition of 
the points. For example, who the adminis- 
trator is receives a heavier weight than 
whether the respondent is identified. Such an 
assumption involves going a bit beyond what 
the results of the studies cited above and the 
present results strictly show, but the exten- 
sion does not seem unreasonable. Furthermore, 
there is no way of knowing what effect in- 
terviewing preceding the questionnaire ad- 
ministration would have if the interviewer 
were an internal researcher. 
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The second tentative pattern emerging from 
combining this study results with those of 
other writers is an ordering of issues whose 
validity increases as one moves along the 
methodological continuum. As the method 
provides more respondent-researcher trust, the 
issues seem to have more valid data provided 
about them in the following order: 


(a) Pay satisfaction; 

(0) Superior-subordinate relations, particu- 
larly around subjects of mutual trust and 
equity; and 

(c) Peer relations, concerning such issues 
as acceptance, respect, and trust. 


It would seem that there is a basic dimen- 
sion underlying the ordering of these issues. 
As one moves from pay, through superiors, 
to peers, the object of interest changes from 
a material factor external to an important 
person who (at least formally) is somewhat 
distant from the respondent to co-workers 
who, quite likely, are closer to the respondent. 
The underlying continuum appears to be 
intimacy. It seems that negative data about 
more intimate issues by questionnaire become 
more available as a safer and closer relation- 
ship is developed between researcher and re- 
spondent. 

When satisfaction with pay and satisfaction 
with use of skills and abilities did not show 
differences between the experimental condi- 
tions, indirect support was provided for the 
idea that the preceding interview affects pri- 
marily questionnaire responses about inter- 
personal relationships. From these data, there 
is little reason to suggest that an external re- 
searcher should invest the time and money re- 
quired to interview people if he does not want 
to obtain data about interpersonal relation- 
ships in the organization. On the other hand, 
where data about interpersonal relationships 
are desired, there is reason to believe that 
preceding questionnaire administration with a 
semistructured interview can reduce distor- 
tion of questionnaire responses. 
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ADDENDUM TO “AN EMPIRICAL INVESTIGATION 
OF TWO IMPLICATIONS OF THE TWO-FACTOR 
THEORY OF JOB SATISFACTION” 


GEORGE B. GRAEN ann CHARLES L. HULIN 


University of Illinois, Urbana 


Multiple hypotheses were tested between the 2-factor theory of Herzberg, 
Mausner, and Snyderman and the traditional theory of job satisfaction. A 
procedure was used that granted the assumption of the 2-factor theory of a 
dimensional difference between overall satisfaction and dissatisfaction. Instru- 
ments employed were designed to capitalize on any unique variance associated 
with the satisfaction or the dissatisfaction dimension. Analysis consisted of 
1-way analyses of variance of the Job Description Index variables of Work 
Itself, Promotion, Salary, Supervision, and Co-workers on overall satisfaction 
and on overall dissatisfaction. Results support the hypotheses of the traditional 
theory at the expense of the 2-factor theory. 


Attempts to test the two-factor theory of 
job satisfaction (Herzberg, 1966; Herzberg, 
Mausner, & Snyderman, 1959) employing 
methods other than storytelling, in addition to 
failing to support hypotheses derived from the 
theory, have assumed a single continuum of 
overall job satisfaction-dissatisfaction. The 
single exception was a recent study by Hulin 
and Smith (1967) in which special instru- 
ments were designed to measure independently 
satisfaction and dissatisfaction. The purpose 
of the present study was to examine these 
unique data from the perspective of strong 
inference (Platt, 1964). The strong-inference 
approach pits hypotheses from competing 
theories against one another and demands a 
determination of their relative value for future 
research. It differs from the more usual ap- 
proach by emphasizing hypotheses testing 
rather than theory confirmation (Dunnette, 
1966). 

Hypotheses derived from the two-factor and 
the traditional theory (Ewen, Smith, Hulin, 
& Locke, 1966; Graen, 1966) and tested in 
this study are as follows. The two-factor the- 
ory hypothesizes that those variables classified 
as “satisfiers” (Work Itself and Promotion) 
should control a significant amount of vari- 
ance in overall satisfaction and not in overall 
dissatisfaction, whereas those variables clas- 
sified as ‘dissatisfiers’ (Co-workers, Super- 
vision, and Salary) should control a significant 
amount of variance in overall dissatisfaction 
and not in overall satisfaction. In contrast, 


traditional theory hypothesizes that both so- 
called “satisfiers” and “dissatisfiers” should 
control a significant amount of variance in 
overall satisfaction and overall dissatisfaction. 


METHOD 


The Ss were male and female home-office em- 
ployees of one corporation. Each S$ completed the 
Job Description Index (Smith, 1967) and either an 
overall job satisfaction or overall job dissatisfaction 
instrument. The Job Description Index (JDI) mea- 
sures satisfaction with the job variables of Work 
Itself, Promotion, Co-workers, Supervision, and Sal- 
ary. In this study, 146 Ss completed the overall satis- 
faction measure, and another 155 completed the 
overall dissatisfaction measure. For further details, 
see Hulin and Smith (1967). 

The analysis consisted of performing one-way 
analyses of variance to test the effects of the five 
JDI variables on overall satisfaction and on overall 
dissatisfaction (Winer, 1962). Each JDI variable was 
trichotomized into dissatisfied, neutral, and satisfied 
using cutting points supplied by Ewen (1965). 
Omega-squared (Hays, 1963) was employed to 
estimate the strength of relationships between JDI 
variables and the two criteria of overall satisfaction 
and overall dissatisfaction. 


RESULTS 


Those variables classified as satisfiers (Work 
Itself and Promotion) showed significant re- 
lationships on satisfaction and on dissatisfac- 
tion. In contrast, those variables classified 
as dissatisfiers (Co-workers, Supervision, and 
Salary) showed significant relationships on 
satisfaction and not on dissatisfaction. The 
percentage of variance in satisfaction ac- 
counted for by the job variables was 23% 
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TABLE 1 


ANALYSIS OF VARIANCE AND OMEGA-SQUARE OF JOB 
VARIABLES ON SATISFACTION AND DISSATISFACTION 











Dependent variable 











Independent | q |. facti - Dissatisfaction 
warble Satisfaction only only 
df F Te N taf F % 
Satisfier 
Work itself Die ee On eel DaRoro on ee OO 
Promotion 2 | 14.05* | .16 Zeal Soldat OS 


Dissatisfier 





Co-workers Dees Soap ake: ee Odie OL 

Supervision | eels ere al Dale On ee 

Salary DN eto | 0Y6) 2 ilsw || Onl 
Residual 143 152 











a % indicates the proportion of total variance accounted for 
by the independent variable. 

* p< .01. 
for Work Itself, 16% for Promotion, 14% for 
Co-workers, 11% for Supervision, and 6% for 
Salary. The percentage of variance in dis- 
satisfaction accounted for by the job variables 
was 6% for Work Itself, 9% for Promotion, 
and a maximum of 1% each for Co-workers, 
Supervision, and Salary. 


Discussion 


Even when the assumption of a dimensional 
difference between satisfaction and dissatis- 
faction is made and instruments designed to 
capitalize on any unique variance associated 
with each dimension are employed, the results 
clearly disconfirm predictions of the two-factor 
theory. The findings that “satisfier” variables 
(Work Itself and Promotion) contribute to 
both satisfaction and dissatisfaction and that 
“dissatisfier” variables (Co-workers, Super- 
vision, and Salary) contribute to satisfaction 
and not dissatisfaction are incompatible with 
the two-factor theory. 

The present results are more compatible 
with the traditional theory. The traditional 
theory hypothesizes that if job variables are 
categorized according to the two-factor clas- 
sification system, both so-called satisfiers and 
dissatisfiers contribute over the entire con- 
tinuum of overall job  satisfaction-dissatis- 
faction. In addition, the traditional theory 


GrorcE B. GRAEN AND CHARLES L. HULIN 


accounts for the differences on overall satisfac- 
tion-dissatisfaction between satisfiers and dis- 
satisfiers by hypothesizing that intrinsic vari- 
ables (satisfiers) are related more strongly 
to overall satisfaction-dissatisfaction than are 
extrinsic variables (dissatisfiers). The tradi- 
tional theory views the differences between 
satisfiers and dissatisfiers as primarily a mat- 
ter of potency rather than a matter of direc- 
tionality. The only finding not supporting the 
traditional theory was the failure of the dis- 
satisfier variables to contribute to dissatisfac- 
tion. This lack of relationship was probably 
due to the severe restriction of range in the 
measure of overall dissatisfaction. In future 
studies with samples representing lower levels 
of dissatisfaction, higher relationships would 
be expected between all job variables and 
overall dissatisfaction. The results of this 
study must be viewed as damaging to the two- 
factor theory and as supporting the traditional 
theory. 
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A PSYCHOPHYSICAL DETERMINATION OF 
EQUITABLE PAYMENT: 


A METHODOLOGICAL STUDY? 


SHELDON ZEDECK anp PATRICIA CAIN SMITH 


Bowling Green State University 


The practicality of using psychophysical methods to determine ranges for 
payment plans was explored. Thresholds of perceived equitable payment 
(PSE) and the “just meaningful difference” (jmd) of payment were determined 
by an adaptation of the psychophysical Method of Limits. PSE and jmd were 
significantly greater for junior executives than for secretaries of an academic 
institution, whereas the respective Weber ratios, K (the proportionate meaning- 
ful additions to the base salary), were not significantly different. There were 
no significant differences between 2 secretarial subgroups in jmd and K, but 
a significant difference in PSE. The limitations and implications of this method, 
and its relevance to equity theory, were discussed. 


Several theorists have independently pro- 
posed models for the determination of equity 
of payment for work (Adams, 1963, 1965; 
Homans, 1961; Jaques, 1961; Patchen, 1961; 
Sayles, 1958; Zaleznik, Christenson, & 
Roethlisberger, 1958). Each model assumes 
that all employees perceive a fair, just, or 
equitable return for what they contribute to 
their jobs. Furthermore, each model involves 
a social comparison (an employee determines 
what his equitable return should be after com- 
paring his inputs and outcomes, or his invest- 
ments and rewards, with those of co-workers 
or reference groups). Also, each model as- 
sumes, implicitly, that the dissonant cogni- 
tion by the employee of over- or undercom- 
pensation results in feelings of tension, dis- 
satisfaction, anger, or guilt. 

Methods of assessing and _ evaluating 
equitable payment differ. Typical methods in- 
clude interviews and discussions with em- 
ployers and employees, observations by in- 
vestigators, and the collection of biographical 
data. Adams’ (1963, 1965) extensive research 
characteristically induced the feelings of 


1 This research is based in part on a master’s 
thesis at Bowling Green State University. 


under- and overcompensation and then evalu- 
ated Ss’ production records. 

A premise that has not been directly 
investigated is that an employee can evalu- 
ate his income as equitable or inequitable in 
terms of some composite internal absolute 
standard which is not tied to any one group or 
person. A further related problem is the 
sensitivity of the worker to salary inequities. 
Both Jaques (1961) and Adams (1965) sug- 
gest that workers are more sensitive to under- 
compensation than to overcompensation. An- 
drews (1967) attempted to reflect this idea in 
the hypothesis that a small underpayment 
will affect job performance as much as or more 
than a large overpayment. The hypothesis was 
partially confirmed, yet there still is no direct, 
reliable evidence for the differential sensitivity 
hypothesis. 

The present study investigated the intuitive 
internal perceptions of salary. Also, the study 
investigates directly the sensitivity of workers 
to inequities of pay. To determine the upper 
and lower thresholds of perceived equitable 
payment and the “just meaningful difference” 
(jmd) of payment (analogous to the jnd in 
psychophysical measurement), the study used 
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an adaptation of the Method of Limits which 
is the one method that most clearly defines the 
concept of thresholds (Woodworth & Schlos- 
berg, 1954, p. 199). This method also makes 
it possible to determine amounts of propor- 
tionate meaningful increases (K) to base 
salary. 


MeEtTHOopD 


Subjects 


Questionnaires were mailed to 25 male junior ex- 
ecutives (Group I), and 96 female secretaries—45 in 
the maintenance department (Group II) and 51 in 
the executive department (Group III)—of a large 
midwestern academic institution. Each had at least 
1 yr. of experience in his position. 

Returns of the questionnaire were 80% (n= 20) 
for Group I, 51% (n=23) for Group II, 45% 
(n= 23) for Group III, and 6 from the secretarial 
pool for whom the secretarial office could not be 
identified. 


Research Material 


The questionnaire contained general job-satisfaction 
questions, intended to disguise the intent of the 
study, and the basic questions necessary for the psy- 
chophysical analysis. The format of the basic ques- 
tion was: 


TABLE 1 


PsYCHOPHYSICAL DETERMINATIONS 








Psychophysical 


























aD measure df t 
PSE 
I $7920.00 15 
IT & Tl | $3814.29 21 34 | 18.42* 
II $3574.29 7 
III $3919.09 11 16 287% 
jmd 
I $ 912.00 15 ; 
T&M} $ 235.71 21 34 6.19* 
Il $ 295.71 a 
UI $ 196.36 11 16 1.30 
K 
I abs 15 
II & II 062 21 34 38 
II 083 7 
II .050 11 16 .29 
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Considering the type of work required of you on 
your present job, is a year 

a fair salary 

more than a fair salary 

less than a fair salary. 


The salary levels within the basic question (one 
level, randomly placed, per page) were based on 
the average salary and probable error (PE) for the 
respective groups. There were nine salary levels, four 
levels above and four levels below the average. 
Each successive change above and below the average 
was an approximation of the PE: $180 and $720 a 
year for secretaries and executives, respectively. 


Procedure and Design 


The point of subjective equity (PSE), or the 
salary that the S perceived to be fair, and jmd were 
determined for each group by the Method of Limits 
with the adaptation of presentation of single stimuli 
in haphazard order (Guilford, 1954; Woodworth & 
Schlosberg, 1954). 

Two criteria were used in evaluating the data 
matrices (on which the responses were recorded in 
serial order). First, if the responses within an em- 
ployee’s questionnaire did not contain an “=” (fair), 
“4” (more than fair), and “—” (less than fair), 
then that questionnaire was not used in the analysis. 
Second, one inversion within an employee’s results 
was tolerated. If more than one inversion occurred, 
then the following rules were applied: If in in- 


specting the series in ascending order, a — re- 
sponse was found after there had been a series of 
— and at least two = responses, or an = and a 


+, such a series was not evaluated. Similarly, in 
descending inspection, the appearance of a + after 
two =s, or an = and a —, disqualified the question- 
naire. 


RESULTS 


After applying the criteria for evaluation, 
the sample sizes for Groups I, II, and III 
were reduced to 15, 7, and 11, respectively, 
with 21 secretaries forming the combined 
secretarial group (including 3 unclassified 
secretaries). 

In comparison of the junior executives with 
the total secretarial pool and with each sample 
of secretaries, both the PSE and the jmd were 
significantly greater for the executives (Table 
1). In contrast, however, a comparison of the 
respective Weber ratios reveals that the 
proportionate meaningful additions to the base 
salary were neither large nor significantly 
different. 

A comparison of the two secretarial groups 
showed that although the actual earned 
salaries for Groups II and III were within the 
same range, the PSE was significantly greater 
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TABLE 2 


PERCEIVED MEAN SAaLAry RANGES 








: Equitable 5 
Group n <Fair PSE + jmd) >Fair 
Junior executives} 15 | $7008.00 | $7008.00-8832.00 $8832.00 
Secretaries 21 | $3578.58 | $3578.58-4050.00 $4050.00 
Group II 7 | $3278.58 | $3278.58-3870.00 $3870.00 
Group III 11 | $3722.73 | $3722.73-4115.45 $4115.45 


Ee 


for Group III, whereas the jmd was not sig- 
nificantly different. The difference between K 
was also small and not significantly different. 

Based on this information, equitable pay 
ranges for all groups were constructed (Table 
2)" 

Since the sample size for each group was 
small, the combined sample of returned ques- 
tionnaires was analyzed (Table 3). The lower 
threshold, T(—), for each employee who had 
indicated an upper threshold, T(+), was com- 
pared with the T(—) of those employees who 
had not stated a T(+). For both executives 
and the total secretarial group, the T(—) was 
significantly smaller for the subgroups that 
had indicated a T(+). 


Discussion 


This study demonstrates the potential 
utility of this method, despite the small sample 
size. The results of this study are, of course, 
situation specific; the PSE and jmd that were 
determined apply only to the groups on which 
the data were collected. The results cannot be 
generalized to a similar work group in the 
same setting (as evidenced by the differences 
obtained with the two secretarial groups), 
much less to a similar group in a different 
setting. The method may, however, have 
general applicability. 

It is to be expected that, as an employee 
goes up the work-level scale, PSE and jmd 
will increase. However, the nature of the 
functional relationships involved is an interest- 
ing point for speculation based on the data 
obtained. Since there was no significant dif- 
ference between the K values for the groups, 
either Weber’s, Fechner’s (Guilford, 1954), 
or Stevens’ (1957) law might hold. Applying 
this technique to a greater number of groups 
along the work and pay-grade scale would 
reveal the nature of the wage curves in 
operation for any specific company. 
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TABLE 3 


LowEer THRESHOLD, T(—), FoR Groups WITH AND 
WitHouT Upper THRESHOLD, T (+) 














Gan With Without ‘ 
. TH) | TH 

Junior executives | $7008.00 | $9150.00} 18 | 3.67* 
(mn = 15) | (wv = 5) 

Secretaries $3578.57 | $3893.48 | 22 | 3.01* 
(w= 21) | (7 = 23) 

: 
*> <.05. 


The technique also detects differences be- 
tween groups at the same job level. The sig- 
nificant difference between PSE for the two 
secretarial groups could be attributed to 
different working conditions for the two 
groups, to differences in personal needs, or 
other causes which can be_ subsequently 
investigated. 

Though the two secretarial groups had dif- 
ferent perceptions of fair salaries, the amount 
of meaningful increase was approximately the 
same. It is not suggested that the average 
K of 6.2% should be applied in determining 
raises, but that a range such as 5-8% be 
used. This range would be useful in determin- 
ing raises with consideration for education, 
experience, seniority, etc. The use of ranges 
should be applied for all determinations based 
on the method, instead of using the exact 
PSE, jmd, or K, and could correct for the 
individual differences within the groups. 

A point to be considered in using this tech- 
nique is that of sensitivity to inequities. 
Adams (1965), Jaques (1961), and Andrews 
(1967) suggest that Ss are more sensitive to 
underpayment as opposed to overpayment. 
The data are not inconsistent with these 
generalizations. Since the choices included 
four PEs above and four PEs below the 
average salary, one concludes that Ss were 
more willing to indicate inequities below than 
above equitable salaries. For the junior execu- 
tives, 100% of the sample indicated a T(—) 
whereas only 75% indicated a T(+). The 
results were more striking with the secretaries, 
98% of whom indicated a T(—) while only 
49% indicated a T(+). However, these find- 
ings cannot be taken as conclusive for it is 
possibly an artifact of the number and values 


346 SHELDON ZEDECK AND 
of the levels used, and might also indicate 
dissatisfaction. 

Since more than 50% of the returned ques- 
tionnaires did not meet the criteria for evalua- 
tion, an attempt was made to extract some 
information from the unevaluated question- 
naires by comparing the T(—) responses of 
the unevaluated with the evaluated question- 
naires. The significant difference between the 
T(—) responses for both the executives and 
secretaries could be interpreted as indicating 
that the differences were only in intervals of 
uncertainty (equitable range) and not a dif- 
ference in jmd. 


Limitations 


Several technical limitations must be con- 
sidered. The results were obtained by a mailed 
questionnaire. This method of data collection 
is hazardous for this type of study. It is im- 
portant, for validity purposes, that S fills out 
one page at a time and does not look back to 
his previous answers. The S should not have 
the opportunity to scan the questionnaire, 
familiarize himself with the levels, and then, 
with the aim of being consistent, fill in his 
responses. A suggested procedure is to ad- 
minister the questionnaire to a group of em- 
ployees simultaneously, requesting each S$ to 
answer one page at a time, hand in or place 
a completed page in an envelope, and then 
receive another page, the process being re- 
peated until the task is completed. Also, with 
this type of administration, the sample size 
would be increased. 

Another deficiency in the technique is the 
number and size of increments. The value of 
the increments should be sufficiently small so 
as to be most sensitive to the inequities. The 
desired sensitivity can be gained by following 
a test-retest procedure. After administering 
the questionnaire with nine levels of salary, 
one could retest the same group a short time 
after with the same filler questions, but with 
nine different, though appropriate, salary 
levels, resulting in 18 levels from which to 
determine PSE, jmd, and K. 


Implications 


Once the technique is refined, general in- 
formation as to who perceives himself fairly 


PATRICIA CAIN SMITH 


paid, less than fairly paid, and more than 
fairly paid will be provided. On the basis of 
these determinations, several of the hypotheses 
that Adams and his associates have tested 
could be examined without inducing feelings 
of inequity. The quantity and quality of work 
for each employee could be compared and 
differences, if any, between those perceiving 
themselves equitably and inequitably paid 
evaluated. The influence of the input variables 
on perceptions could be examined; for ex- 
ample, what is the jmd for someone with 20 
yr. of experience as opposed to someone with 
5 yr. of experience? The effects of task dif- 
ferences on perceived equity for groups with 
similar salary scales could be investigated. 

Another analysis could be made of the 
relationship of a person’s perceptions to his 
satisfaction. It is conceivable that anyone 
who would deviate from the equitable interval 
would also be dissatisfied with his job and its 
conditions. 

Still another interesting analysis would be 
the determination of jmd for males as opposed 
to females in the same work group, or career 
women as opposed to women supplementing 
their families’ incomes. 

Finally, it would be possible to use the re- 
sults when considering the granting of raises 
and increases due to promotions, or for dis- 
tinguishing different levels of jobs. It appears 
that this would be an improvement over the 
arbitrary techniques that are presently being 
used in industry. As Jaques (1961) pointed 
out, equitable payment should prove econom- 
ically sound as well as socially just, because 
it would diminish wrangling and encourage 
efficiency. Furthermore, Opsahl and Dunnette 
(1966) found in their review of the role of 
financial compensation in industrial motiva- 
tion that companies ‘“‘assign” each employee a 
“pay curve” which is the result of successive 
alterations in compensation and compensa- 
tion policies through the years. They noted 
that the usual way of doing this is with little 
or no advanced planning. Increments are 
simply given haphazardly on a year-to-year 
basis and the resulting career curve is equally 
haphazard. If this is the situation, if incre- 
ments are given without planning and regard 
for its meaning to the employee, then a better 
technique is needed. 


PSYCHOPHYSICS AND EQUITABLE PAYMENTS 


REFERENCES 


AvaMs, J. S. Toward an understanding of inequity. 
Journal of Abnormal and Social Psychology, 
1963, 67, 422-436. 

Apams, J. S. Injustice in social change. In L. Berko- 
witz (Ed.), Advances in experimental social psy- 
chology. Vol. 2. New York: Academic Press, 1965. 

Anprews, I. R. Wage inequity and job performance: 
An experimental study. Journal of Applied Psy- 
chology, 1967, 51, 39-45. 

GurtrorD, J. P. Psychometric methods. New York: 
McGraw-Hill, 1954. 

Homans, G. C. Social behavior: Its elementary forms. 
New York: Harcourt, Brace & World, 1961. 

Jaques, E. Equitable payment. New York: Wiley, 
1961. 

Opsaut, R. L., & DunneEtTE, M. D. Role of financial 


347 


compensation in industrial motivation. Psycho- 
logical Bulletin, 1966, 66, 94-118. 

PatcHen, M. The choice of wage comparisons. 
Englewood Cliffs, N. J.: Prentice-Hall, 1961. 

Saytes, L. R. Behavior of industrial work groups: 
Prediction and control. New York: Wiley, 1958. 

STEVENS, S. S. On the psychophysical law. Psy- 
chological Review, 1957, 64, 153-181. 

WoopwortH, R. S., & ScHLosBEeRrG, H. Experimental 
psychology. New York: Holt, Rinehart & Winston, 
1954. 

ZALEZNIK, A., CHRISTENSON, C. R., & ROETHLISBERGER, 
F. J. The motivation, productivity, and satisfac- 
tion of workers: A prediction study. Boston: 
Harvard University, Graduate School of Business 
Administration, 1958. 


(Received October 24, 1967) 


Journal of Applied Psychology 
1968, Vol. 52, No. 5, 348-356 


SEARCHING FOR LETTERS OR CLOSED SHAPES IN 


SIMULATED ELECTRONIC DISPLAYS* 


E. C. POULTON 
Applied Psychology Research Unit, Cambridge, England 


16 men counted the number of bright symbols of a particular kind in a simu- 
lated electronic display presenting a total of between 19 and 59 symbols of 8 
different kinds. In some displays all the symbols were bright, in others bright 
and dim symbols were mixed. 4 sets of 8 symbols were compared. Targets were 
missed less often from sets of capital letters than from sets of closed shapes 
(p < .01) ; thus the letters were the more easily discriminated from each other. 
The probability of missing at least 1 target increased approximately linearly 
with the number of targets present (p< .01). The rate of increase was ap- 
proximately proportional to the total number of bright symbols displayed, and 
was somewhat greater when dim symbols were present also (p< .05). For a 
combined total of 57-59 bright and dim symbols, the probability of missing 
at least 1 target increased from about .03 per target when 10 of the symbols 
were bright, to .09 when 26 were bright. 35% of the underestimations were of 
size 2 or greater. The time taken to count the targets increased with the total 
number of bright symbols present (p< .01). Dim symbols tended to be 


counted as bright (p< .01). 


Electronic display systems are capable of 
showing the locations of large numbers of ob- 
jects at a time on a single cathode-ray tube. 
Where the objects can be classified into 
classes, the objects belonging to a class can all 
be represented by the same symbol. For ex- 
ample, a different capital letter can be used for 
each class. Since part of the man’s difficulty 
in spotting items of a particular class consists 
in discriminating between them and the re- 
maining items, the letters used should be 
easily discriminable from each other (Howell 
& Kraft, 1959). Men working with displays 
of this kind may be accustomed to using 
simple shapes such as squares, triangles, and 
diamonds; in which case they are not likely 
to relish a change to capital letters unless 
letters can be shown to be more easily dis- 
criminable. The principal aim of the experi- 
ment was to compare the discriminability of 
a set of capital letters with a set of closed 
shapes. Symbols intermediate between letters 
and shapes were also included (see Figure 1) 
in an attempt to determine the characteristics 
of letters and shapes which help or hinder 
discriminability. No experimental work com- 


1 The research was carried out in 1960 with the 
financial support of the British Medical Research 
Council. D. C. V. Simmonds and S. G. Blake-Smith 
very kindly made the slides. Statistical advice from 
P. M. E. Altham is also gratefully acknowledged. 


paring the relative discriminability of letters 
and shapes appears to have been published 
previously. 

In an experiment of this nature additional 
independent variables are the number of 
symbols of the class for which the man is 
searching, and the total number of symbols 
displayed (Sleight, 1952). If the symbols are 
presented bright against a background of ad- 
ditional dim symbols of similar shapes, the 
number of dim symbols can be another in- 
dependent variable. A subsidiary aim of the 
experiment was to determine the effect of these 
three distinct sources of display clutter upon 
the man’s ability to spot target symbols. 


METHOD 
Materials 


The four sets of symbols are illustrated in Figure 
1. The symbols were about 2.7 mm. tall, which is the 
height of 12-point newspaper capitals. The shapes in 
Column 4 were chosen to be representative of the 
kinds of shapes to which controllers might have be- 
come accustomed during the course of their duties. 
The capital letters in Column 1 were selected to give 
a set of shapes with features similar to those of the 
shapes in Column 4 when the spaces within the 
letters were enclosed as in Column 3. Only one 
shape, the triangle with the apex at the top, is com- 
mon to Columns 3 and 4. But both columns con- 
tain three pairs of shapes which can be confused 
(see Table 2). Column 2 represents a degree of letter 
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closure intermediate between those in Columns 1 
and 3. 

Twelve basic spatial patterns of symbols were 
selected, 2 for practice and 10 for the experiment 
proper. Both the practice patterns and 6 out of the 
10 test patterns contained both bright and dim 
symbols. The remaining four test patterns contained 
only bright symbols. Each of the 12 basic patterns 
was rotated once through 90°, once through 180°, 
and once through 270°, to give altogether 48 ar- 
rangements of symbols, 8 for practice and 40 for the 
experiment. Thus all the practice arrangements and 
24 of the test arrangements contained both bright 
and dim symbols, while the remaining 16 test ar- 
rangements contained only bright symbols. 

The 48 arrangements were reproduced four times, 
once in each of the four sets of symbols shown in 
the columns of Figure 1. The symbols were white on 
a black background as in the figure. To produce 
both bright and dim symbols on the same photo- 
graphic negative, the display was photographed twice 
by a clamped camera without changing the plate. 
First a short exposure took all the symbols. Then the 
symbols which were to be dim were removed, and a 
second exposure was made of the symbols which 
were to be bright. The pictures were finally reduced 
in size so that the positives could be mounted in 
2 X 2 in. slide cases. 

For each of the 48 arrangements the symbols in 
one of the rows of Figure 1 were selected as the 
targets. In the 8 practice arrangements of a set, 
each of the 8 kinds of symbol in the set was 
selected once. The number of bright target symbols 
on the slides ranged from 0 to 6. In the 24 test ar- 
rangements with both bright and dim symbols, each 
of the 8 kinds of symbol in the set was selected as 
the target 3 times. In the 16 test arrangements with 
only bright symbols, each of 8 kinds of symbol was 
selected twice. Selection was quasi-random with the 
restriction that the same symbol was not a target in 
two tests derived from the same basic pattern. The 
number of bright target symbols on the slides ranged 
from 0 to 11. Where both bright and dim symbols 
were present, the total number of bright symbols 
ranged from 10 to 26, the total number of dim from 
1i to 47. Where only bright symbols were present, 
their numbers ranged from 19 to 55. 


Experimental Design and Subjects 


For each of the four experimental conditions cor- 
responding to the four sets of symbols in Figure 1, 
the 40 experimental slides were arranged in four 
lots of 10, each lot containing 1 of each of the 10 
basic patterns. The order of the four lots, and the 
order of the arrangements within lots, was the same 
for each experimental condition. 

Sixteen men acted as experimental Ss. Their ages 
ranged from the mid 20s to the late 50s. One group 
of four men was experienced with electronic displays 
of this nature, another group of four men was very 
experienced, while the other two groups were, re- 
spectively, not very experienced and inexperienced. 
Each man in a group of four received the four ex- 
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bai 
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oe 
O 
a 


Fic. 1. The four sets of symbols compared. (Col- 
umn 1: letters; Column 2: semiclosed letters; Column 
3: closed letters; Column 4: shapes.) 


perimental conditions corresponding to the sets of 
symbols in Figure 1 in a different order, according 
to one of four separate Latin-square designs. 


Procedure 


Each man was tested separately. He sat in front 
of a vertical semitransparent circular screen of 12 in. 
diameter, upon the back of which the slides were 
projected. The center of the screen was approximately 
level with his eyes, and about 18 in. away. He was 
told that he could move the screen if desired to focus 
the image of a slide, and to center it. 

Before each slide was presented the man was told 
which was to be the target symbol, and whether or 
not the slide would contain dim symbols as well as 
the bright ones which alone had to be looked at. 
The slide was not presented until he indicated that 
he was ready to begin. As soon as the slide ap- 
peared, he had to count all the bright target symbols 
and call out his answer. He was told to make sure 
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that his answer was correct before he gave it, but 
not to waste time unnecessarily. A stop watch was 
started as each slide appeared, and stopped as soon 
as he gave his answer. While the answer and time 
taken were being recorded by the #’s assistant, the 
man was told which was to be the next target symbol. 

After each of the eight practice slides the man was 
told whether his answer was right or not. If he was 
wrong, E went over the slide with him until he dis- 
covered his mistake. No knowledge of results was 
given after any of the 40 test slides. At the end of 
the session the man was thanked and generally en- 
couraged. He was asked which symbols he confused 
with which and his preferences for individual symbols 
and sets of symbols. Including instruction and prac- 
tice, the first session took an average of just under 
30 min., subsequent sessions just under 20 min. 
Normally each man performed one condition per 
day, or one condition in the morning and a second 
in the afternoon. But two men worked alternately 
one afternoon, since this was the only time at which 
they were available. 


Scoring and Calculations 


Errors were classified either as underestimations of 
the correct number of target symbols or as over- 
estimations. Since over half the errors were only one 
different from the correct answer, the sizes of the 
errors have been neglected in most of the quantita- 
tive analyses. Two-tailed statistical tests have always 
been used except where stated. 


RESULTS 
Differences between Sets of Symbols 


Incidence of underestimations. Table 1 gives 
the incidence of errors separately for the four 
sets of symbols, both when only bright sym- 
bols were present in the display, and when dim 
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symbols were present as well as bright. An- 
swers underestimating the correct number of 
target symbols are shown separately from 
overestimations. The data in the bottom row 
of the table, pooled over all four sets of 
symbols, show that the incidence of under- 
estimations was virtually unaffected by the 
presence of dim symbols in the display. In 
comparing the four sets of symbols on under- 
estimations, the results from all 40 slides have 
therefore been pooled, as shown in the right- 
hand column of the table. 

Analysis of variance on the pooled under- 
estimations indicated that the effects of sets 
of symbols and of practice were both highly 
reliable (p < .001). Differences between indi- 
viduals were also highly reliable, but the 
small differences between the groups of men 
with different degrees of experience of elec- 
tronic displays were not reliable. On Tukey’s 
method for multiple comparisons (Ryan, 
1959), both the letters and the semiclosed 
letters gave reliably fewer underestimations 
than either the closed letters or the shapes 
(p < .05 or better). 

As a result of the Latin-square design, the 
shapes could have been at a disadvantage if 
there were greater positive transfer between 
the three sets of symbols resembling letters, 
than from these three sets to the shapes. To 
check that the underestimations of the shapes 
in the last column of Table 1 were not due 
entirely to this cause, the performances were 


TABLE 1 


MEAN PERCENTAGE OF INCIDENCE OF UNDERESTIMATIONS AND OVERESTIMATIONS 
ACCORDING TO SyMBOLS DISPLAYED 








16 slides with only 
bright symbols 


24 slides with bright and 


dim symbols All 40 slides 








Set of symbols Under- 
Under- Over- Under- Over- estimalcn® 
estimations estimations estimations estimations 

Letters 16.4 4% 12.0 12.3> 13.8¢ 
Semiclosed letters 113 3.58 16.4 1.6 14.44 
Closed letters 19.2 ey Dow, Sill 21.6 
Shapes 28.9 8 21.4 3.6 24.4 

All 4 sets of symbols combined 19.0 15 18.2 Sal 18.5 





a Letters-semiclosed letters p < .02. 

b Letters reliably different from all otter sets of symbols at the .01 level or better. 
¢ Letters-closed letters or shapes p < 

4 Semiclosed letters-closed letters or see pb <..05 or better. 
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examined of the six men whose first two tests 
involved shapes and either letters or semi- 
closed letters. Two men first searched for 
shapes and then for semiclosed letters, while 
two other men had these two conditions in the 
reverse order. There were also a pair of men 
whose orders of searching for shapes and 
letters balanced each other. Of these six men, 
five gave more underestimations on the shapes. 
The sixth gave the same number of under- 
estimations on the semiclosed letters as on 
the shapes, but had the shapes as the second 
condition. Correcting for order, the difference 
between the shapes and the letters or semi- 
closed letters is reliable on a two-tailed sign 
test (p < .05). Thus the greater number of 
underestimations of the shapes in Table 1 is 
unlikely to have been due entirely to cumula- 
tive transfer effects. 

Incidence of overestimations and reported 
confusions. The bottom row of Table 1 shows 
that overestimations were over three times as 
common on the slides carrying both bright 
and dim symbols, as on the slides carrying 
only bright symbols. A Wilcoxon test (Siegel, 
1956, pp. 75-83) indicated that the difference 
was reliable (p < .01). The two sets of slides 
thus need to be treated separately on this 
measure. 

The slides with only bright symbols pro- 
duced a total of only 15 overestimations. A 
sign test indicated that there were reliably 
fewer overestimations with the letters than 
with the semiclosed letters (p < .02). None 
of the other differences between sets of sym- 
bols was reliable here. All but 1 of the 15 
overestimations were just one greater than 
the correct number. They could have been 
due to counting a single target symbol more 
than once, or to counting one wrong kind of 
symbol as a target. The single remaining 
overestimation was eight too large; it was 
probably due to counting W’s instead of U’s. 

On the slides containing both bright and 
dim symbols, half the overestimations were 
two or more greater than the correct answer. 
Wilcoxon tests indicated that there were re- 
liably (p < .01 or better) more overestima- 
tions on the series of slides with letter sym- 
bols than on any of the other three series, 
which did not differ reliably from each other 
(p > .05). Since the letters gave if anything 
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TABLE 2 


REPORTED CONFUSIONS BETWEEN 
SYMBOLS 


COMMONLY 








Set of symbols Pairs confused 





Letters W and M 


Semiclosed letters V and Y 

Closed W and M 
Closed V and Y 
Closed V and A 


Closed letters 


Triangle and diamond 
Diamond and split diamond 
D on side and split D 


Shapes 





the fewest overestimations when only bright 
symbols were displayed, most of these over- 
estimations with the combined bright and dim 
symbols must have been due to counting dim 
symbols as bright. 

According to the reports of the experimental 
Ss, the most common confusions between sym- 
bols were those indicated in Table 2. 

Time taken. The mean time taken to search 
for all the bright target symbols and to call 
out the number counted was 11.9 sec. on the 
slides with only bright symbols, compared 
with 11.4 sec. on the slides with both bright 
and dim symbols. This difference was reliable 
on a Wilcoxon test against the differences be- 
tween experimental Ss (p< .02), but the 
difference was not reliable when tested against 
the differences between slides. (As will be 
pointed out later, the mean response time de- 
pended upon the number of bright symbols 
displayed. The group of slides with only 
bright symbols happened to include 4 slides 
with very large numbers of bright symbols, 
as well as 12 slides with average numbers— 
see Figure 3.) Separate analyses of variance 
on the two sets of slides indicated that the 
only other reliable effect was produced by 
the differences between individuals (p< .001). 
Mean time taken was not affected reliably 
by the set of symbols used, by practice dur- 
ing the experiment, nor by previous experi- 
ence with electronic displays. 


Sources of Clutter in the Display 


Incidence of underestimations. Figure 2 
shows how the incidence of underestimations 
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NUMBER OF TARGET SYMBOLS 


Fic. 2. Mean percentage of incidence of answers underestimating the number of target 
symbols, shown according to the number of target symbols actually present. (In A for 
slides with only bright symbols, the numbers printed beside the functions indicate the 
total number of symbols on the slide. In B for slides with both bright and dim symbols, 
the first number indicates the number of bright symbols on the slide, the number in 
brackets indicates the total number of combined bright and dim symbols present. Most 
points represent data from the same 16 men pooled over 4 slides, one with each set of 
symbols. But in functions with only 3 points, one of the points represents data pooled 


over 8 slides.) 


of the correct number of target symbols was 
influenced both by the number of target 
symbols to be counted, and by the total num- 
ber of bright symbols displayed. Each func- 
tion represents one basic pattern of symbols 
(see Method section). The number printed 
against the function gives the total number of 
bright symbols in the basic pattern. In 
Figure 2B, for the slides in which dim 
symbols were displayed as well as bright, 
the number in brackets gives the total num- 
ber of symbols in the basic pattern, both 
bright and dim. The four points on the func- 
tion are for the four different orientations of 
the basic pattern. Usually the number of 
signals of the kind designated as target was 
different for each orientation, but five of the 
functions have only three points because the 
number of target signals was the same for two 
orientations. The data for the four sets of 
symbols have been pooled. The orientation of 


the basic pattern of symbols, the particular 
symbol designated as the target, and the 
serial position of the slide in the set of 40 
test slides have been treated as random vari- 
ables in this analysis. They may account for 
some of the residual variability displayed by 
the data. 

Each function in Figure 2 was fitted by a 
straight line using the method of least squares. 
In this first analysis the three points at the 
origin in Figure 2B were excluded, since it is 
not possible to underestimate the number of 
targets when none is displayed, yet any func- 
tion with a point at the origin is almost 
bound to show a positive slope. The com- 
puted slopes of all 10 functions were found 
to be positive (p < .01). This means that as 
more target symbols were added to the dis- 
play, the probability of failing to report at 
least one target symbol increased. 

Each of the 10 functions was therefore 
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fitted by a straight line passing through the 
origin, again using the method of least squares. 
The slopes found on this second analysis are 
given in Table 3. The fitted slopes of the 
functions in Figure 2A ranged from 2.1% 
underestimations per target symbol for a total 
of 19 symbols to 7.6% per target symbol for 
a total of 55 symbols. The tau coefficient of 
rank correlation (Siegel, 1956, pp. 213-223) 
between the fitted slopes and the total num- 
ber of symbols displayed was +.67. Thus the 
probability of missing at least one target sym- 
bol tended to increase more rapidly with the 
number of target symbols as the total number 
of symbols in the display was increased. 

The fitted slopes of the functions in Figure 
2B were rather steeper, ranging from 2.9 for 
a total of 10 bright symbols to 9.0 for a total 
of 26. The four functions for totals of be- 
tween 18 and 21 bright symbols all had 
steeper fitted slopes than any of the three 
corresponding functions in Figure 2A for 
totals of between 19 and 23 symbols. A Mann- 
Whitney U test (Siegel, 1956, pp. 116-127) 
indicated that the average difference was just 
reliable (# = .028 on a one-tailed test). Thus 
the probability of missing a target symbol 
increased when dim symbols were added to a 
display containing only bright symbols. 

For the functions in Figure 2B, tau be- 
tween the fitted slopes and the total numbers 
of bright symbols in the display was +.60. 
For the total numbers of both bright and dim 
symbols in the display, and for the total num- 
bers of dim symbols only, taus were only 
+.07 and —.07, respectively. Thus the prob- 
ability of missing a target symbol depended 
a good deal more upon the total number of 
bright symbols in the display, than upon the 
total number of bright and dim symbols com- 
bined, or of dim symbols only. 

Even the larger values of tau were not re- 
liable statistically (p > .05), as there were 
not enough functions in either of the figures. 
However where two functions have one or 
more points in the same vertical columns of 
the figure, it is possible to compare their rela- 
tive heights above the abscissa at these points 
with the number of target symbols held con- 
stant. In Figure 2A the functions for 55, 23, 
and 19 total symbols all have points in the 
columns for both 3 and 2 target symbols. For 
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TABLE 3 
ComputEeD SLopES OF FUNCTIONS IN FicuRE 2 
Total number of symbols 
on slide (Figure 2A) | (Figure 2B) 
Bright sym- | Bright and 
: bols only | dim symbols 
Bri (Bright 
right + dim) 
10 (57) 2.86 
18 (45) 6.608 
18 (48) 5.438 
19 2.138 
19 (30) 5.3/8 
20 5.328 
21 (36) 7.458 
23 4.098 
26 (59) 8.99 
55 7.61 








Note.—Mean percentage of incidence of underestimations 
per target symbol. 

8 Of the functions for numbers of bright symbols ranging 
from 18 through 23, those for dim symbols as well (Figure 2B) 
all had steeper slopes (pb < .05 on a one-tailed test). 


these two numbers of target symbols com- 
bined, the function for the total of 55 dis- 
played symbols lies reliably above the other 
two functions, which give almost identical re- 
sults (p < .02 or better on Wilcoxon tests). 
The function for 55 displayed symbols pooled 
over 4 and 2 target symbols also lies reliably 
above the function for 20 displayed symbols 
(p < 02). 

In Figure 2B toward the bottom, the func- 
tions for 10 (57) and 18 (45) total symbols 
both have points in the columns for both 3 
and 4 target symbols. For these two numbers 
of target symbols combined, the function for 
18 (45) displayed symbols lies reliably above 
the function for 10 (57) displayed symbols 
(p < .01). Toward the top of the figure this 
same function for 18 (45) displayed symbols 
pooled over 1 and 4 target symbols lies re- 
liably below the function for 26 (59) dis- 
played symbols (p < .02). So does the func- 
tion for 18 (48) displayed symbols pooled 
over 4 and 5 target symbols (p < .01). Thus 
the total number of bright symbols in the 
display clearly affects the heights of the points 
in Figure 2. 

Sizes of underestimations. When two or 
more target symbols were displayed, under- 
estimations of the correct number of targets 


354 


A. BRIGHT 
ONLY 


MEAN RESPONSE TIME (SECONDS) 


oe Ye “ck i 7 9 





ieee Pourton 


B. BRIGHT 
AND DIM 





ie On |e G2 463/645 On, 


NUMBER OF TARGET SYMBOLS 


Fic. 3. Mean time taken to count target symbols according to the number of target 
symbols present. (The parameters are the same as in Figure 2.) 


of size 2 and greater averaged 35% of the 
total number of underestimations. This pro- 
portion is a good deal larger than that pre- 
dicted by a Poisson distribution of under- 
estimations. It strongly suggests that on trials 
where the man had missed one target symbol, 
he was likely to miss one or more additional 
target symbols. 

As was to have been expected, the propor- 
tion of underestimations of size 2 and greater 
increased as the number of target symbols 
increased, from 25% when there were 2 
through 4 target symbols to 55% when there 
were 7 or more target symbols. For the data 
pooled over Ss, the tau correlation between 
the proportion of underestimations of size 2 
and greater and the number (2 or greater) 
of target symbols displayed was +.56 (p 
4.05): 

Time taken. Figure 3 is similar to Figure 2, 
but the dependent variable on the ordinate is 
the mean time taken to count all the bright 
target symbols present, and to call out the 
number. In Figure 3A only the top function 
for a total of 55 bright symbols shows an 
appreciable slope. Here the difference between 


11 and 4 target symbols was highly reliable 
(p < .001), and the difference between 4 and 
both 3 and 2 target symbols was also reliable 
(p < .01). In Figure 3B three of the six func- 
tions slope up to the right, while the other 
three slope down to the right. Clearly, except 
for the slides with 55 bright symbols, the 
number of target symbols did not appreciably 
affect the mean time taken to respond. 
However mean response time tended to in- 
crease with the number of bright symbols dis- 
played. In Figure 3A the top function for 55 
displayed symbols pooled over 3 and 2 target 
symbols lies reliably above the functions for 
both 23 and 19 displayed symbols (p < .001). 
Pooled over 4 and 2 target symbols it also lies 
reliably above the function for 20 displayed 
symbols (p< .001). At the bottom of the 
figure the function for 19 displayed symbols 
pooled over 3 and 2 target symbols lies re- 
liably below the function for 23 displayed 
symbols (p < .01). Pooled over 2 and 1 target 
symbols it also lies reliably below the func- 
tion for 20 displayed symbols (p < .001). The 
tau correlation between the mean heights of 
the four functions and the total numbers of 
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symbols on the slides was +1.00 (p < .05 on 
a one-tailed test). 

In Figure 3B the differences in height cor- 
relate with the total number of bright sym- 
bols on the slide only when there are few 
bright symbols. The function at the bottom 
for 10 (57) displayed symbols pooled over 4, 
3, and O target symbols lies reliably below 
the function for 18 (45) displayed symbols 
(p < .01). For 3 target symbols it also lies 
reliably below the function for 19 (30) dis- 
played symbols (p < .05). But the relation- 
ship between the total number of bright sym- 
bols on the slide and the mean time taken 
does not hold for the functions higher up in 
the figure. Whereas the function for 18 (45) 
displayed symbols is one of the lowest, the 
function for 18 (48) displayed symbols is on 
average the highest. The tau correlation be- 
tween the mean heights of the six functions 
and the total numbers of bright symbols on 
the slides was +.53. For the total numbers of 
both bright and dim symbols in the display, 
and for the total numbers of dim symbols 
only, tau was only +.07 and —.07, respec- 
tively. 

The mean response times of the functions in 
Figure 3A for totals of 19, 20, and 23 bright 
symbols can be compared with the means of 
the functions in Figure 3B for totals of 18 
(45), 18 (48), 19 (30), and 21 (36) symbols. 
Both lots of functions are for conditions with 
about the same total numbers of bright sym- 
bols, but the conditions of Figure 3B involved 
dim symbols also. A Mann-Whitney U test 
showed no reliable difference between the two 
lots of functions (p > .05). Thus the presence 
of dim symbols in the display did not ap- 
preciably affect the mean time taken to re- 
spond. 


DIscusSsION 
Letters versus Shapes 


The right-hand column of Table 1 shows 
that the number of target symbols displayed 
was less likely to be underestimated when a 
set of capital letters was used than when a 
set of closed letters or shapes was used. A 
possible reason for the greater efficiency of 
the set of letters is that the forms of capital 
letters have been enormously overlearned. 
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Ever since a child first learns the alphabet, he 
has been perceiving and identifying capital 
letters, whereas some of the forms of the 
closed letters and of the shapes are relatively 
unfamiliar (see Figure 1). Thus in searching 
for a particular letter among a set of letters, 
the man knew precisely what he was looking 
for; he may have been somewhat less sure 
when searching for a particular closed letter 
or shape. 

Another possible reason is that the letter 
forms may be more easily discriminable from 
each other than are the forms of the closed 
letters or the shapes. V, W, U, and Y all have 
an open end at the top; A and M have an 
open end at the bottom; N has an open end at 
both the top and the bottom; D is a fully 
closed letter; whereas the closed letters and 
shapes are all closed with the single exception 
of the U. It is clear from Howell and Kraft’s 
(1959) confusion matrices that a less con- 
fusible set of capital letters could have been 
chosen, and the same probably applies to 
sets of closed shapes (Bowen, Andreassi, 
Truax, & Orlansky, 1960). But in so far as 
some of the letters are closed and some are 
open with the openings pointing in different 
directions, the direction of the opening sup- 
plies a cue which is not available with sets 
of closed forms. This point does not always 
appear to have been appreciated in consid- 
ering sets of symbols for electronic displays 
(Bowen et al., 1960; Sleight, 1952; Vicino, 
Andrews, & Ringel, 1966). 


Cluttered Displays 


Figures 2 and 3 illustrate the effect of three 
types of clutter: total number of symbols dis- 
played, number of bright symbols when only 
bright symbols require attention, and number 
of target symbols requiring immediate atten- 
tion. Figure 2 shows that the probability of 
missing at least one target symbol increased 
more or less linearly with the number of target 
symbols displayed, as if each target symbol 
had an equal chance of being the first one to 
be missed, which was independent of the 
number of other target symbols present. The 
data for multiple underestimations suggest 
that once one target symbol had been missed, 
there was an increased chance of missing at 
least one more target symbol. 
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Figure 3 shows that the number of target 
symbols displayed did not affect the time 
taken to count them when the total number 
of bright symbols was 26 or less. But with 
55 bright symbols in the display, four or more 
target symbols did cause extra delay, pre- 
sumably as a result of recounts. 

Figure 2 shows that the probability per 
target symbol of missing at least one of them 
increased as the total number of bright sym- 
bols in the display increased. Table 3 in- 
dicates that for the data of both Figures 2A 
and B, an increase in the total number of 
bright symbols by rather less than a factor of 
3 is accompanied by an increase by rather 
more than a factor of 3 in the probability of 
missing at least one target symbol. The ad- 
ditional bright symbols must either have 
masked, or else distracted attention from, the 
target symbols. 

Figure 3 shows that it also took longer to 
count the target symbols when there were 
more bright symbols in the display. Pre- 
sumably the man had to examine all the bright 
symbols to ensure that he did not miss any 
targets. It follows that the man should be 
provided with a facility for removing from 
the display categories of bright symbol which 
are not relevant to his present purpose. 


E. C. PouLton 


The chief effect of adding dim symbols to 
the display was to increase the probability 
of overestimating the number of bright target 
symbols (Table 1), some dim symbols being 
counted as targets. Table 3 shows that adding 
dim symbols also slightly but reliably in- 
creased the chances of missing at least one 
bright target symbol. Thus if the man does 
not need to refer to certain symbols, he should 
be able to remove them completely from the 
display, rather than being able only to reduce 
their brightness. 
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EFFECT OF CONTROL PLACEMENT ON INFORMATION 


TRANSFER RATE USING BISENSORY SIGNALS 
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A 2X24 factorial experiment was designed to examine the effect which 
various display-control relationships had upon the rate at which Ss could trans- 
mit information from a bisensory display. Ss were required to respond to a 
visual signal simultaneously presented with an auditory signal of equal un- 
certainty. Responses were made by depressing push buttons, The 3 response- 
code variables examined were (1) the code carried by each of the operating 
limbs, (2) the code carried by the operating digits of each hand, and (3) the 
degree of separation between the button pairs. The only variable significantly 
affecting information transfer rate was found to be the code carried by the 
operating limbs, although the other 2 variables combined with it to produce 
significant interactions. The results support the contention that the S-R ex- 
pectancies, which S$ brings to the task, are important determinants of perform- 
ance. The flexibility of the human component in man-machine systems is also 


illustrated. 


A previously reported study (Perriment & 
Webster, 1967) drew attention to some of the 
factors influencing optimal display-control ar- 
rangements, where a bisensory signal required 
a bimanual motor response. The present paper 
reports an experiment designed to examine 
further two of the previous findings, which 
were held to be of particular interest. One of 
these was the apparent superiority, in terms 
of information transfer rate (ITR), of con- 
trols compactly arranged and located directly 
in front of S, compared with controls which 
were laterally separated from the midline. The 
other was the evidence obtained for complex 
interaction effects between the three response- 
code variables studied. The work reported 
here was directed at examining the generality 
of the control separation effect, and investi- 
gating in greater detail the interaction effects 
obtained. 


MeEtTHOD 
Apparatus 


The apparatus has been described in detail else- 
where (Perriment & Webster, 1967). For the present 
study, the response panel was modified to permit a 
graduated lateral displacement of the response but- 
tons. Briefly, the signal display consisted of a 1000- 
cps tone and a light flash, simultaneously presented 
through earphones and a pair of lamps located di- 
rectly in front. The buttons were arranged on the 
panel in one of four ways: in the form of a rectangle 
directly in front of S (this arrangement is referred 
to as zero separation), or with each pair laterally 


separated from the other by 6, 12, or 24 in. The S’s 
task was to record the directional characteristics of 
the signal components, that is, whether the left or 
right lamp flashed, and whether the tone occurred 
at the left or right earphone. As each of the signal 
components required a binary judgment, the un- 
certainty associated with each bisensory event was 
2 bits. Signals were presented at approximately 5-sec. 
intervals, and immediately preceded by the onset of 
a red warning light centrally placed between the two 
signal lamps. The £ was screened from S and pre- 
sented the signals in a prearranged order, which was 
randomized within the constraint of equiprobability 
of occurrence for each signal type. Responses were 
recorded manually by E. The circuitry of the system 
allowed the identification of responses to each com- 
ponent of the signal, together with a measure of 
reaction time (elapsed time between the onset of the 
signal and the completion of the total response) in 
milliseconds. 


Subjects 


Sixty-four volunteer Ss were drawn from an in- 
troductory psychology class. No inducement was 
offered for participation. The Ss were allocated to 
the various treatment groups according to their at- 
tendance at the laboratory. None had previous ex- 
perience of the task required. As the earlier study 
had failed to produce evidence supporting the hy- 
pothesis of sex differences in performance on this task, 
sex was disregarded in the allocation of treatment 
groups. 


Procedure 


The S sat at a horizontal panel with the middle 
and index finger of each hand resting on a pair of 
response buttons. In those conditions in which the 
pairs of buttons were laterally separated, the right 
hand operated the pair on the right of the panel, 
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Fic. 1. Layout of signal display and response panel. 


and the left hand those on the left. In conditions of 
zero Separation, that is, the buttons being arranged 
in a rectangle directly in front, the right hand op- 
erated the pair farthest from S, and the left hand 
the pair closest to him. Figure 1 illustrates the layout 
of the response panel. 

The operating digits of the two hands carried 
either corresponding (DC) or noncorresponding 
(DN) codes. (See Figure 2.) For example, where 
the limb was coded to the signal’s direction of ori- 
gin, both index fingers responded to the visual 
component of the signal, one to direction right and 
the other to direction left, as in Code Condition 3. 
Alternatively, one digit responded to the visual and 
the other to the auditory components, as in Condi- 
tion 4. Similar conditions of digit correspondence 
applied when the code carried by the limbs was re- 
lated to the sensory mode of the signal components. 
In this case, correspondence was defined as both 
digits carrying the same directional code, as in Con- 
dition 1, or different codes, as in Condition 2. The 
experimental design used is illustrated in Table 1 and 
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TABLE 1 


EXPERIMENTAL DESIGN 














Digit Digit 
corresponding | noncorresponding 
oe Separation (in inches) 
OP 6 129) 24.) 0" || 6 aes 
Limb-mode a” age “Us “40 wir, a “ea a 


4 
Limb-direction | 4 | 4 | 4 4|}4],4 





Note.—Cell entries are number of Ss used. 


the display-control relationships generated by this 
design in Figure 2. Instructions appropriate to the 
code condition were read to Ss from a prepared text. 
This was followed by a training session consisting of 
two blocks of 100 signals each, separated by an in- 
terval of 3 min. A test series of 60 signals was then 
presented. The previous study had shown that per- 
formance in terms of the information transfer rate 
(ITR) had continued to improve over 10 test trials 
of 20 signals each for the 80 Ss used. The training 
session employed in this study aimed at obtaining 
performance measures which were relatively unaf- 
fected by gross learning effects. 


RESULTS 


The dependent variable used for analysis in 
this study was the ITR achieved by S. 

The multivariate analysis technique sug- 
gested by McGill and summarized by Att- 
neave (1959) was used to calculate the in- 
formation measures used. ITRs were obtained 
by dividing the absolute amount of informa- 
tion transmitted by S (T[x,;y]) by the mean 
reaction time. The obtained ITRs were sub- 
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Fic. 2. Control-display arrangements for Limb: Digit:Separation conditions. 


CONTROL PLACEMENT AND INFORMATION TRANSFER 


TABLE 2 


ANALYSIS OF VARIANCE SUMMARY OF THE OBTAINED 
INFORMATION TRANSFER Rates (BITS/SEC) 








Source df MS F ratios 
Control separation (A) 3 .16420 | 1.683 
Digit code (B) 1 .26596 | 2.726 
Limb code (C) 1 1.62741 | 16.679** 
AXB 3 .29655 | 3.039* 
INDIE 3 .64694 | 6.626** 
Bae 1 .20461 | 2.097 
Ae BX C 3 .03925 402 
Error 48 09757 

Total 63 











Note.—The homogeneity of the treatment variances was 
tested using Hartley’s Fmax test (Winer, 1962). The obtained 
F ratio of 65 was well below the level required for significance 
at the .05 pee 

0. 


p<. 
a0. 


jected to an analysis of variance, the results 
of which are summarized in Table 2. 

Examination of Table 2 shows that the only 
variable making an independent contribution 
to the total variance is that of the limb code 
used. The mean ITR obtained where the op- 
erating limb responded to the mode of the 
signal components was 1.17 bits/sec. When 
the limb responded to the directional compo- 
nents, ITR increased to 1.49 bits/sec. That 
this effect was not consistent for all treatment 
groups is shown by the significance of the A X 
C interaction (Figure 3). The significant fea- 
ture of this effect is that it is confined exclu- 
sively to the displacement of the response 
buttons from a position on the midplane of S 
to the first position of lateral separation. 

To illustrate the effect of control separation 
under each of the four response codes, treat- 


2:0 
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Fic. 3. Interaction of limb code and control separa- 
tion. (LD = limb direction; LM = limb mode.) 


ITR (bits/sec) 





Control Separation (ins.) 


Fic. 4. Interaction of digit code and control sepa- 
ration. (Comparison between limb codes: LD = limb 
direction; LM = limb mode; DC = digit correspond- 
ing; DN = digit noncorresponding.) 


ment means are plotted separately in Figure 
4, 

As the values plotted in Figure 4 represent 
the performance of the 16 treatment groups 
separately, independent tests were made to 
determine the significance of the differences 
displayed between limb-digit code groups for 
each of the lateral separation conditions, and 
between each of the separation conditions for 
each limb-digit code group. (See Tables 3 and 
4.) Table 3 summarizes the effect upon ITR 
of laterally separating the response buttons 
for each of the limb-digit code conditions. The 
impression conveyed by Figure 4 is supported 
by the results of these analyses. Only the 
limb-mode-digit-noncorresponding condition 
was significantly affected by control separa- 
tion, and this was restricted to the displace- 
ment from 0 to 6 in. separation. Table 4 pre- 
sents the results of the separate analyses made 
of the differences between the means obtained 
by each of the limb-digit groups for the lateral 
separation conditions. The differences between 
the limb codes found in the main analysis of 
variance (Table 2) are reflected in these sepa- 
rate analyses, with the exception that the 
groups operating under the condition where 
the controls were separated by 24 in. did not 
differ significantly in performance. 


DISCUSSION 


One feature of the results obtained from 
the present study, which is incongruent with 
those obtained earlier, relates to the signifi- 
cance of the control separation factor. In 
contrast to the superiority of centrally located 
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TABLE 3 


ANALYSIS OF VARIANCE SUMMARIES OF OBTAINED INFORMATION TRANSFER 
Rates (Bits/SEC): Four Copinc ARRANGEMENTS SEPARATELY 


| 


Control code 











Source df LM-DC LM-DN LD-DC LD-DN 
MS F MS F MS | F MS F 
Between groups 3 0791 15 .6753 16.98* 3435 1.96 0484 69 
Within groups 12 .1049 0397 1754 0712 
Total 15 


Note.—LM = limb mode; LD = limb direction; DC = digit 


* p< -01; 


controls, which was demonstrated in the 
earlier study, the present experiment produced 
no evidence of significant differences in per- 
formance which could be related directly to 
this factor. The explanation of this apparent 
inconsistency may be approached by consider- 
ation of the experimental designs used in the 
two studies. The previous study employed 
only two levels of each factor, giving a 2 X 
2 X 2 design in which the control separation 
factor had only two levels, centrally located, 
and 30 in lateral separation. As one of the 
aims of the present study was to determine 
the function which described the influence of 
control separation on ITR, a 2 X2 x4 design 
was used in which the separation factor had 
four levels, one of zero separation, and three 
of lateral separation. The results of the pres- 
ent study replicate, to some extent, those ob- 
tained previously in that significant differences 
in performance were found between some 


corresponding; DN = digit noncorresponding. 


groups operating with central control place- 
ment and those operating with controls later- 
ally separated (Figure 4). Further increases 
in the extent of this displacement had no sig- 
nificant effect upon performance, however. 
With the display-control relationships used 
in this study, the effect of control placement 
upon ITR was found to be of significance for 
some limb-digit codes, but not for others. 
Where the limb-digit code was affected by this 
factor, the significant displacement was from 
a central location to that where the button 
pairs were separated by 6 in. 

As stated earlier, the aim of this study was 
to examine in detail the nature of the inter- 
actions previously observed between the three 
response variables examined. Although the 
question of which, if any, of the display- 
control arrangements was the “best” was not 
considered, it is felt that some comments re- 
lated to this issue would not be inappropriate. 


TABLE 4 


ANALYSIS OF VARIANCE SUMMARIES OF OBTAINED INFORMATION TRANSFER 
Rates (Bits/sEc): Four CONTROL SEPARATIONS SEPARATELY 








Control separations 











Source df 0 in. 6 in WP a, 24 in. 
MS PF MS F MS F MS F 
Between groups 3 4854 10.65** 5342 5.54* 4041 5.92* Zod 1.43 
Within groups WW? 0455 .0964 .0682 1800 
Total 15 
*p < .05 
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In the course of analyzing the data from the 
present experiment, Duncan’s New multiple- 
range Test (Edwards, 1963) was applied to 
the means of the 16 treatment groups. It was 
found that the upper eight values plotted in 
Figure 4 formed a subset of means which did 
not differ significantly from each other at the 
.05 level. It is of interest to note that the 
treatment groups concerned represent the fol- 
lowing coding arrangements: limb-direction— 
digit-noncorresponding = 0 separation; limb- 
direction—digit-noncorresponding = 6—24 in. 
separation; limb-direction—digit-corresponding 
= 6-24 in. separation; limb-mode—digit-non- 
corresponding = 0 separation. This represents 
a considerable variety of display-control rela- 
tionships, and gives emphasis to the concept 
of flexibility which is often quoted as an attri- 
bute of major importance of the human com- 
ponent in man-machine systems. 

The results of this experiment are in ac- 
cord with previous studies concerned with 
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display-control relationships, in that they 
again demonstrate the effect upon performance 
of the S-R expectancies which S brings to the 
task. They also highlight the now well-docu- 
mented fact that satisfactory design features 
of the man-machine interface cannot be de- 
scribed on a continuum of “good, better, 
best,” but reflect a range of characteristics 
which has relatively broad limits of accepta- 
bility. 
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EVOLUTIONARY PERFORMANCE, MANAGERIAL 
ABILITIES, AND CHANGE: 


AN EXPLORATORY INVESTIGATION OF ORGANIZATIONS 


ERIC P. SANDERS 1 
San Diego State College 


In the laboratory, 50 3-man simulated organizations were observed, each 
operating a model railroad for 1 hr. and each given identical instructions. For 
each S (N =150), data were collected from the Ghiselli Self-Description In- 
ventory, the Gough Adjective Check List, and the performance variable “num- 
ber of cars per train.” The main measurements used were means, standard 
deviations, and skewness. It was shown that certain properties of an organiza- 
tion’s membership can and do affect performance through the evolution of the 
organization, and that changes of both a task and functional nature continually 


decrease from the inception to the completion of an organization. 


Over the past few years there has been a 
change in the types of problems which have 
drawn the attention of investigators in in- 
dustrial psychology. One of the new areas 
which has commanded interest may be termed 
“organizational psychology.” As Schein (1965) 
has indicated, this new field emphasizes the 
“recognition that organizations are complex 
systems . . . and human behavior within the 
organization has to be viewed from the per- 
spective of the entire . . . system [p. 3].” 

The present problem is concerned not with 
the study of individuals comprising groups, 
but rather with an examination of the total 
organization’s behavior as a complex entity, 
where the interest is in the exploration of 
changes in the organization’s dynamic “life- 
span.” 

The term “organization” as used in this 
study will be defined as a social group where 
the individuals within it are differentiated 
with respect to their function in connection 
with a task of achieving some common goal 
(after Stogdill, 1950). 

Organizational evolution is defined as the 
maturation of the organization from its in- 
ception to its completion. Ghiselli (1966) and 
Ghiselli and Lodahl (1958) have studied 
properties of groups in relation to perform- 


1A study done in partial fulfillment of the re- 
quirements for a doctoral degree at the University of 
California, Berkeley. The author wishes to thank 
Edwin E. Ghiselli, University of California, Berkeley, 
for his judicious help and constructive criticism 
which helped make this study possible. 
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ance, and some work has been done in view- 
ing organizational properties (e.g., Ghiselli, 
1963; Smelser, 1961). However, the relation- 
ship between organizational properties and 
performance over time has not been investi- 
gated and is the concern of the present study. 

The other part of this investigator’s con- 
sideration will be focused upon “organiza- 
tional stability.” As the organization develops 
it tends to become fixed in its procedures. 
This is implicitly stated in Weber’s (1946) 
classic bureaucratic theory and other theories 
in small group research, but it has not, to 
this researcher’s knowledge, been examined 
in an experimental setting. 

The two major problems, therefore, upon 
which attention is focused in this investiga- 
tion are: 

(a) organizational evolution and the prop- 
erties of the organization’s membership which 
affect performance during evolution, and 

(6) organizational stability—-whether or 
not an organization maintains its original 
functions or whether they are changed and 
modified by the members over time. 


METHOD 


In order to examine the problems raised above, a 
complex model railroad task was developed. This 
type of apparatus, but on a much simpler scale, was 
used before by Ghiselli and Lodahl (1958), Ghiselli 
(1966), and Smelser (1961) for studying group per- 
formance. The railroad is a useful task for the 
present purpose because it is inherently interesting 
and motivating for the young men used as Ss. 

Fifty three-man organizations worked on the actual 
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task for a total time of 1 hr. Participants were in- 
structed that it was a cooperative task and each 
group was given identical instructions. None of the 
members had worked together prior to the experi- 
mental situation. The task was subdivided into thirds: 
the first third (Trials 1-4); the second third (Trials 
5-8); and the last third (Trials 9-12). Each trial 
was 5 min. in length. 

After completing the task session, Ss were given 
the Ghiselli Self-Description Inventory (SDI) and 
the Gough Adjective Check List (ACL). The former 
gives an indication of the extent to which individuals 
possess certain managerial abilities. The six scales of 
the Ghiselli SDI relevant to this investigation were 
titled: Intelligence, Initiative, Supervisory Ability, 
Perceived Occupational Level, Decision-Making Ap- 
proach, and Maturity. The Gough ACL is used to 
describe certain psychological attributes of a person. 
The selected scales used were: Achievement, Domi- 
nance, and Aggressiveness. The average reliability for 
both the Ghiselli SDI and the Gough ACL was .80. 
For each simulated organization the mean, range, 
and skewness of scores were determined for each of 
the properties measured by the SDI and ACL. 

The reliability of performance was determined by 
the odd-even correlation method. As a measure of 
“efficiency” of performance (rather than just sheer 
amount), “cars per train’? was used. The uncorrected 
reliability of this measure was .70. 

In order to ascertain whether or not changes took 
place, specific job changes and changes in functional 
procedures were recorded. 


RESULTS 


The data indicate that as organizations 
evolve performance improves considerably. 
The mean number of “cars per train” (ef- 
ficiency) increased from 3.80 to 4.40 from 
Stage I to Stage III, respectively, the differ- 
ence being significant at the .01 level. 

Although one might expect that, as the in- 
telligence and initiative of the organization 
members, on the average, increase, the better 
would be the performance at the inception of 


TABLE 1 


CORRELATION BETWEEN AVERAGE INTELLIGENCE, 
AVERAGE INITIATIVE, AND AVERAGE MANAGERIAL 
ABILITIES AND PERFORMANCE THROUGH THE 
INITIAL AND FINAL STAGES OF ORGANIZA- 

TION EVOLUTION 
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TABLE 2 


MEAN DIFFERENCES BETWEEN HicH AND Low DMA 
DIFFERENTIATION AND PERFORMANCE DURING 
EARLY AND LATER STAGES OF ORGANIZA- 

TION EVOLUTION 








Low DMA | High DMA 








ee. skew means | skew means aloe 
i 46.6 50.0 93 
IIT 46.0 51.0 eye 
*p >.10. 


the organization, Table 1 shows this relation- 
ship does not appear to be the case for aver- 
age intelligence, though it does seem to be so 
for initiative. That is, one’s expectation would 
be for a higher correlational relationship at 
the early stages of evolution and a lower rela- 
tionship at the later stages. The data do not 
show this to be the case. 

The various “managerial abilities” were 
formed into a composite “managerial abilities 
variable” following Ghiselli’s (1964) formula, 
and then correlated with performance during 
the initial and final stages of organization 
evolution. The data indicate that fewer man- 
agerial abilities are manifested, on the aver- 
age, by the organization membership as per- 
formance improves over time. 

If there were one person who was more 
highly differentiated with regard to his de- 
cision-making approach (DMA) than the 
other members of the organization, the better 
would be the organization’s later performance 
(Table 2). The groups that distinguished the 
strong decision maker did perform better 
than those that did not. This is most readily 
seen in the differences in performance during 
the final evolutionary stage (Stage III). 

It appears to be quite significantly estab- 
lished, as shown in Table 3, that there is 


TABLE 3 


DIFFERENCES IN MEAN NUMBER OF ORGANIZA- 
TION CHANGES DURING THE STAGES OF 
ORGANIZATION EVOLUTION 

















Properties Stage I Stage | Differ- 
P III | ence Stages | ID | III I & IT| Mean changes 
Average intelligence 08 I ns I DOr 4.00* Be 
Average initiative zis) —.12 ns II 3.00* 46 
Average managerial ability .06 —.21 2.40* Ii Eoin 16 
*p = .05. *p > .005 
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greater stability (fewer organization changes) 
at the later stages of organization evolution 
than at the earlier stages. The number of 
changes dropped off some 80% from the first 
to the last stage. 


DISCUSSION 


Generally speaking, performance through- 
out the “life” of the organization continually 
improves. This supports the results found by 
Ghiselli (1966) in that the “learning” of total 
organizations appears to follow a similar pat- 
tern to that of individuals—practice improv- 
ing performance with the improvement shown 
in the characteristic negative acceleration. 
Since this “learning” or performance was 
measured in terms of a single organizational 
score (number of cars per train), rather than 
just the sum of aggregate member scores, it 
can be termed organizational learning. 

The mean intelligence level of the groups 
forming the organizations is unrelated to per- 
formance whether early or later in the or- 
ganization’s history. This agrees with Ghiselli’s 
(1966) results which indicate that organiza- 
tion performance is not necessarily related to 
the mean scores for the organization. Since 
intelligence is defined here as being a wide 
domain of mental abilities rather than any 
particular traits, it is passible that perform- 
ance during the initial stages of organizational 
evolution may not be related to the mean 
intelligence of the organization members, but 
may be accounted for only by those certain 
properties associated with particular behavior 
at this evolutionary stage (e.g., ability to deal 
with ambiguity, etc.), which may be masked 
by using a mean score measure, the implica- 
tion being that coordination among members 
of an organization may be impeded in co- 
operative situations by good problem solvers 
(implied high intelligence). 

Initiative, on the other hand, presents an- 
other picture. Initiative is defined by Ghiselli 
(1956) as “involving the beginnings of ac- 
tions and the capacity to note or discover 
new means of goal achievement [p. 311].” 
This implies acting independently of others 
and the ability to develop some sort of struc- 
ture from a nonstructured situation. If an 
organization has members who score higher, 
on the average, on initiative, the better will 
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be the performance during the initial stages 
since these individuals will be better able to 
structure the situation. Such appears to be 
the case with the groups studied. 

It seems that if an organization differen- 
tiates a single member who is a strong deci- 
sion maker (high positive skewness), while 
the remainder of the members are relatively 
homogeneous, then the later performance of 
that organization tends to improve. Since there 
is a relationship between those members who 
score high on the DMA skewness scale and 
high in initiative (r= .24, p> .10) and 
since initative is significantly related to cer- 
tain managerial abilities (e.g., supervisory 
ability, 7;, = .48; perceived occupational level, 
Yip = .59; achievement, 7ia = .39; and domi- 
nance, 7jqa = .45; all significant at the .01 
level), one would expect that an uncontested 
strong decision maker in the organization 
would not only be able to influence other 
member’s performance but also, resulting from 
his self-confidence and strong initiative, main- 
tain its upward direction through the organiza- 
tion’s evolution. Generalizations to all types 
of organizations, however, must be made with 
caution since the results are based on loosely 
structured task groups which allow flexibility 
of interaction as well as complete freedom in 
sharing in the decision-making process. With 
a strong hierarchically structured “formal or- 
ganization,” these results might show a differ- 
ent type of relationship. However, in this 
situation, it is evident that with a highly 
differentiated decision maker, the organiza- 
tion will perform better later. These results 
coincide with some of the classic bureaucratic 
theories of Weber (1946) and the studies by 
Ghiselli and Lodahl (1958) and Ghiselli 
(1966). 

The classic bureaucratic assumption regard- 
ing organization stability is tenable, accord- 
ing to these data. This indicates that at the 
inception of an organization there tends to be 
an unstructured, ambiguous situation wherein 
the members are just trying to understand 
their particular tasks, trying to fit their tasks 
into the total pattern of organizational func- 
tioning, and looking for some unifying direc- 
tion. Resulting from this situation, more 
changes occur in the tasks, procedures, and 
structure during the early stages of evolu- 
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tion, which is to say that the stability of 
organizations increases throughout organiza- 
tional evolution. 
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TESTING TRADITIONAL AND TWO-FACTOR HYPOTHESES 


CONCERNING JOB SATISFACTION 


GEORGE B. GRAEN ? 


University of Illinois 


Traditional and 2-factor theories make contradicting hypotheses concerning 
the relationships between satisfaction with job factors and overall job satisfac- 
tion. Traditional theory predicts essentially linear relationships, whereas 
2-factor theory predicts particular nonlinear relationships. Ss were 167 male 
and 152 female office workers employed by a single corporation. Data collected 
included satisfaction with work itself, promotion, and pay as measured by 
the scales of the Job Description Index and overall job satisfaction as measured 
by the General Motors Faces Scale. Data were analyzed by analysis of 
variance. Results indicated not only that 2-factor predictions were not con- 
firmed on 4 out of 6 relationships but also that 3 of these 4 misses were 


opposite that predicted. Traditional predictions generally were supported. 


The two-factor theory of job satisfaction 
(Herzberg, 1966; Herzberg, Mausner, & 
Snyderman, 1959) hypothesizes two sets of 
job variables—job-content or “satisfier” vari- 
ables and job-context or “‘dissatisfier” vari- 
ables. Moreover, these two sets of variables 
(job-content and job-context) are assumed to 
contribute to overall job satisfaction in dif- 
ferent manners. Job-content variables are as- 
sumed to account primarily for variance on 
the satisfaction portion of the continuum of 
job satisfaction—dissatisfaction. According to 
two-factor theory, when the positive aspects 
of job-content variables, such as work itself 
and advancement, are present on the job, 
they can increase job satisfaction from in- 
difference to lust for the job. However, when 
the negative aspects of these variables are 
present, they can do little or nothing to pro- 
mote dissatisfaction. In contrast, when the 
positive aspects of job-context variables, such 
as salary, are present, they cannot, according 
to the theory, increase satisfaction much 
above indifference. However, when their nega- 
tive counterparts are present, they can in- 
crease dissatisfaction from indifference to 
scorn for the job. Thus, two-factor theory 
predicts nonlinear relationships between satis- 
faction with job variables and overall job 
satisfaction. 

As an alternative formulation, the tradi- 
tional theory of job satisfaction (Ewen, Smith, 


1 The author wishes to thank Marvin D. Dunnette 
and Charles L. Hulin for their cooperation and sup- 
port. 


Hulin, & Locke, 1966; Graen, 1966) hypothe- 
sizes that if job variables are categorized ac- 
cording to the two-factor classification, both 
job-content and job-context variables account 
for variance over the entire satisfaction—dis- 
satisfaction continuum. Moreover, job-content 
variables are more potent contributors to sat- 
isfaction—dissatisfaction than are job-context 
variables. Traditional theory hypothesizes 
that if the presence of a variable contributes 
to job satisfaction, the absence of that vari- 
able will contribute to dissatisfaction and 
vice versa. Thus, traditional theory predicts 
essentially linear relationships between satis- 
faction with job variables and overall job 
satisfaction. Viewed in this way, two-factor 
and traditional theories specify different func- 
tions between job variables and overall job 
satisfaction. Two-factor theory predicts non- 
linear relationships; traditional theory pre- 
dicts essentially linear relationships. 

Two recent articles (Ewen et al., 1966; 
Graen, 1966) have presented evidence indi- 
cating the two-factor hypotheses do not hold 
for work itself, promotion, or pay. These in- 
vestigators found that dissatisfaction with the 
job-content variables (work itself and promo- 
tion) contributed to overall dissatisfaction 
and that satisfaction with the job-context 
variable (pay) contributed to overall satis- 
faction. Taken as a whole, these results clearly 
supported traditional theory at the expense 
of two-factor theory. 

These results of Ewen and Graen using 
male industrial workers so clearly contradicted 
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two-factor predictions that further research 
is needed to determine their generality. The 
purpose of the present study was to test the 
generality of the Ewen and Graen findings 
by applying their methods to samples of male 
and female office workers. As in the Ewen 
and Graen studies, two major hypotheses 
were tested on which two-factor and tradi- 
tional theories conflict. These hypotheses are 
stated below. 

Hypothesis 1. If one group of employees is 
dissatisfied, a second group neutral, and a 
third group satisfied with regard to a job- 
content (satisfier) variable, two-factor theory 
predicts equal overall satisfaction for the first 
and second groups and higher satisfaction 
than the other two groups for the third. In 
contrast, traditional theory predicts an in- 
creasing linear trend in satisfaction from 
the first to the third group. 

Hypothesis 2. If one group of employees is 
dissatisfied, a second group neutral, and a 
third group satisfied with regard to a job- 
context (dissatisfier) variable, two-factor the- 
ory predicts equal overall satisfaction for the 
second and third groups and lower satisfaction 
than the other two groups for the first. In 
contrast, traditional theory again predicts an 
increasing linear trend in satisfaction from 
the first to the third group. 

In addition to these two major hypotheses, 
the Ewen and Graen studies generated two 
other hypotheses that will be tested in the 
present study. These two hypotheses are 
stated below. 

Hypothesis 3. Job-content variables are 
more strongly related to overall satisfaction— 
dissatisfaction than are job-context variables. 

Hypothesis 4. The relationships between 
job-context variables and overall satisfaction— 
dissatisfaction depend on the level of satisfac- 
tion with the job-content variables. 


METHOD 


The Ss were home-office employees of one cor- 
poration. All employees were surveyed as part of a 
job-attitudes study conducted by the company. A 
mail questionnaire assuring anonymity to the re- 
spondents was administered through departmental 
representatives. A gratifyingly high rate (83%) of 
return was obtained and all questionnaires were 
usable. The returns included 167 questionnaires from 
males and 152 from females. 
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Among other instruments included in the ques- 
tionnaire were the Job Description Index (JDI) and 
the General Motors Faces scale. The JDI developed 
at Cornell is an adjective checklist dealing with five 
job variables, namely, work itself, promotion, pay, 
supervision, and co-workers. The JDI has been 
shown to possess acceptable convergent and dis- 
criminant validity (Hulin, Smith, Kendall, & Locke, 
1963; Kendall, Smith, Hulin, & Locke, 1963; Locke, 
Smith, Hulin, & Kendall, 1963; Locke, Smith, 
Kendall, Hulin, & Miller, 1964). This study employed 
only the work itself, promotion, and pay scales of 
the JDI. The General Motors Faces Scale (Kunin, 
1955) was used as the measure of overall job satis- 
faction. For details of the data-collection procedure, 
see Hulin and Smith (1967). 

In the analyses of Ewen and Graen, the variables 
of work itself and promotion were analyzed as a 
composite variable. In the present study, work itself 
and promotion were analyzed separately. Analyses 
employed a weighted means, two-way analysis of 
variance to test the relationships between the JDI 
variables of work itself, promotion, and pay and 
overall job satisfaction measured by the General 
Motors Faces Scale. These analyses were performed 
separately for males and females. Each of the three 
JDI variables (work itself, promotion, and pay) was 
trichotomized into dissatisfied, neutral, and satisfied 
with regard to the job variable. Cutting points on 
these JDI variables were supplied by Ewen (1965). 

In all ANOVAs, the first dimension was a job-con- 
tent variable (either work itself or promotion), and 
the second was the job-context variable (pay). Each 
dimension had three levels, namely, dissatisfied, 
neutral, and satisfied. The criterion measure in all 
ANOVAs was overall job satisfaction as measured 
by the General Motors Faces Scale. In addition, a 
priori contrasts were tested on the main effects 
(Winer, 1962). For each dimension, one contrast 
(Ai and Bi) tested the difference on overall satis- 
faction between the group dissatisfied and the group 
neutral with regard to the JDI dimension. A second 
contrast (Az and Bz) tested the difference on overall 
satisfaction between the group neutral and the group 
satisfied with regard to the JDI dimension. Omega- 
squared (Hays, 1963) was employed to measure the 
strength of relationships between JDI dimensions and 
overall job satisfaction. Finally, the trend in means 
on overall satisfaction for groups dissatisfied, neutral, 
and satisfied with regard to each JDI variable was 
tested for curvilinearity (McNemar, 1962). 


RESULTS 


Results of the ANOVAs for male employees 
are shown in Tables 1 and 2. According to 
Tables 1 and 2, work itself was related to 
high and not low overall satisfaction; promo- 
tion was related to low and not high satisfac- 
tion; pay was related to high and not low 
satisfaction. Thus, work itself was predicted 
and promotion was not predicted in Hypothe- 
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TABLE 1 


ANALYSIS OF VARIANCE FOR A PRIORI CONTRASTS AND 
OMEGA-SQUARED FOR STANDARD EFFECTS OF THE 
SATISFIER (WORK) AND THE DISSATISFIER 
(Pay) Dimensions ON OVERALL JOB 
SATISFACTION FOR MALES 








Source of variation | df | MS F Omega? 
Satisfier (work) Dele olesen ozone a2 

Ai:Sp versus Sy 1 |] 6.89} 2.86 

As: Sn versus Sg Lae23. 0459 502% 
Dissatisfier (pay) 2 le SoOl, o. Oe .03 

Bi: Dp versus Dy 1 10 04 

Be: Dy versus Dg Ae OG Aso Oe 
Interaction AR Bysehsh||) P2285} 03 
Error 158 | 2.41 





Note.—S and D represent the satisfier and dissatisfier 
factors, respectively. Subscripts D, N, and S represent the 
dissatisfied, neutral, and satisfied levels, respectively. 

* p< .05. 

E> < :01. 


sis 1 by the two-factor theory. Moreover, 
promotion not only failed to support this 
two-factor hypothesis, but actually demon- 
strated a relationship the reverse of that 
predicted. Two-factor theory predicted, in 
Hypothesis 1, that content variables should 
show significant differences, if any, between 
neutral and satisfied groups and not between 
dissatisfied and neutral groups. Although this 
was the case for work itself, the opposite was 
the case for promotion. In Hypothesis 2, two- 
factor theory predicted that context variables 


TABLE 2 


ANALYSIS OF VARIANCE FOR A Priori CONTRASTS AND 
OMEGA-SQUARED FOR STANDARD EFFECTS OF THE 
SATISFIER (PROMOTION) AND THE Dis- 
SATISFIER (PAY) DIMENSIONS ON 

OVERALL JOB SATISFACTION 








FOR MALES 
Source of variation | df | MS PF Omega? 
Satisfier (promotion) 2 | 33.08 | 13.48** oL2 
Ai:Sp versus Sy a2 Sono Ooms 
A2:Swn versus Sg 1!/ 6.04] 2.47 
Dissatisfier (pay) 2 | 13.94] 5.68** .05 
Bi: Dp versus Dy fl 2:55 | 1.04 
Be: Dy versus Dg 1/ 9.08} 3.71 
Interaction 4) 3.16} 1.29 01 
Error 158 | 2.45 





Note.—S and D represent the satisfier and dissatisfier 
factors, respectively. Subscripts D, N, and S represent the dis- 
satified, neutral, and satisfied levels, respectively. 


** > < 101, 
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should show significant differences, if any, be- 
tween dissatisfied and neutral groups and not 
between neutral and satisfied groups. In con- 
tradiction to this hypothesis, pay showed a 
relationship opposite that predicted by two- 
factor theory. Overall, these results for male 
employees confirm two-factor predictions for 
only a single content variable (work itself) 
and strongly disconfirm two-factor predictions 
for both the second content variable (promo- 
tion) and the context variable (pay). 

Results of the ANOVAs for female em- 
ployees are presented in Tables 3 and 4. As 
shown in Tables 3 and 4, work itself was re- 
lated to both high and low overall satisfac- 
tion, and both promotion and pay were re- 
lated to high but not low satisfaction. In the 
case of females, work itself was not predicted 
and promotion was predicted in Hypothesis 1 
by two-factor theory. In agreement with the 
results for males, pay showed a relationship 
opposite that predicted in Hypothesis 2 by 
two-factor theory. These results for females 
were similar to those for males with the ex- 
ception that promotion was predicted and 
work itself was not predicted. Therefore, two- 
factor predictions fared no better in either 
sample: Hypothesis 1 was supported by one 
variable and not supported by the other vari- 
able; Hypothesis 2 was not supported. 

Results on the predictions of traditional 
theory are shown in Table 5. According to 
Table 5, the means on overall satisfaction 


TABLE 3 


ANALYSIS OF VARIANCE FOR A PRIORI CONTRASTS AND 
OMEGA-SQUARED FOR STANDARD EFFECTS OF THE 
SATISFIER (WoRK) AND THE DISSATISFIER (Pay) 
DIMENSIONS ON OVERALL JOB SATISFACTION 
FOR FEMALES 











Source of variation | df | MS PB Omega? 
Satisfier (work) 2 | 91.50] 34.81* eo 

Ai:Sp versus Sn 1 | 58.06 | 22.08* 

Ao:Sn versus Sg ft 25.05 O25 
Dissatisfier (pay) 2 | 19.78 eon 05 

B,;:Dp versus Dy 1 .67 a} 

Bo: Dn versus Dg SHES 7aie2. OO 
Interaction 4 1.38 Ow .00 
Error 143 | 2.63 


Note.—S and D represent the satisfier and dissatisfier 
factors, respectively. Subscripts D, N, and S represent the 
dissatisfied, neutral, and satisfied levels, respectively. 


*p< 01 
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TABLE 4 


ANALYSIS OF VARIANCE FOR A PRIORI CONTRASTS AND 
OMEGA-SQUARED FOR STANDARD EFFECTS OF THE 
SATISFIER (PROMOTION) AND THE DISSATISFIER 
(Pay) DrweNnsIONS ON OVERALL JoB 
SATISFACTION FOR FEMALES 








Source of variation | df | MS F Omega? 
Satisfier (promotion) ZNoO20elOuses 10 

Ai:Sp versus Sn 1]; 4.00) 1.19 

Ao:Sn versus Sg 1 | 20.91 | 6.20* 
Dissatisfier (pay) 2 2050s 7/42" .08 

Bi: Dp versus Dy 1 45 13 

Bo: Dy versus Ds 15e355509 ee OL O0R 
Interaction 4 | 2.34 69 .00 
Error 143°) 3:37 





Note.—S and D represent the satisfier and dissatisfier 
factors, respectively. Subscripts D, N, and S represent the dis- 
satisfied, neutral, and satisfied levels, respectively. 

= Di< 05, 

arp <= .01., 


demonstrated an increasing monotonic trend 
for groups dissatisfied, neutral, and satisfied 
with the job variables. This relationship, pre- 
dicted by traditional theory, was shown in 
both samples by work itself, promotion, and 
pay. Of the six relationships tested for curvi- 
linearity, only that between work itself and 
satisfaction for females was significantly curvi- 
linear. However, the trend in means on satis- 
faction for this single curvilinear relationship 
was opposite that predicted by two-factor 
theory. Although two-factor theory predicted 
no difference in satisfaction between the 
groups dissatisfied and neutral with regard to 
work itself, the largest difference was between 
these very groups. Thus, predictions made by 
traditional theory in Hypotheses 1 and 2 were 
supported by these data. Moreover, these data 
clearly are not compatible with two-factor 
predictions of nonlinear relationships be- 
tween satisfaction with job variables and 
overall satisfaction. 

The results relevant to Hypothesis 3 are 
the measures of strength of association 
(Omega-squared). These results strongly sup- 
port the hypothesis that content variables 
(work itself and promotion) are more highly 
related to overall satisfaction—dissatisfaction 
than context variables (pay). In the male 
sample, work itself and promotion accounted 
for 12% of the total variance in overall satis- 
faction, and pay accounted for 3% and 5%. 
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TABLE 5 


MEANS ON OVERALL JOB SATISFACTION FOR THE 
Groups DIssATISFIED, NEUTRAL, AND 
SATISFIED WITH REGARD TO WORK 
ItsELF, Pay, AND PROMOTION 














Work ay Promotion 
Group 
n M n M n M 
Males 
Dissatisfied | 12 | 6.17 DATA Aa 714. 
Neutral 18 | 7.17 PSY) IKE || SBE | S72 
Satisfied 137 | 8.47 | 117 | 8.43 | 90 | 8.68 
Females 
Dissatisfied | 19 | 5.21* |} 18] 6.89 | 61 | 7.36 
Neutral 26 | 7.73 PN TADS) |) PAE Soiealal 
Satisfied 107 | 8.89 | 112 | 8.67 | 64 | 9.11 




















Note,—Standard deviations on overall satisfaction were 1.73 
and 2.07 for males and females, respectively. 
* Significantly nonlinear at p < .01. 


Similarly, in the female sample, work itself 
accounted for 27% of the total variance in 
satisfaction, promotion accounted for 10%, 
and pay for 5% and 8%. Further evidence of 
the large differences in strength of relation- 
ship can be seen in the differences between 
means shown in Table 5. 

Results on Hypothesis 4 are the ANOVA 
tests on interactions. According to Tables 1 
through 4, none of the interactions between 
content and context dimensions was signifi- 
cant. Therefore, the hypothesis that the func- 
tioning of context variables depends on the 
level of satisfaction with the content variables 
was not supported by these data. 


DISCUSSION 


The results of this study, taken as a whole, 
must be viewed as damaging to the two-factor 
theory. Although failure by two-factor theory 
to predict four out of six relationships seems 
discouraging enough, three of these four 
misses actually were opposite that predicted. 
In addition, the trends of the means on over- 
all satisfaction for the groups dissatisfied, 
neutral, and satisfied with the job variables 
were increasing monotonic and essentially 
linear, rather than nonlinear as predicted 
by two-factor theory. In contrast, hypotheses 
of traditional theory generally were supported 
in this study. The prediction of increasing 
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linear trends in satisfaction for groups dis- 
satisfied, neutral, and satisfied with regard 
to job variables was supported for both the 
content variables of work itself and promo- 
tion and the context variable of pay. These 
findings held for both males and females with 
a single exception. 

This study also supports the conclusions 
of Friedlander (1964), Wernimont (1966), 
Ewen et al. (1966), and Graen (1966) that 
content (intrinsic) variables are more strongly 
related to overall satisfaction—dissatisfaction 
than are context (extrinsic) variables. In the 
previous study by Graen (1966) the content 
variable accounted for 18% and the context 
variable 2% of the total variance in overall 
satisfaction. In the present study, the content 
variables accounted for between 10% and 
27% and the context variable accounted for 
between 3% and 8%. 

In agreement with Graen (1966), this study 
failed to yield a significant interaction be- 
tween content and context variables. Thus, the 
hypothesis by Ewen et al. (1966) that the 
functioning of extrinsic variables depends on 
the level of satisfaction with the intrinsic 
variables finds no support in this study. 

One major difference in results appeared be- 
tween this replication study and the original 
studies by Ewen and Graen. The sample em- 
ployed by Ewen and Graen consisted of male 
employees selected to represent the popula- 
tion of industrial concerns employing 50 
or more persons (Kendall, 1963). In those 
studies, the content variable (a composite of 
work itself and promotion) and the context 
variable (pay) contributed significantly to 
both high and low overall satisfaction. In the 
male sample of the present study, work itself 
contributed only to high satisfaction, promo- 
tion only to low satisfaction, and pay only to 
high satisfaction. The main difference be- 
tween these two sets of results was that pay 
failed to contribute to low satisfaction in the 
present study. This difference probably was 
due to the situational variables associated 
with a single company. 

Even within the same company, this study 
found large sex differences. In the sample of 
female employees, work itself contributed to 
both high and low overall satisfaction, and 
promotion and pay only to high satisfaction. 


GrEoRGE B. GRAEN 


In contrast, for males working in the same 
location, promotion contributed only to low 
satisfaction, and work itself and pay only to 
high satisfaction. 

This study illustrates once again that the 
major hypotheses of the two-factor theory, in 
their existing form, cannot predict results pro- 
duced by the present method. Although two- 
factor hypotheses summarize the results pro- 
duced by the story-telling method very well, 
they appear useless in predicting results of 
studies employing other methods. For docu- 
mentation on this point, see Dunnette, Camp- 
bell, and Hakel (1967). Unless the two-factor 
theory can be modified so as to account for 
the disconfirming results obtained when dif- 
ferent methods have been employed it should 
be regarded as of limited usefulness in the 
attempt to understand work motivation. 
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Proposes a new emphasis in the prediction of future job behavior. The essence 
of the suggested procedure is the establishment of consistencies between relevant 
dimensions of job-behavior and preemployment-behavior samples obtained from 
real or simulated situations. If samples instead of signs are employed, a number 
of prediction and measurement problems seem to be alleviated or at least 
confronted more directly. An emerging technology of behavior sampling and 
measurement would appear to put these goals within reach. 


Many writers (e.g., Dunnette, 1963; 
Ghiselli & Haire, 1960; Guion, 1965; Wallace, 
1965) have expressed concern about the dif- 
ficulties encountered in trying to predict job 
performance, and in establishing the validity 
of tests for this purpose. In general, their mis- 
givings center around the low validities ob- 
tained and misapplications of the so-called 
“classic validity model.” To help ameliorate 
these difficulties it is proposed here that the 
concept of validity be altered as it is now 
applied to predictive and concurrent situations 
and introduce the notion of “behavioral con- 
sistency.”’ By consistency of behavior is meant 
little more than that familiar bit of conven- 
tional wisdom, ‘The best indicator of future 
performance is past performance.” Surpris- 
ingly few data seem to exist to either support 
or refute this generalization. It deserves con- 
siderably more attention. 


SomE HIsToRY 


It is perhaps not too difficult to trace the 
steps by which applied psychologists arrived 
at their present situation. During both World 
War I and World War II general intelligence 
and aptitude tests were effectively applied to 
military personnel problems. Largely as the 
result of these successes, the techniques de- 
veloped in the armed services were trans- 
ported to the industrial situation and applied 
to the personnel problems of the business or- 
ganization. From a concentration on global 
measures of mental ability, validation efforts 
branched out to include measures of specific 
aptitudes, interests, and personality dimen- 
sions. The process is perhaps most clearly il- 
lustrated by the efforts of the United States 
Employment Service to validate the General 


Aptitude Test Battery across a wide range 
of jobs and occupations. In general, testing 
seemed to be a quick, economical, and easy 
way of obtaining useful information which 
removed the necessity for putting an individ- 
ual on the job and observing his performance 
over a trial period. 

It was in the context of the above efforts 
that an unfortunate marriage occurred, 
namely, the union of the classic validity 
model with the use of tests as signs, or in- 
dicators, of predispositions to behave in cer- 
tain ways (Cronbach, 1960, p. 457), rather 
than as samples of the characteristic behavior 
of individuals. An all too frequent procedure 
was to feed as many signs as possible inte 
the classic validity framework in hopes that 
the model itself would somehow uncover 
something useful. The argument here is that 
it will be much more fruitful to focus on 
meaningful samples of behavior, rather than 
signs of predispositions, as predictors of later 
performance. 


THe CoNnsIsTENCY MopEL 


To further illustrate the point, consider a 
hypothetical prediction situation in which the 
following five measures are available: 


1. Scores on a mental ability test; 

2. School grade-point average (GPA); 
3. Job-performance criterion at Time 1; 
4. Job-performance criterion at Time 2; 
5. Job-performance criterion at Time 3. 


Obviously, a number of prediction oppor- 
tunities are possible. Test scores could be cor- 
related with GPA; school achievement could 
be correlated with first-year job success; or 
the test scores and GPA could be combined 
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in some fashion and the composite used to 
predict first-, second-, or third-year job per- 
formance. All of these correlations would be 
labeled validity coefficients and all would 
conform to the classic validity model. It is 
less clear what label should be attached to 
the correlation between two different measures 
of job performance. Few would call it validity; 
many would probably refer to it as relia- 
bility. There seems to be a tendency among 
applied psychologists to withhold the term 
validity from correlations between measures 
of essentially the same behavior, even if they 
were obtained at two different points in time. 
That is, the subtleties of the concept of re- 
liability and the ingredients of the classic 
validity model seem to have ingrained the 
notion that validity is a correlation between 
a predictor and a criterion and the two should 
somehow be dissimilar. 

However, each of the 10 correlations that 
one could compute from the above situation 
represents the degree of common variation 
between the two variables, given the ap- 
propriateness of the linear correlation model. 
After all, that is what correlation is all about. 
In this sense there is no logical reason for 
saying that some of the coefficients represent 
validity and others reliability, although there 
certainly may be in other contexts. An im- 
plicit or explicit insistence on the predictor 
being “different” seems self-defeating. Rather 
one should really be trying to obtain measures 
that are as similar to the criterion or criteria 
as possible. This notion appears to be at least 
implicit in much of the work on prediction 
with biographical data where many of the 
items represent an attempt to assess previous 
achievement on similar types of activities. Be- 
havior sampling is also the basis on which 
simulation exercises are built for use in man- 
agerial assessment programs. 

At this point it should be emphasized 
that for the consistency notion to be con- 
sistent, the measures to be predicted must 
also be measures of behavior. For example, 
it would be something less than consistent 
to use a behavior sample to predict such 
criteria as salary progression, organizational 
level achieved, or subunit production. The 
individual does not always have substantial 
control over such variables, and, even with 
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the more obvious biasing influences accounted 
for, they place a ceiling on the maximum 
predictive efficiency to be expected. Further- 
more, they are several steps removed from 
actual job behavior. In this respect, the au- 
thors are very much in accord with Dunnette 
(1966) who argues strongly for the measure- 
ment of observable job behavior in terms of 
its effect on meaningful dimensions of per- 
formance effectiveness. A recently developed 
method for accomplishing this aim is the be- 
havior retranslation technique of Smith and 
Kendall (1964). The applied psychologist 
should reaffirm his mandate and return to 
the measurement of behavior. Only then will 
one learn by what means, and to what extent, 
an individual has influenced his rate of pro- 
motion, salary increases, or work group’s 
production. 

In general terms, what might the selection 
or prediction procedure look like if one tried 
to apply a consistency model? First, a com- 
prehensive study of the job would be made. 
The results of this effort would be in the 
form of dimensions of job performance well 
defined by a broad range of specific behavior 
incidents which in turn have been scaled 
with respect to their “criticalness” for effec- 
tive or ineffective performance. 

Next, a thorough search of each applicant’s 
previous work experience and educational his- 
tory would be carried out to determine if any 
of the relevant behaviors or outcomes have 
been required of him or have been exhibited 
in the past. Items and rating methods would 
be developed to facilitate judging the fre- 
quency of such behaviors, the intensity with 
which they were manifested, the similarity of 
their context to the job situation, and the 
likelihood that they will show up again. These 
judgments can then be related to similar 
judgments concerning significant and con- 
sistent aspects of an individual’s job behavior. 

Such a procedure places considerable em- 
phasis on background data and is similar in 
form to the “selection by objectives” concept 
of Odiorne and Miller (1966). However, the 
aim is to be considerably more systematic and 
to focus on job behavior and not summary 
“objectives.” 

After the analysis of background data it 
might be found that the required job be- 
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haviors have not been a part of the applicant’s 
past repertoire and it would be necessary to 
look for the likelihood of that job behavior 
in a variety of work-sample tests or simula- 
tion exercises. A number of such behavior 
measures are already being used in various 
management assessment programs. 

Finally, individual performance measures 
of psychological variables would be given 
wider use where appropriate. For example, 
the Wechsler Adult Intelligence Scale (Wechs- 
ler, 1955) might be used to assess certain 
cognitive functions. Notice that such a mea- 
sure is a step closer to actual performance 
sampling than are the usual kinds of group 
intelligence tests. 

How does the above procedure compare to 
conventional practice? The authors hope 
they are not beating at a straw man if the 
usual selection procedure is described as fol- 
lows. First, a thorough job analysis is made 
to discover the types of skills and abilities 
necessary for effective performance. This is 
similar to the consistency approach except 
that the objective seems to be to jump very 
quickly to a generalized statement of skills 
and abilities rather than remaining on the be- 
havioral level. The conventional approach next 
entails a search for possible predictors to try 
out against possible criteria. Based on knowl- 
edge of the personnel selection and individual 
differences literature, personal experience, and 
“best guesses,” some decisions are made con- 
cerning what predictors to include in the 
initial battery. It is the authors’ contention 
that the classic validity model has forced an 
undue amount of attention on test and in- 
ventory measures at this stage. Witness the 
large amount of space devoted to a discus- 
sion of “test validation” in most books deal- 
ing with the selection problem. Again, signs 
seem to take precedence over samples. Lastly, 
one or more criterion measures are chosen. 
Too often the choice seems to be made with 
little reference to the previous job analysis 
and is based on a consideration of “objec- 
tivity” and relevance to the “ultimate” cri- 
terion. Unfortunately, even a slight misuse of 
these considerations can lead to criteria which 
are poorly understood. In contrast, working 
within the framework of a consistency model 
requires consideration of dimensions of actual 
job behavior. 
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It might be added that the above charac- 
terization of the conventional approach is 
meant to be somewhat idealized. Certain de- 
partures from the ideal might reinforce the 
use of signs to an even greater extent. For 
example, there is always the clear and present 
danger that the skill requirements will be 
stated in terms of “traits” (e.g., loyalty, re- 
sourcefulness, initiative) and thus lead even 
more directly to criteria and predictors which 
are oriented toward underlying predisposi- 
tions. 


RELATIONSHIP TO OTHER ISSUES 


The consistency notion has direct relevance 
for a number of research issues that appear 
frequently in the selection and prediction 
literature. One important implication is that 
selection research should focus on individuals 
to a much greater extent than it has. That is, 
there should be more emphasis on intrain- 
dividual consistency of behavior. In their in- 
sightful discussion of the criterion problem, 
Ghiselli and Haire (1960) point out that in- 
traindividual criterion performance sometimes 
varies appreciably over time, that is, is “dy- 
namic.” They give two examples of this 
phenomenon. However, after an exhaustive 
review of the literature, Ronan and Prien 
(1966) concluded that a general answer to 
the question, “Is job performance reliable?” 
is not really possible with present data. They 
go on to say that previous research has not 
adequately considered the relevant dimensions 
that contribute to job performance and very 
few studies have actually used the same 
criterion measure to assess performance at 
two or more points in time. In the absence of 
much knowledge concerning the stability of 
relevant job behaviors it seems a bit danger- 
ous to apply the classic validation model and 
attempt to generalize from a one-time cri- 
terion measure to an appreciable time span 
of job behavior. Utilizing the consistency no- 
tion confronts the problem directly and forces 
a consideration of what job behaviors are 
recurring contributors to effective perform- 
ance (and therefore predictable) and which 
are not. 

In addition, the adoption of signs as pre- 
dictors in the context of the classic model 
has undoubtedly been a major factor con- 
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tributing to the lack of longitudinal research. 
It makes it far too easy to rely on concurrent 
studies, and an enormous amount of effort 
has been expended in that direction. Em- 
phasis on behavior samples and behavior con- 
sistency requires that a good deal more at- 
tention be devoted to the former, along with 
very explicit consideration of the crucial 
parameters of a longitudinal study. 

The moderator or subgrouping concept also 
seems an integral part of the consistency ap- 
proach. The basic research aim is to find 
subgroups of people in a particular job family 
for whom behavior on a particular perform- 
ance dimension is consistent. Subgrouping 
may be by individual or situational charac- 
teristics but the necessity is clear and in- 
escapable. Only within such subgroups is 
longitudinal prediction possible. 

Lastly, the process the authors are ad- 
vocating demands a great deal in terms of 
being able to specify the contextual or situa- 
tional factors that influence performance. It 
is extremely important to have some knowl- 
edge of the stimulus conditions under which 
the job behavior is emitted such that a more 
precise comparison to the predictor behavior 
sample can be made. Because of present dif- 
ficulties in specifying the stimulus conditions 
in an organization (e.g., Sells, 1964), this may 
be the weakest link in the entire procedure. 
However, it is also a severe problem for any 
other prediction scheme, but is usually not 
made explicit. 

It is important to note that the authors’ 
notion of a consistency model does not rest 
on a simple deterministic philosophy and is 
not meant to preclude taking account of so- 
called “emergent” behaviors. Relative to ‘“‘crea- 
tivity,’ for example, the question becomes 
whether or not the individual has ever ex- 
hibited in similar contexts the particular kind 
of creative behavior under consideration. If a 
similar context never existed, the research 
must investigate creative performance and 
outputs obtained in a test situation which 
simulates the contextual limitations and re- 
quirements in the job situation. 

An additional advantage of the consistency 
approach is that a number of old or persistent 
problems fortunately appear to dissipate, or 
at least become significantly diminished. Con- 
sider the following: 
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1. Faking and response sets—Since the em- 
phasis would be on behavior samples and not 
on self-reports of attitudes, beliefs, and inter- 
ests, these kinds of response bias would seem 
to be less of a problem. 

2. Discrimination in testing—According to 
Doppelt and Bennett (1967) two general 
charges are often leveled at tests as being 
discriminatory devices: 


(a) Lack of relevance—It is charged that 
test items are often not related to the work 
required on the job for which the applicant is 
being considered, and that even where rela- 
tionships can be shown between test scores 
and job success there is no need to eliminate 
low-scoring disadvantaged people since they 
can be taught the necessary skills and knowl- 
edge in a training period after hiring. 

(6) Unfairness of content—It is further 
maintained that most existing tests, especially 
verbal measures, emphasize middle-class con- 
cepts and information and are, therefore, un- 
fair to those who have not been exposed to 
middle-class cultural and educational in- 
fluences. Consequently, the low test scores 
which are earned are not indicative of the 
“true” abilities of the disadvantaged. Predic- 
tions of job success made from such scores 
are therefore held to be inaccurate. 


The examination of past behaviors similar 
in nature to desired future behavior, along 
with their contextual ramifications, plus the 
added techniques of work samples and simula- 
tion devices encompassing desired future be- 
havior, should markedly reduce both the real 
and imagined severity of problems of unfair- 
ness in prediction. 

3. Invasion of privacy—The very nature of 
the consistency approach would seem to al- 
most entirely eliminate this problem. The link 
between the preemployment or prepromotion 
behavior and job behavior is direct and ob- 
vious for all to see. 


CONCLUDING COMMENTS 


The preceding discussion is meant to be 
critical of the concepts of predictive and con- 
current validity. Nothing that has been said 
here should be construed as an attack on 
construct validity, although Campbell (1960) 
has pointed out that reliability and validity 
are also frequently confused within this con- 
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cept. Neither do the authors mean to give the 
impression that a full-scale application of the 
consistency model would be without difficulty. 
Using available criteria and signs of assumed 
underlying determinants within the frame- 
work of the classic model is certainly easier; 
however, for long-term gains and the eventual 
understanding of job performance, focusing on 
the measurement of behavior would almost 
certainly pay a higher return on investment. 

Some time ago, Goodenough (1949) di- 
chotomized this distinction by referring to 
signs versus samples as indicators of future 
behavior. Between Hull’s (1928) early state- 
ment of test validities and Ghiselli’s (1966) 
more recent review, almost all research and 
development efforts have been directed at 
signs. Relatively small benefits seem to have 
resulted. In contrast, some recent research 
efforts directed at samples seem to hold out 
more promise. The AT&T studies, which used 
ratings of behavior in simulated exercises 
(Bray & Grant, 1966), and the In-basket 
studies reported by Lopez (1965) are suc- 
cessful examples of employing behavior sam- 
ples with management and administrative 
personnel. Frederiksen (1966) has reported 
considerable data contributing to the construct 
validity of the In-basket. In addition, Ghiselli 
(1966) has demonstrated that an interview 
rating based on discussion of specific aspects 
of an individual’s previous work and educa- 
tional history had reasonably high validity, 
even under very unfavorable circumstances. 
In a nonbusiness setting, Gordon (1967) 
found that a work sample yielded relatively 
high validities for predicting final selection 
into the Peace Corps and seemed to be largely 
independent of the tests that were also in- 
cluded as predictors. 

Hopefully, these first few attempts are the 
beginning of a whole new technology of be- 
havior sampling and measurement, in both 
real and simulated situations. If this tech- 
nology can be realized and the consistencies of 
various relevant behavior dimensions mapped 
out, the selection literature can cease being 
apologetic and the prediction of performance 
will have begun to be understood. 
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80 inspectors inspected materials containing defects at 4 different rates. The 
inspection task employed had been shown in a series of previous studies to 
be representative of a variety of visual inspection tasks. Performance measures 
were obtained and analyzed to determine the relationship between defect rate 
and inspection accuracy. Findings indicated that inspection accuracy decreases 
with reductions in defect rate. The percentage of true defects detected sig- 
nificantly decreased and the percentage of false reports significantly increased. 


Although product inspections are widely 
relied upon in industrial operations to provide 
information used in the control of product 
quality, relatively little is known about the 
conditions that affect inspection accuracy. 
There are a number of factors in the inspec- 
tion task which, under certain circumstances, 
may operate to increase or decrease the ac- 
curacy of the resulting quality information. 
For example, Harris (1966) found that the 
complexity of the inspection task had a sig- 
nificant negative effect on inspection accuracy. 
Even when an unlimited amount of inspection 
time was provided, the percentage of de- 
fects detected was significantly negatively 
correlated with the complexity of the item 
inspected. 

The objective of this study was to deter- 
mine the effect of defect rate on inspection 
accuracy. Most production programs have a 
typical learning curve associated with them. 
Early in the program product defect rate is 
high, but quality typically improves, until 
later on the defect rate is relatively low. 
Other circumstances, such as variations in 
production schedules, may also cause the de- 
fect rate to vary in a predictable way. Should 
inspections be conducted in the same manner 
regardless of the defect rate? This depends, 
of course, on the relationship between defect 
rate and inspection accuracy. 

Since visual inspection tasks have some 
elements in common with visual monitoring 
tasks requiring the detection of infrequent 
signals appearing at irregular intervals, previ- 
ous research on vigilance performance may 
suggest the general nature of this relationship. 
One would predict from vigilance research 
that inspection accuracy would decrease with 


a decrease in defect rate (Buckner & Mc- 
Grath, 1963). However, there are differences 
between typical production inspection tasks 
and the monitoring tasks which have ap- 
peared in the studies on vigilance performance. 
Consequently, there is little in vigilance re- 
search results to indicate the specific nature 
of the relationship between defect rate and 
accuracy for an inspection task. 


MeETHOopD 


A representative inspection task was developed 
and materials prepared which included defects at 
four different defect rates. Materials at each defect 
rate were inspected by a sample of naive Ss. Inspec- 
tion accuracy was measured in terms of the per- 
centage of defects detected and the percentage of 
false reports made. 


Inspection Task 


A paper-and-pencil inspection task was developed. 
The task was similar to that employed in a previ- 
ously developed selection test for visual inspection 
work (Harris, 1964). In a series of validation 
studies, performance on this task was found to be 
significantly correlated with performance on a wide 
variety of visual inspection tasks. The correlations 
are listed in Table 1. 


TABLE 1 


CORRELATIONS BETWEEN PERFORMANCE ON Test TASK 
AND VaARiouS INSPECTION TASKS 


—< 











N Correlation 
Inspection task : a with test 

inspectors tqalc 
Electronic chassis 11 .39 
Inertial instruments 8 .86* 
Module assemblies 19 .58* 
Circuit boards 27 .64* 
Microelectronic devices 10 aon 
Photomasks aS E527 


* pe 05. 
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Fic. 1. Sample inspection sheet with defects indicated. 


The inspection task consisted of detecting defects 
in sheets like the one shown in Figure 1. The en- 
closed area contained 80 items: 20 each of the four 
types shown in their correct form and orientation 
at the side of the sheet. Defective items were in- 
dicated by an X made over them with a colored 
pencil. 

Four different 40-sheet sets were prepared, one 
for each of the four different defect rates. The layout 
of each sheet was identical to the one shown in 
Figure 1. The starting point in the preparation of 
materials was drafting a defect-free sheet and print- 
ing 160 copies of it. Then a basic set of 32 defective 
items was drafted and copies printed. Defects were 
inserted by gluing defective items over appropriate 
items on the defect-free sheets. From these masters, 
20 copies of each 40-sheet set were printed. 

Defects were randomly assigned to form four 40- 
sheet sets with defect rates of 0.25%, 1%, 4%, and 
16%. Since each set contained a total of 3,200 items 
(40 sheets X 80 items per sheet), these percentages 
correspond to 8, 32, 128, and 516 defects per set of 
40 sheets. In preparing the 1%, 4%, and 16% defect 
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rates, the 32 defective items were assigned once, 4 
times, and 16 times to the 40-sheet set. For the 0.25% 
defect rate, four subsets of 40 sheets were each as- 
signed 8 of the basic set of 32 defective items. In 
this manner, each defective item appeared an equal 
number of times under each defect-rate condition. 
A table of random numbers was used to assign de- 
fects to sheets. 


Procedure 


Inspections were made under the four defect-rate 
conditions by a total of 80 naive inspectors. In- 
spectors were randomly assigned to the 4 conditions, 
20 per condition. The inspectors were paid student 
volunteers from California State College at Fullerton 
and Fullerton Junior College. Each inspector was 
given samples of the types of defects to be de- 
tected and instructed to X over each defect found 
on each sheet. Inspectors were encouraged to take 
as much time as necessary to perform accurate in- 
spections. They were given no instructions about the 
number of defects to expect; they were instructed, 
however, to ask for clarification on any item about 
which they were undecided. In this manner, the 
monitor performed a role similar to an inspection 
supervisor on the job. 

Inspection accuracy was measured in terms of de- 
fects detected and false reports made. For each in- 
spection condition, the number of defects detected 
was divided by the number of defects present to give 
the percentage of defects detected. The percentage of 
false reports was computed by dividing the number 
of defects reported which were not actually defects 
by the total number of defects reported. 


RESULTS AND DISCUSSION 


Inspection accuracy decreased with reduc- 
tions in defect rate. As shown in Figure 2, 


DEFECTS DETECTED 


PERCENTAGE 
oa ee 
a ee 
n-o -__ 


\ FALSE REPORTS 








DEFECT RATE (PERCENTAGE DEFECTIVE) 


Fic. 2. Relationship between defect rate and 
inspection accuracy. 
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the percentage of defects detected decreased 
slightly between defect rates of 16% and 1%, 
but dropped sharply between 1% and 0.25%. 
False reports, the second indicator of inspec- 
tion accuracy, became more frequent with 
reductions in defect rate. The percentage of 
false reports increased at an accelerated rate 
as the defect rate approached zero. The dif- 
ferences among the four defect rates for the 
percentages of defects detected and for the 
percentages of false reports were tested by 
means of the Kruskal-Wallis one-way analysis 
of variance by ranks (Siegel, 1956). Differ- 
ences were statistically significant beyond the 
.05 level for both the percentages of defects 
detected and percentages of false reports. 
These results demonstrate a potential prob- 
lem in using information obtained from visual 
inspections. When a relatively high quality 
product is being produced (the percentage of 
defective characteristics is about 1 or less), 
inspections may not be very accurate. The 
percentage of defects detected is likely to be 
low and the percentage of false reports is 
likely to be high. As a consequence, inferences 
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made about the true quality level from visual 
inspections may be in error. 

These findings have implications for the 
management of inspection operations. Super- 
visors should be aware that this tendency 
exists and, thus, be better prepared to counter- 
act the effect of this factor. For example, as 
quality level increases, considerations might 
be given to fewer, more intensive inspections 
made through the use of sampling plans. Much 
better information might be obtained at less 
cost, for example, from inspections of a 10% 
random sample of the product than from in- 
spections of 100% of the product. 
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Paired-comparison scaling procedures were applied to an investigation of 
preferences for 5 types of careers and 5 types of organizations. Scaling was 
accomplished for 10 work-satisfaction criteria, using 4 samples, 2 of college 
students and 2 of college preparatory high school students. The median correla- 


tion of preferences between 


samples 
preferences were higher for college students. 


.75; internal consistencies of 
The order of preferences for 


Was 


both careers and organizations depended upon the criteria. 


The time-worn debate between the “busi- 
nessman” and the “humanist” regarding the 
virtues of a business career continues to ap- 
pear in the popular press. As usual, both as- 
sert that their point of view conforms to 
current student preferences (Blough, 1966; 
Drucker, 1965). This assertion is possible 
because accurate data bearing on the question 
have not been available, and traditional voca- 
tional preference theory has not addressed 
itself directly to the problem. 

For today’s college student this debate may 
be more practical than at any time in the 
past. Employment opportunities in teaching, 
government, research, and social service have 
expanded greatly since 1929 (Fuchs, 1965). 
As the first step in research on this problem, 
this study attempts to assess student prefer- 
ences for a limited number of careers, using 
10 criteria, and to also assess preferences for 
the type of organization in which these careers 
might be practiced. 

Most vocational preference theory tries to 
account for the development of “interests”; 
for example, perhaps a graduating senior 
wants to become an engineer because he 
enjoyed playing by himself with an erector 
set rather than playing doctor with the little 
girl next door. This theory often focuses on 
the biographical antecedents of measured in- 
terest patterns; for example, a current area 
of research studies the development of a 
“self” concept, an increasing awareness of 
one’s own role (Hunt, 1967). Although voca- 


1This study was supported by a grant from 
Security Trust Corporation, Rochester, New York. 
Parts of this paper were presented at the meeting of 
the Midwestern Psychological Association, Chicago, 
May 1967. 


tional interests can be fairly well measured, 
neither the theories nor empirical research 
have resulted in a clear statement of the de- 
cision process which a student undertakes as 
graduation nears (Holland, 1964). 

This study attempts to measure attitudes 
which may affect career decisions by deter- 
mining whether widely held preferences exist 
for careers and organization types in which 
careers may be pursued. It is hypothesized 
that, for the same criteria, preference patterns 
for business organizations may be consider- 
ably different than those for a business ad- 
ministration career. 

The issue of preferences raises the question 
of the basis of career decisions. One model 
which has been proposed by economists as- 
serts that (assuming people have appropriate 
information) career decisions are based on an 
assessment of the present value of future 
earnings of alternatives at the time the career 
decision is made (Wilkinson, 1966). The 
present value model (in its simplest form) 
states that when people are nearly indifferent 
in their estimates of nonmonetary rewards, 
such as a desire for power or leisure, they 
choose a career on the basis of higher ex- 
pected earning power. A greater difference in 
monetary rewards is needed in order for the 
decision to be swung by the monetary factor, 
the less the indifference to the nonmonetary 
factors. Empirical evidence on_ individual 
career decisions does not exist to support or 
deny the idea that people behave in a fashion 
similar to that proposed above. Data show 
that aggregate shifts in numbers choosing 
careers usually follow shifts in salary offers. 
However, the question as to how individuals 
actually make their decisions is not answered 
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adequately since the data are purely post hoc 
for the aggregate. For present purposes it will 
be assumed that people are able to provide 
a qualitative ranking which includes responses 
intrinsic to the work activity and the mone- 
tary reward therein. 


For example, it could be argued that each 
person is able to estimate (qualitatively) 
some just noticeable difference in income. 
Within the range of this income difference, 
nonmonetary matters are used to decide be- 
tween alternative career patterns. Outside this 
range, trade offs between income and other 
criteria may be made. For some people it may 
be that possible choices, after income is taken 
into account, will be very wide. However, 
very accurate information on how these trade 
offs are made is not available. 

This study is set up to measure separately 
the preferences for careers and organizations 
on a basis of 10 criteria related to job satis- 
faction. 


METHOD 
Criteria 


Ten criteria for comparing careers and organiza- 
tions were established after consideration of a num- 
ber of reviews of the job-satisfaction literature 
(Bass, 1965; Herzberg, 1966; Vroom, 1964). The 
criteria are: 


(a) there are better opportunities to accomplish 
something worthwhile. 

(6) there are better advancement possibilities. 

(c) there will be more situations where you can 
apply what you learned in school. 

(d) your success would depend Jeast on lucky 
circumstances. 

(e) you would earn the highest salary. 

(f) you would have more important responsibili- 
ties. 

(g) you would be most recognized for your con- 
tributions. 

(h) you would receive 
competent superiors. 

(i) you would have the most secure future, with 
steady employment. 

(j) you would have the best chance of working 
with people with whom you could build last- 
ing friendships. 


supervision from most 


The 10 criteria were added to the following pair of 
questions: 


“On your first job in which [type of organization] 
do you think [Criteria a through j].” 

“On your first job doing which [type of job] do 
you think [Criteria @ through j].” 
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Paired-Comparison Instrument 


The following is a list of organizations and careers 
which were compared. The careers are ones which 
bright students may consider, and usually involve 
graduate school. 


CAREERS 
Business Administrator 
Lawyer 
Research Scientist 
Professor 
Clinical Psychologist 

ORGANIZATIONS 


Federal Government 
College Faculty 
Small Business 
Non-Profit Service 
Large Corporation 


The exact wording of every question asked will be 
found in Tables 1 and 2. Given these criteria and the 
above list of careers and future employers, a paper- 
and-pencil form containing a series of 20 blocks of 
10 paired-comparison items was developed. All 
possible pairs of either careers or organizations were 
included in a block. Each member of a pair was 
placed under columns designated “A” or “B,” fol- 
lowed by a blank space under a column labeled 
“choice.” The order of pairs within a block was de- 
termined by using a table of random numbers. The 
10 organizational comparison blocks were followed 
by the 10 career comparison blocks. Two sample 
pairs using items not included in the actual com- 
parisons were included in the instructions. The Ss 
were instructed to work rapidly. Most completion 
times, including response to a biographical informa- 
tion form, were between 30 and 40 min. This in- 
strument is at present being used in a number of 
validation studies. 


Subjects 


The instrument was administered to four samples 
of Ss. Two of the groups were high school seniors 
(7 = 72 and 25) who have indicated their intent to 
go to college. The high schools were located in 
middle-class suburban communities. The third group 
consisted of 116 freshmen students at a medium- 
sized eastern university with a highly selective ad- 
missions policy who were enrolled in a freshman 
psychology course. A fourth sample consisted of 69 
upperclassmen at the same university enrolled in an 
experimental psychology and a statistics course. 


RESULTS 
Computation of Preference Values 


The individual preferences were evaluated 
according to a procedure suggested by Guil- 
ford (1954).? 

2A program to perform these computations writ- 


ten by W. R. Johnson for the IBM 7074 is available 
from the authors. 


382 


Oviestion High school 


Sample 1 


On your first job, in which do you think 
there are better opportunities to accom- 
plish something worthwhile? 

On your first job in which do you think 
there are better advancement possi- 
bilities? 

On your first job, in which do you think 
there will be more situations where you 
can apply what you learned in school? 
On your first job, in which do you think 
your success would depend least on 
lucky circumstances? 

On your first job, in which do you think 
you would earn the highest salary? 

On your first job, in which do you think 
you would have more important re- 
sponsibilities? 

In your first job, in which do you think 
you would be most recognized for your 
contributions? 

In your first job, in which do you think 
you would receive supervision from most 
competent superiors? 

In your first job, in which do you think 
you would have the most secure future, 
with steady employment? 

In your first job, in which do you think 
you would have the best chance of 
working with people with whom you 
could build lasting friendships? 


(a 


ww 


—.027 061 


(b) 


.169 
i) 
—111 
(d) 


.044 —.061 


(e) 
394 


107 
(g) 


—.387 
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i) 
406 
(3) 


—.803 








Federal Government 


Sample 2 


156 
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534 


-209 


—.409 


233 


-679 


—.965 
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TABLE 1,—PREFERENCE 


College faculty 


University High school University 


Upper- 
classmen 


Fresh- 
men 


Upper- 
classmen 


Fresh- 


men Sample 2 


Sample 1 


Oy = 20 302 .229 -647 


.190 —.209 — .038 =. —.198 


alu) = 20 203 451 


—.095 —.057 369 cond 504 


194 .015 —.093 —.062 = —.089 


—, 209 Cath 341 AIT yA) 
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—.585 =u .162 .090 392 


—.034 a7, 302 .250 .562 


-645 SO -168 .192 .098 


—.593 —.630 Sits 424 oho -887 























Note.—Data are Z-score values representing the average preference for each item compared to all other items. A zero (neutral) 


probability of preference, when compared to the other items. 


High school: Sample 1 » = 72; Sample 2 » = 25. University: Freshmen ” = 116, Upperclassmen » = 69. 


The average proportion of preferences for 
every career and organization was calculated. 
This average proportion was converted to a 
scale on a standard score basis, with 0 as a 
neutral point. Positive scale values indicated 
a higher proportion of preferences than .5; 
negative, a lower proportion. 


Preference Values for Organizations 


The data for the organizational preferences 
are given in Table 1. There are considerable 
differences between the criteria as to the type 
of organization most preferred. Federal Gov- 
ernment is generally preferred for the security 
criterion; college faculty for accomplishment, 
application of school work, and least depend- 
ence on luck; large corporation for advance- 
ment and salary; non-profit service and small 
business shared preferred positions for the 
recognition and social criteria. 


Preference Values for Careers 


The data for the career preferences are 
given in Table 2. Business administrator is 


strongly preferred for its advancement op- 
portunities. Among the high school students, 
lawyer is viewed as offering high starting 
salaries, while the college Ss’ preferences for 
lawyer are lower on the salary criterion. Re- 
search scientist is preferred for recognition 
and competent supervision and, by all except 
the college upperclassmen, for accomplish- 
ment. The upperclassmen strongly prefer clin- 
ical psychologist for the accomplishment and 
responsibility; it is not highly preferred by 
the others (the upperclassmen in the present 
sample were psychology majors). College pro- 
fessor is strongly preferred, surprisingly, on 
the social criterion. 


Internal Consistency 


If such an obtained scale of preference 
values is to be maximally useful, one must as- 
sume that the scale results can be used as if 
the so-called “strong axiom of revealed pref- 
erence” held. That is, if A is preferred to B 
and B to C, then C ought not to be preferred 
to A. If it is, this triad would be inconsistent. 
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value indicates that the item is preferred half the time when compared to the others. The 


The presence or absence of this condition 
(consistency of behavior) was tested using 
the Dobell (1965) method. 

Consistency allows construction of a mean- 
ingful scale. If this condition, which might be 
called an internal consistency condition, is 
not met, then it is not possible to infer back 
from the scale the actual choices which would 
be observed in paired-comparison choice sit- 
uations. It should be noted that a lack of in- 
ternal consistency does not necessarily imply 
unreliability—preference choices may be re- 
liably inconsistent. 

Internal consistency was greater for the 
college students than the high school students. 
Eliminating the counting of ties as “incon- 
sistent,” all organizational criteria and all but 
two career criteria (competent supervision 
and security) were consistent for college stu- 
dents. 


Consistency between Samples 


Product-moment correlations between sam- 
ples were computed for all pairs of the four 
samples of data. 


higher the score, the higher the 


Tables 3 and 4 give the sample intercorrela- 
tions. They range from .52 to .84; the mode 
and median is .75. The range for careers is 
greater than that for organizations. All are 
significant beyond the .01 level. 


DISCUSSION 


This article began with the concept that 
preferences both for careers and organizations 
must be measured concurrently because pref- 


TABLE 3 


CORRELATION MATRIX: ORGANIZATIONS 








Sample 





1. High school, 
Sample 2* 
2. University upper- 
classmen? 
3. University freshmen® 
4, High school, 
Sample 14 


1.00 0.59 0.78 
0.61 


0.59 


0.75 
1.00 


0.81 
0.76 
0.78 


0.76 1.00 
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Business administrator 


- High school 
Question ea 
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TABLE 2.—PREFERENCE 


Lawyer 
University 


University High school 





Sample 1 


(a) On your first job, doing which do you 
think there are better opportunities to 
accomplish something worthwhile? 

On your first job, doing which do you 
think there are better advancement 
possibilities? 

On your first job, doing which do you 
think there will be more situations 
where you can apply what you learned 
in school? 

On your first job, doing which do you 
think your success would depend least 
on lucky circumstances? = 
On your first job, doing which do you 

think you would earn the highest 

salary? -. 
On your first job, doing which do you 

think you would have more important 
responsibilities? -. 
In your first job, doing which do you 
think you would be most recognized for 
your contributions? 

In your first job, doing which do you 
think you would receive supervision 
from most competent superiors? 

In your first job, doing which do you 
think you would have the most secure 
future, with steady employment? 

In your first job, doing which do you 
think you would have the best chance of 
working with people with whom you 
could build lasting friendships? 
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Note.—Data are Z-score values representing the average preference for each item compared to all other items. A zero (neutral) 


probability of preference, when compared to the other items. 


High school: Sample 1 = 72; Sample 2 » = 25. University: Freshmen m = 116; Upperclassmen ” = 69, 


erence patterns for business organizations 
may be considerably different than for busi- 
ness administration careers. The results tend 
to support the hypothesis. However, the dif- 
ferences which exist for any criterion would, 
the authors assert, be of small importance, 
if in fact the preference patterns of students 


TABLE 4 


CORRELATION Matrix: CAREERS 





Sample "| ~ahlioanel ns 





1. High school, 
Sample 2° 
2. University upper- 
classmen> 
3. University freshmen® 
4. High school, 
Sample 14 


1.00 0.52 0.59 0.84 


0.52 
0.59 


1.00 
0.74 


0.75 
1.00 


0.63 
0.75 


0.84 0.63 0.75 1.00 
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for careers and organizations were internally 
inconsistent. Vocational interest theory, com- 
bined with studies of the value of future in- 
come, must consider these social biases in 
order to clarify the career-decision-making 
process. 

This study also investigated the possibility 
that there would be substantial agreement 
among students regarding preferences which 
may have an effect upon career decisions. The 
four samples, two of high school students 
and two of college students, yielded prefer- 
ences which showed a high amount of agree- 
ment with each other. This agreement indi- 
cates that the preferences may have a con- 
siderable amount of stability across other 
such samples. 

The most important implication of this 
study is that neither business careers nor 
business organizations are universally down- 
graded by students in comparison to the other 
alternatives available to them, although gen- 
eral business administration careers may have 
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value indicates that the item is preferred half the time when compared to the others. The higher the score, the higher the 


difficulty in attracting these students com- 
pared to the professions for most career ob- 
jectives. The existence of commonly held 
stereotypes, such as found in this research, 
may indicate to businessmen, teachers, and 
guidance counselors that our culture may be 
transmitting some strong influences to stu- 
dents as they make their career decisions. 
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TASK COHERENCE, TRAINING TIME, AND RETENTION 
INTERVAL EFFECTS ON SKILL RETENTION’ 


JAMES C. NAYLOR,? GEORGE E. BRIGGS, anp WALTER G. REED 3 


Ohio State University 


Amount of training, secondary task coherence, and length of retention interval, 
each at 2 levels, were evaluated in terms of long-term skill retention effects. 
The criterion task was composed of a 3-dimensional tracking task (primary 
task) and a 9-event monitoring task (secondary task). Retention loss varied 
inversely with amount of training and with secondary task coherence (the 
latter only under the lesser amount of training) for both tasks. Absolute re- 
tention levels varied directly with the training and task coherence variables 
and inversely with retention interval. From these and previous data, task 
coherence emerges as an important variable in skill acquisition and retention. 


In their review of the literature on skill re- 
tention, Naylor and Briggs (1961) concluded 
“that the retention of an arbitrary sequence 
of responses (whether verbal or motor) is 
less than the retention of responses which oc- 
cur in meaningful sequences or with patterned 
organizations [p. 6].” At first sight the re- 
sults of some recent tracking research by 
Trumbo, Noble, Cross, and Ulrich (1965) and 
by Noble, Trumbo, Ulrich, and Cross (1966) 
seem to be opposite to the above conclusion: 
They found greater losses in performance 
over the retention interval for more predict- 
able tasks. However, the absolute level of 
performance at retention test was better for 
the more coherent tasks, thus supporting the 
above conclusion. In a recent summary and 
review of their research, Noble and Trumbo 
(1967) suggest that the greater retention 
losses for the more coherent tasks may be due 
primarily to a deterioration in timing ac- 
curacy over the retention interval. This in- 
terpretation is consistent with a study by 
Naylor and Briggs (1963) which found re- 
hearsal on the temporal aspects of a task 
much more effective than rehearsal on the 
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Contract No. AF 33(616)-7269 with the Ohio State 
University Research Foundation. Further reproduc- 
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States Government. 

2 Now at Purdue University. 

® Now at Armstrong Cork Company, Lancaster, 
Pennsylvania. 


spatial aspects in terms of retention perform- 
ance. Therefore, while deterioration occurs in 
both the temporal and the spatial aspects of 
learned responses, the former is the more 
crucial loss to skill. 

In a more recent study, Trumbo, Noble, 
and Swink (1967) utilized a tracking situa- 
tion as a primary task and added a secondary 
task (verbal anticipation or guessing a se- 
quence of numbers) during retention testing 
(Experiment I). Coherence was varied sys- 
tematically in both tasks, and, while primary 
task performance during retention was in- 
ferior when the subject (S) experienced the 
two tasks simultaneously, there was no differ- 
ential effect of the level of predictability of 
the secondary task on primary task perform- 
ance. As in their previous work, the authors 
found differential retention performance as a 
function of primary task coherence. 

It seems odd that secondary task coherence 
would not produce differential effects on pri- 
mary task performance: From the concept of 
a central, limited channel for information 
processing (Broadbent, 1958), one would pre- 
dict a differential influence of both primary 
and secondary task coherence provided, of 
course, that the experimenter (£) selected 
sufficiently different levels of secondary task 
predictability. In other words, it is conceiv- 
able that Trumbo et al. (1965) did not differ- 
entially load the channel in their two sec- 
ondary task conditions where the obtained 
average uncertainty in the stimulus sequence 
was .542 and 1.882 bits/signal. 

The present research examined the influence 
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Factors INFLUENCING Task RETENTION 


of two levels of secondary task coherence on 
primary (tracking) task performance during 
both learning and retention. Both tasks were 
more complex than those utilized by Trumbo 
et al. (1967), and an attempt was made to 
assure a significant separation of secondary 
task coherence levels. However, both of the 
coherence levels used here provided S with the 
opportunity for errorless performance on the 
secondary task, in contrast to the Trumbo 
et al. (1965) research where S could im- 
prove with practice on the secondary task but 
could not achieve errorless performance ex- 
cept by chance. 

It was predicted that both acquisition and 
retention performance would vary directly 
with secondary task coherence for both the 
primary and the secondary tasks. In addi- 
tion to secondary task coherence, the amount 
of training (2 or 3 wk.) and the retention in- 
terval (1 or 4 wk.) were varied systematically 
to determine if the above prediction would 
hold across these two highly pertinent vari- 
ables in retention research. 


MeEtTHOD 
Apparatus 


The S performed two tasks simultaneously: a 
three-dimensional tracking task and a nine-event 
monitoring task. The primary (tracking) task pre- 
sented S with three center-reading voltmeter dis- 
plays which provided system error information, and 
he attempted to null out such error by manipulating 
a single control column via left-right, fore-aft, and 
rotation responses. Each of the tracking dimensions 
involved rate dynamics and the control-display gain 
was .025 ips of display pointer movement per 
degree of control device movement. The control was 
spring loaded in all three dimensions at approxi- 
mately 1 oz/deg of arc, and a single sinusoid of .6 
cpm served as the tracking input signal to all three 
dimensions of the primary task. 

The secondary (monitoring) task presented S with 
nine rows of stimulus and response devices to indicate 
nine events. Within each row there were (from left 
te right) three response buttons, plus an amber- and 
a red-jewel light. An event occurred every 6 sec. 
If an amber light occurred, S was to press the but- 
ton labeled “OK” for that event; if a red light ap- 
peared, he first pressed the “Emergency” button 
(which extinguished the red light and activated the 
amber light), and then he was required to press the 
“OK” button; and if no light occurred at the ex- 
pected time, S pressed first a button labeled ‘“Check”’ 
and this activated the red light, which in turn re- 
quired him to press first the “Emergency” and then 
the “OK” buttons. In all cases a correct sequence 
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of responses within 4 sec. of event onset resulted in 
an amber light being locked in; an incorrect sequence 
resulted in a red light being locked in. 

The secondary task was located to the left of the 
tracking display and the separation was such as to 
require S to look away from the tracking display in 
order to respond, although the secondary display 
lights were of sufficient intensity to be clearly ap- 
parent in peripheral vision. 

Secondary task coherence was manipulated in terms 
of the spatial sequence of events: In the highly pre- 
dictable conditions the spatial sequence was 1, 2, 3, 
4, 5, 6, 7, 8, 9, where 1 refers to the top pair of 
lights, 2 to the next pair, and so on; and for the 
low coherence condition the spatial sequence was 1, 5, 
2, 9, 8, 3, 6, 7, 4. Stated in terms of the amount of 
change within the spatial sequence of events, the 
high coherence sequence involved 0 bits of uncer- 
tainty while the low predictability sequence involved 
2.16 bits of uncertainty per change. 


Design and Procedure 


Table 1 summarizes the treatments for each of 
the eight experimental conditions. The latter re- 
sulted from factorial combinations of (a) training 
time (2 or 3 wk.), (6) retention interval (1 or 4 
wk.), and (c) monitoring task coherence (high and 
low). Each group was filled with 16 male under- 
graduates who volunteered for service and were 
paid $1.00 per session. Group assignment was made 
on the basis of total tracking task performance during 
the sixth session. 

All Ss experienced part-task training through the 
fifth session with whole-task training and retention 
testing thereafter. Except for the first session, which 
involved four 70-sec. trials on each of the two tasks, 
there were twelve 70-sec. trials per session. The 
tracking task was always practiced as a whole (three- 
dimensional) task, and there was no change in the 
primary task from the first to the last session. How- 
ever, the secondary task was increased in complexity 
across the first five sessions in terms of the number 
of amber, red, and no-light events: during the first 
two sessions the stimulus events were always red or 


TABLE 1 


EXPERIMENTAL CONDITIONS FOR THE VARIOUS 
RETENTION GROUPS 











Amount of Tea Retention 

Group training Sas interval 

Gaeay organization Goi) 
1 yD High 1 
2 2 High 4 
3 Di Low 1 
4 2 Low 4 
5 3 High 1 
6 | 3 High 4 
in 3 Low 1 
8 3 Low 4 
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amber; during the third session S experienced first 
one no-light event (Trials 2 and 3) and then two no- 
light events (Trials 5 and 6); and in the fourth ses- 
sion this was increased first to three no-light events 
(Trials 2 and 3) and then to four no-light events 
(Trials 5 and 6). Beginning with the second trial of 
the fifth session, there were four no-light events on 
every trial including those during the retention test. 
The number of red- and amber-light events were 
scheduled so as to be equal across the several ses- 
sions. 

Tracking performance was scored over the final 
60 sec. of each 70-sec. trial which coincided with the 
time during which the monitoring task events oc- 
curred (the latter being an event every 6 sec. starting 
10 sec. after the beginning of a trial). Integrated 
absolute tracking error was recorded for each dimen- 
sion and summed for the three dimensions. These 
data were transformed from their original voltage 
units to units of linear extent (inches) on the track- 
ing display scale (Briggs, 1966). 

Monitoring task performance was scored in three 
ways: (a) total response time for the nine events 
on each trial, (6) number of omissive errors (failure 
to respond at the proper time or to complete the re- 
quired response sequence within the first 4 sec. of 
event onset), and (c) commissive errors (responses 
in excess of the required number and/or incorrect 
responses for the particular event). 

The instructions to S stressed the need to maintain 
minimal tracking error and to respond quickly and 
accurately to the monitoring task. Each S was shown 
his scores at the end of each session to emphasize 
these goals. 


RESULTS 
Tracking Performance 


Figures 1 and 2 provide the average track- 
ing performance levels during both training 
and retention test for the 2- and the 3-wk. 
training conditions, respectively. The data for 
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Fic. 2. Tracking performance during training and 
retention testing of the four groups receiving 3 wk. 
of training. 


the first five sessions are not included, as a 
part-task training procedure was in effect dur- 
ing that time and the monitoring task was 
being increased in complexity. Each data 
point in Figures 1 and 2 represents tracking 
performance averaged over 12 trials and 16 
Ss, a total behavioral sample of 192 min. 
per point. 

Training. It is apparent that secondary 
task coherence affected primary task perform- 
ance as predicted: Groups 1 and 2 were 
superior to Groups 3 and 4 (Figure 1) while 
Groups 5 and 6 were superior to Groups 7 and 
8 (Figure 2), on the average, during train- 
ing. Further, it is apparent by comparing 
Figures 1 and 2 that terminal performance 
was superior for the 3-wk. training groups. 
These observations are supported by the re- 
sults of an analysis of variance which was 
applied to the average performance of each 
S on his last block of four training trials: both 
amount of training (F = 5.92, df = 1/124) 
and secondary task coherence (Ff = 5.25, 
df = 1/124) were statistically significant at 
p < .05. However, the interaction of these 
two variables was not significant (F < 1.0). 

Retention. From the above, groups differed 
at the end of training with particularly supe- 
rior tracking performance by Groups 5 and 
6 which experienced 3 wk. of training under 
coherent secondary task conditions. Therefore, 
the retention data were analyzed in two ways: 
(a) Difference scores were calculated for each 
S (last training session minus retention test 
performance) which correct for the initial 
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TABLE 2 


ANALYSES OF VARIANCE OF DIFFERENCE SCORES (TRAINING-RETENTION) FOR 
ALL PERFORMANCE MEASURES 


389 


























Tracking Omissive errors | Commissive errors| Response time 
Source ae a 
MS F MS F Me Somy eer iow: 
Amount of training (T) Det 2399 21g 2-S Seale 2 O12 5 eal ideo 1 2.1321 1.98 38.8967 3.28 
Task organization (O) 1 | 1660.34} 3.02 2993 1.67 .6786 _ .0132 — 
Retention interval (I) 1 101.55 — .0037 — TSH O 72a peel cr .6877 — 
©) 1 | 3012.80} 5.48* 1.2940 eee 1.5269 | 1.42 14.0183 1.18 
ex 1 1 | 1664.63} 3.03 .0000 — 2794 = 1.5706 — 
Oar 1 188.16 — .0001 — 5279 — 27.6208 2.33 
tx OexG I | AO || eae .0469 — .7503 _- 1.8865 — 
Residual error 120 549.41 1795 1.0764 11.8582 
*p<.05. 
ED < .01. 


group differences and permit, therefore, an 
examination of the effects of the experimental 
variables on retention performance indepen- 
dently of their effects on training performance; 
and (6) the uncorrected retention perform- 
ance scores were analyzed to permit an ex- 
amination of the effects of the experimental 
variables on the absolute (rather than rela- 
tive) levels of retention performance. 

These two ways of analyzing retention per- 
formance permit one to examine two different 
aspects of the influence of the independent 
variables on skill retention. In the case of 
relative scores one is concerned with the Joss 
or amount of deterioration in performance 
over a time interval, while with the absolute 
scores one is interested in what remains after 
a time interval. 


The results of these analyses are shown in 
Tables 2 and 3. In terms of relative loss 
(Table 2) those groups which experienced 
only 2 wk. of training deteriorated more dur- 
ing the retention periods than did the groups 
which received 3 wk. of training, and, while 
secondary task coherence did not exert an 
overall significant effect on the amount of 
loss, the significant interaction of Coherence 
x Training indicates that a greater loss oc- 
curred for those Ss who received only 2 wk. 
of training on the less predictable secondary 
task (Groups 3 and 4) than for any other Ss, 
including those who also received only 2 wk. 
of training but did so on the more coherent 
version of the secondary task (Groups 1 and 
2). It is interesting to note that retention in- 
terval (1 versus 4 wk.) did not produce dif- 


TABLE 3 


ANALYSES OF VARIANCE OF ABSOLUTE RETENTION SCORES FOR ALL 
PERFORMANCE MEASURES 


























Tracking Omissive errors | Commissive errors} Response time 
Source df | 2 | er ne 
MS Roe ae dS: 1 aE AS F MS BF 
Amount of training (T) 1 | 9080.47 | 18.45** | 1.9404 | 16.10** | 5.2407 | 4.09* | 36.7440 | 4.22* 
Task organization (O) 1 | 6419.86 | 13.05** | 1.6608 | 13.78** .0069 = 11.2694 | 1.30 
Retention interval (I) 1 | 2263.62 | 4.62* 1891 Si 8.8200 | 6.89** 1.6562 — 
ao O) 1 | 1546.37_ |}. 3.16 1.1101 Oise 9940 — 10.3285 | 1.19 
xX L 1 380.53 — .0657 — .0443 — .0935 — 
Ox! 1 168.13 — 0741 | — 8224 — DiDAST. R33 
ExO%K 1 1 | 421.59 — .0587 — .4705 — 19.5625 | 2.25 
Residuai error 120 | 492.10 | 1205 1.2801 8.7001 | 





* p< .05. 
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TABLE 4 
MEAN PERFORMANCE PER SUBJECT FOR EACH OF THE EIGHT EXPERIMENTAL GROUPS 
DURING THE Last BLOCK OF TRAINING TRIALS AND AT RETENTION TEST FOR 
EAcH OF THE THREE MonitToriInc Task METRICS 
Omissive errors Commissive errors aes time (in sec.) 
Group La ills. i a -z - 
Training | Retention Training | Retention Training | Retention 
—| I- iz 

1 18 | 14 | 1.02 | .92 7.61 | 6.91 
2 25 | a7 aa | 1.36 8.24 | 8.89 
3 il .46 19 1.07 8.74 9.77 
4 30 68 60 1.59 137 | 8.35 
Sy) 08 08 .67 Su 7.98 7.24 
6 O05 iL .69 1.05 674 1255 
7 08 ml 58 34 7.04 7.41 
8 ls | 16 .60 117, 8.41 7.44 














ferential losses either as a main effect or in 
interaction with either of the other two in- 
dependent variables. 

In terms of absolute retention performance 
on the tracking task, Table 3 shows that all 
three independent variables were statistically 
significant as main effects. Thus, not only was 
superior retention performance obtained un- 
der high secondary task coherence and after 
3 wk. of training (compared to that under 
low coherence and after only 2 wk of train- 
ing), respectively, but also performance after 
a l-wk. retention interval was superior to 
that after 4 wk. 

In summary, then, amount of training in- 
fluenced both the amount of deterioration in 
tracking performance and the absolute per- 
formance level at retention test; secondary 
task coherence differentially influenced the 
amount of deterioration only for groups which 
received 2 wk. of original training, although 
there was an overall effect in terms of ab- 
solute performance levels at retention test; 
and, while there was no greater loss in per- 
formance over a 1-wk. retention period than 
over 4 wk., the final average performance 
level after 1 wk. was superior to that after the 
longer retention interval. 


Monitoring Task Performance 


Table 4 provides summaries of the per- 
formance levels of the eight groups during 
training (final block of four trials) and re- 
tention test. It can be seen that very few 
errors of either type occurred and, in gen- 


eral, S was more apt to commit errors (extra 
responses) than to fail to respond. Tables 2 
and 3 provide the results of analyses of vari- 
ance applied to the monitoring task scores 
during retention test. 

Omissive errors. During the final training 
trials there were significantly more omissive 
errors by Ss who experienced only 2 wk. of 
training (Groups 1-4) than by those who 
experienced 3 wk. of training (F = 5.81, df 
= 1/124); however, unlike the case with 
tracking performance data, secondary task 
coherence did not differentially affect omis- 
sive errors during training. In terms of the 
amount of deterioration over the retention in- 
terval, the omissive error data revealed the 
same pattern as with the tracking data: 
greater losses occurred in the 2-wk. training 
groups than in the 3-wk. groups, and the 
Training < Coherence interaction was _ sta- 
tistically significant (see Table 2). From 
Table 3 it can be seen that omissive errors 
followed the same pattern as found for track- 
ing performance, except that retention in- 
terval was not significant and the Training X 
Coherence interaction was significant at p < 
.05. Again, the pattern of this interaction was 
the same as that found for the relative loss 
scores on the tracking and monitoring tasks 
(Table 2): the worst performance was 
achieved under the 2-wk. training, low co- 
herence conditions (Groups 3 and 4) with 
fairly comparable performance for all other 
groups. 

Comissive errors and response time. These 
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two measures of monitoring task performance 
were less sensitive to the effects of the inde- 
pendent variables than either the omissive 
error or the tracking error scores. For ex- 
ample, there were no significant differences 
with either set of scores during training and 
there was only one significant F ratio from 
the two retention analyses of the response 
time scores: amount of training for the ab- 
solute retention data (see Table 3). Those few 
cases of significant F ratios for both commis- 
sive error and response time data do reveal 
results entirely consistent with those of the 
tracking error and omissive error data; for 
example, better retention performance oc- 
curred after 3 than after 2 wk. of training in 
terms of both commissive errors and response 
time (Table 3), and there were more com- 
missive errors after 4 wk. than after 1 wk. of 
retention (Table 3). It is interesting to note 
that the commissive error data were the only 
ones to show a significant effect of retention 
interval for both the relative loss and the 
absolute performance analyses (see Tables 
2 and 3). 

This lack of sensitivity of the response 
time and commissive error scores to the in- 
dependent variables may well have been due 
to the delay in feedback which S experienced 
with regard to these two aspects of perform- 
ance: the S obtained such information only 
as supplementary feedback at the end of 
each daily session when £ provided a debrief- 
ing. The feedback on tracking error was im- 
mediately and continuously available from 
the three tracking displays, and a failure to 
respond appropriately to each event in the 
monitoring task was signaled almost im- 
mediately by a red light on the monitoring 
display. 


DISCUSSION 


The above results indicate that each of 
the independent variables influenced both 
original acquisition and retention tracking 
performance in a straightforward and pre- 
dictable manner, with two exceptions. Thus, 
during original acquisition, tracking per- 
formance was superior after 3 wk. com- 
pared to that after 2 wk. of training, and 
superior tracking performance occurred when 
accomplished with simultaneous practice on 
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the more coherent than with the less pre- 
dictable secondary task. During retention, 
tracking performance, in terms of absolute 
scores (Table 3), was significantly better 
(a) after 3 rather than 2 wk. of original train- 
ing, (0) when accomplished simultaneously 
with the more coherent secondary task, and 
(c) after 1 wk. rather than after 4 wk. of 
retention. Finally, in terms of performance 
loss scores (Table 2), there was less loss fol- 
lowing 3 wk. of original training, and there 
was less tracking performance loss when S 
experienced the more predictable secondary 
task, bué this effect held only for those Ss 
who received only 2 wk. of original training. 
The latter was one of the two exceptions 
noted above: secondary task coherence did 
not appear as a main effect but as an interac- 
tive effect with amount of training in terms 
of retention loss. 

The other exception was that retention in- 
terval did not differentially affect perform- 
ance loss although it did influence absolute 
retention performance, as noted above. This 
latter exception is actually a bit puzzling, 
since the magnitude of differences between 
means due to the retention interval variable 
was actually larger in the difference score 
analysis than it was in the absolute score 
analysis. One possible explanation for obtain- 
ing nonsignificance due to retention interval 
in the difference score analysis is that the 
within-cell variance was markedly increased 
in this analysis due to the unreliability of in- 
dividual S’s performance. Since the variance 
of a set of difference scores is sp” = 542 + S;? 
— 2rapS4Sp, a low correlation between the 
training and retention test scores of indi- 
viduals within a group would result in a 
rather large sp*, which in turn would result 
in a corresponding increase in the MS error 
term used in the analysis of variance. 

Essentially the same pattern of results 
was found with performance on the secondary 
(monitoring) task. In terms of omissive er- 
rors, the only discrepancy between results 
on the two tasks was that retention interval 
did not influence performance as a main ef- 
fect for either the retention loss data or the 
absolute retention scores. Retention interval 
was a statistically significant effect in terms 
of commissive errors, however. 
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It follows, then, that the major prediction 
was supported, on the whole, by the results 
of this experiment: Both during training and 
at retention test, superior primary task per- 
formance was obtained when it occurred 
simultaneously with the more coherent ver- 
sion of the secondary task, and performance 
on the secondary task itself during retention 
(but not during training) was superior under 
the more coherent than under the less pre- 
dictable version. The latter result is con- 
sistent with those of Trumbo et al. (1965) 
and Noble et al. (1966), while the results 
noted here for the retention of tracking per- 
formance extend those of Trumbo et al. 
(1967, Experiment I) by showing that sec- 
ondary task coherence can affect primary 
task performance in a predictable manner. 

The most consistently influential and the 
most powerful independent variable in this 
experiment was the amount of original train- 
ing: on both tasks during training and at re- 
tention, performance levels were superior for 
groups which experienced 3 rather than 2 wk. 
of original training, and the Joss of proficiency 
on both tasks was less for the more highly 
trained groups. Further, the interactions noted 
in Table 2 of Training x Secondary Task 
Coherence indicate that not only can one 
reduce retention losses on both primary and 
secondary tasks with training, but also the 
detrimental effect of low coherence can be 
ameliorated with sufficient original learning. 

Surprisingly, the effect of retention interval 
was less consistent than that of either of the 
other two independent variables. The fact 
that retention interval did not interact sig- 
nificantly with secondary task coherence in- 
dicates that the effect of the latter is uniform 
across at least a 4-wk. interval. Logically, one 
would expect such an interaction to be sig- 
nificant, and visual inspection of Figure 1 
seems to support this expectation: there ap- 
pears to be a greater difference between Groups 
2 and 4 than between Groups 1 and 3 at 
retention test. However, again, this observa- 
tion was not supported by statistical test, and 
obviously from Figure 2 there is no greater 
difference between Groups 6 and 8 than be- 
tween Groups 5 and 7. Had a 6- or an 8-wk. 
retention interval been utilized instead of the 
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4-wk, interval, it is quite possible that the 
expected interaction would have been found. 
Indeed, one may predict that the interaction 
of Amount of Training x Secondary Task 
Coherence X Retention Interval would be sig- 
nificant provided one used evther lesser 
amounts of original training or greater reten- 
tion intervals than those used herein. 

While there is danger in speculation, the 
above does provide further (albeit indirect) 
evidence for the very powerful effects of the 
amount of training variable: Overlearning 
can (and apparently did) mask or ameliorate 
the effects of less powerful independent vari- 
ables, especially their interactive effects. 

The authors’ interpretation of the coherence 
variable, as indicated in the introduction, is 
that by manipulating task predictability one 
loads differentially the central information- 
processing channel. The less coherent a sec- 
ondary task input, the more the channel must 
be concerned with the selection and execution 
of responses for the secondary task, and, 
therefore, the less the capacity which is 
available for primary task performance. It 
follows, then, that during training primary 
task performance should vary directly with 
the coherence of the secondary task, and this 
was the result obtained. 

During retention test one encounters not 
only the above effect of secondary task co- 
herence but also less accurate retention of 
the input sequence for the less predictable 
secondary task which compounds the load on 
the central channel. Presumably, this reduces 
further the capacity of the channel to deal 
with the primary task. 

The finding that amount of training can 
ameliorate the effects of secondary task co- 
herence on retention performance (at least in 
terms of retention loss) is consistent with the 
interpretation of a hierarchical organization of 
skill (see Chapter 2 of Fitts & Posner, 1967). 
As (secondary task) skill becomes more au- 
tomated through practice, it places less load 
on the central channel, and thus there should 
be less potential interference to the processing 
of information on another (primary) task 
performed simultaneously. 

Whatever the theoretical interpretation, 
task coherence emerges from these and previ- 
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ous data as an important determinant of 
skilled performance under long-term retention 
conditions. 
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USE OF IMPORTANCE AS A WEIGHTING COMPONENT 


OF JOB SATISFACTION 


PATRICIA SMITH MIKES anp CHARLES L. HULIN 1? 


University of Illinois, Urbana 


Hypotheses dealing with the rated importance of various aspects of the job were 
tested on data obtained from 660 male and female workers in Montreal, 
Quebec. Importance used multiplicatively to weight satisfaction scores did not 
increase the predictability of turnover during an 11-mo. period over that ob- 


tained by using satisfaction measures alone. 


No interaction between satisfaction 


and importance predicting turnover was obtained. The general conclusion was 


that importance has little value 
attitudes and behavior. 


Strong and consistent relationships between 
employees’ responses to attitude question- 
naires and job behavior have only rarely been 
obtained. It has been implied frequently that 
those aspects of the job which are perceived 
as more important by the worker have a 
greater influence on the behavioral and over- 
all affective responses of the worker to his 
job than do less important aspects. However, 
most job-attitude questionnaires fail to ask 
employees about aspects regarded as more 
or less important. 

Several investigators (Ewen, 1967; Fried- 
lander, 1965; Glennon, Owens, Smith, & Al- 
bright, 1960; Weiss, Dawis, England, & 
Lofquist, 1964a, 1964b) have used importance 
measures in conjunction with satisfaction in 
an effort to demonstrate theoretically and 
empirically more appealing measures. All but 
one concluded that a consideration of im- 
portance is necessary. Ewen, however, con- 
cluded that importance added little. His re- 
sults showed that a summed composite score 
weighted differentially for importance ratings 
was no more strongly related to a measure of 
overall satisfaction than was an unweighted 
composite of the same satisfaction measures. 

The research reported in this paper presents 
what the authors believe to be more definitive 
evidence on the importance of importance. If 
importance is a vital component in the assess- 
ment of job satisfaction then overall job 

1 These data were gathered in conjunction with 
and reported as part of the senior author’s master’s 
thesis submitted to the University of Illinois. The 
authors would like to thank George Graen and 
Harry Triandis for the comments on an earlier draft 
of the manuscript. 


in a_ prediction 


situation involving job 


satisfaction is dependent on the worker’s at- 
titude toward the various aspects of the job, 
and also on the perceived importance of these 
aspects to him. His general reaction to his 
job would depend more heavily on the feeling 
he holds toward those aspects which he con- 
siders as more important than on his feel- 
ings toward those aspects which are less im- 
portant. A weighted composite measure should 
yield a measure of overall satisfaction with the 
job which is more highly related to behaviors 
such as turnover or absences than an un- 
weighted composite measure. The model pro- 
posed is a multiplicative interaction of im- 
portance and satisfaction with a linear com- 
bination of the products. 

The S’s composite job attitude as measured 
by the product of his importance rating and 
satisfaction score for each job aspect summed 
over all aspects will correlate more highly 
with termination decisions than will either 
the summed importance ratings or an un- 
weighted sum of the satisfaction measures. 

Evidence supporting this hypothesis would 
indicate that the perceived importance of job 
aspects is necessary if respectable predictive 
ability is to be achieved. Negative evidence 
would indicate that Ewen’s (1967) basic con- 
clusions can be generalized to the prediction 
of behavioral responses. (Such negative evi- 
dence could mean that the importance of any 
aspect of the job is reflected in the reported 
satisfaction with that aspect of the job. That 
is, only those job aspects which are important 
to S will be responded to by him with a satis- 
faction score which deviates substantially 
from the neutral point. Unimportant job 
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aspects would receive essentially neutral satis- 
faction scores and would enter the predictive 
equations with very small effective weights. ) 

Negative evidence could also be interpreted 
as a reflection of the reliability of a composite 
measure as compared to the reliability of a 
simple sum. Every time one measures a vari- 
able some error is introduced. If satisfaction 
with each job characteristic is measured with 
a reliability of .80 then one would expect 
the sum of the five importance scores to have 
reliability considerably in excess of .80.2 How- 
ever, this reliability would be greater than 
the reliability of the same five satisfaction 
scores if they were multiplied by a fallible mea- 
sure of importance before they were summed. 
Even if the measures of importance used 
have high (but less than perfect) reliability, 
the resulting sum of Importance X Satisfac- 
tion would have an overall reliability much 
below the reliability of the simple sum. The 
maximum obtainable correlation between the 
simple sum and any criterion would be greater 
than the maximum obtainable correlation be- 
tween a composite measure and the criterion. 
Tf all the correlations were corrected for at- 
tenuation it is possible that these corrected 
correlations would indicate the theoretical 
superiority of the composite sum of job satis- 
faction. However, the problem cannot be re- 
moved by demonstrating the theoretical supe- 
riority of one measure over another. It is an 
unfortunate fact of life that every time one 
introduces the notion of importance in the 
study of job satisfaction such importance vari- 
ables will have to be measured. Importance 
measures cannot be discarded on the basis of 
the available data. However, the claims that 
importance must be considered in the study 
of job satisfaction are unfounded since such 
claims cannot ignore the problems of relia- 
bility. 

METHOD 

Subjects 

The Ss for this study were the corporate office 


personnel of a large international corporation with 
its corporate offices located in Montreal, Quebec. In 


2The use of one of the Kuder-Richardson 
formulas would be inappropriate to estimate the re- 
liability of the sum since the measures of satisfaction 
with the separate job components are not homogene- 
ous parts of a longer test. 
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July 1966 all of the employees were requested to 
take part in a job-satisfaction survey which the 
company was conducting in cooperation with the 
second author. The questionnaires were given to the 
employees by departmental representatives after the 
representatives had met with the investigator and 
had the questionnaire explained to them. The em- 
ployees were asked to complete them at home or, if 
they thought they would have 30 uninterrupted 
minutes, to do them at work. All employees were 
asked to participate. Completed questionnaires were 
placed in envelopes, sealed by the respondents, and 
returned either to the investigator or sent through 
the mails directly to the University of Illinois. 
Anonymity was guaranteed to all respondents and 
they were assured that their individual responses 
would not be made known to the company. Names 
were not requested on the questionnaires but for 
many of the employees their job title and depart- 
ment (which were requested) would be enough to 
identify the respondent. Of the 800 employees who 
worked in the home offices and who were present 
during the period of the study, 670 (83%) returned 
usable questionnaires. 


Variables 


Satisfaction with five aspects of the job (work 
done, pay level, promotional opportunities, super- 
vision, and co-workers) was assessed by means of 
the Job Descriptive Index (JDI) (Smith, 1967) 
which is a cumulative-point adjective checklist. In 
addition, the General Motors Faces Scale (Kunin, 
1955) was used to assess job satisfaction with the 
same five job aspects. In this particular study a 6- 
point version of the original 11-point scale was used. 
The six faces used consisted of two scowling faces, 
one neutral face, and three smiling faces. A measure 
of overall job satisfaction was also obtained. The 
same 6-point version of the General Motors Faces 
Scale was used to assess overall satisfaction. 

Importance was measured by a scale designed to 
give interval scale values of importance. A descrip- 
tion of this scale may be found in Hulin and Smith 
(1967) or Mikes (1967). This scale provided ratings 
of importance of six aspects of the job (interest and 
difficulty of work, pay, promotion, supervision, co- 
workers, and working conditions). Satisfaction with 
work was used in conjunction with both the rated 
importance of interest and difficulty of work and of 
working conditions since the satisfaction measures 
may not adequately discriminate between these as- 
pects of the job. 

Background variables (job level, sex, age, tenure) 
were obtained from the face sheet of the question- 
naire which asked for this and other information. 
Job level was quantified from job listings of the 
company by matching the job title reported on the 
face sheet with the job evaluation scale provided by 
the personnel department of the company involved. 
All of these measures were obtained in July 1966. 

The reasons given by the terminators for quitting 
were recorded during the initial stages of this study. 
The plan was to eliminate those Ss from the sample 
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who were “unavoidable” quits (pregnancy, husband 
leaving town, etc.). However, the distinction be- 
tween avoidable and unavoidable quickly became 
meaningless. Within the first month two girls who 
were pregnant reported to the personnel department. 
One terminated. Her questionnaire was placed in the 
unavoidable category. The second girl did not quit 
but only requested a maternity leave and planned to 
return to work. It was impossible to justify the ex- 
clusion of the first girl. The decision was made to 
include all terminators regardless of stated reason 
for quitting. 


RESULTS 


By May 1967, 6 males and 16 females who 
had participated in the study and who could 
be identified had quit their jobs. No correla- 
tional analyses were performed on the male 
sample since the number of cases was too small. 
Analyses for the female sample were per- 
formed on a group of 48 Ss, 16 terminators 
and 32 nonterminators who were matched 
on age, length of service, and job level with 
the terminators. Biserial correlations were 
computed between the sum of JDI scores and 
turnover (scored 1 for termination, 0 for non- 
termination), the sum of the Faces scores with 
job aspects and turnover, the sum of im- 
portance-weighted JDI scores and turnover, 
the sum of importance-weighted Faces scores 
and turnover, the measure of overall job satis- 
faction and turnover, and the sum of im- 
portance scores and turnover. The results 
of this analysis are presented in Table 1. 

The hypothesis was not confirmed. The un- 
weighted sums of the satisfaction scores alone 
proved to be as highly related to termination 
decisions as the sums of the importance- 


TABLE 1 


BISERIAL CORRELATION BETWEEN PREDICTOR 
VARIABLES AND TURNOVER 








3 Correlation 
Variable with turnover 

Sum of JDI scores —.46 
Sum of JDI scores weighted by 

importance — 33 
Sum of Faces scores —.Al 
Sum of Faces scores weighted by 

importance —.39 
Job-in-general Faces score —.34 
Sum of importance scores —.22 


Note.—Females, NV = 48, 
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weighted satisfaction scores. The least ac- 
curate predictor was the sum of importance 
scores alone. 

One possible explanation for the lower 
predictive validity of the importance-weighted 
satisfaction scores was that the multiplicative 
model used did not distinguish between job 
aspects high in importance but low in satis- 
faction and job aspects low in importance but 
high in satisfaction. The first case would be 
an indicator of future turnover but the latter 
case would not. To test this problem of lack 
of symmetry, the female terminators and their 
matched control Ss were dichotomized at the 
medians of satisfaction with work itself and 
the importance of work itself. The same pro- 
cedure was followed using the 6 males and 
their 12 matched control Ss. In each case the 
medians were computed separately for the 
male and female Ss and therefore different 
median values were used for each of the two 
samples. Satisfaction with work itself was the 
variable selected for this analysis because 
it had the highest correlation with turnover 
of any of the five JDI measures. The 
dichotomized male and female turnovers and 
their matched controls were then combined 
preserving the median splits which had been 
obtained from the separate analyses of the two 
samples. This dichotomization and combining 
of male and female workers yielded a four-cell 
table based on a total NV of 66 (22 terminators 
and 44 matched nonterminators). The ratio of 
terminators to total cell numbers in each of 
the four cells was then computed. The inter- 
active medel of importance and satisfaction 
would be supported if the 2 X 2 table in- 
dicated a high turnover ratio in the high- 
importance—low-satisfaction cell, a low ratio 
in the high-satisfaction—high-importance cell, 
and intermediate turnover ratios in the other 
two cells. These data are presented in Table 2. 
No statistical analyses were done on these 
data but an inspection indicates the existence 
of main effects due to satisfaction and im- 
portance but no interaction effects are present. 


DISCUSSION 


The multiplicative model presented in this 
paper received little empirical support. The 
sum of satisfaction scores with unit weights 
did as well as the composite sum of satisfac- 


WEIGHTING COMPONENTS IN JOB SATISFACTION 


tion scores when weighted by importance in 
the prediction of turnover. The intuitively ap- 
pealing idea that importance is important in 
the study of job satisfaction received no sup- 
port. The most immediate conclusion which 
could be reached on the basis of these data 
would be that there is no need to use import- 
ance and an interactive model to link satis- 
faction with the different aspects of the job to 
the behavioral responses to the job since the 
simpler model assuming unit weights is more 
parsimonious and is at least as valid. 

There are alternative conclusions which 
could be reached on the basis of these data. 
It is possible that the interactive model of job 
satisfaction is valid but that the measures of 
satisfaction used in this study are invalid. 
Such a conclusion lacks force, however, when 
the validity data for the JDI and Faces mea- 
sures of satisfaction are examined. These data 
(Smith, 1967) indicate high convergent and dis- 
criminant validity for the JDI. The numerous 
studies which have appeared recently using 
the General Motors Faces Scale of overall 
satisfaction have yielded results which would 
be impossible if the Faces Scale lacked validity 
(Ewen, Smith, Hulin, & Locke, 1966; Graen, 
1966; Hulin & Smith, 1967). It is also pos- 
sible that the multiplicative model of job 
satisfaction is basically valid but the measure 
of importance of the various job aspects em- 
ployed lacks validity. The data presented by 
Mikes (1967) indicate high scale reliability 
for this measure. The data pertaining to the 
convergent and discriminant validity are less 
impressive. Nevertheless, these data indicate 
at least an adequate amount of convergent 
validity. 

One psychometric consideration must be 
discussed. What was done in this study was 
to weight a limited number of variables using 
idiosyncratic, individually determined weights. 
Using such idiosyncratic weights, the result- 
ing sum of the weighted variables will have a 
correlation in excess of .90 with the unweighted 
sum of the same variables (Ewen, 1967). This 
extremely high correlation, of course, reduces 
drastically the chances of demonstrating the 
superiority of one measure or another. 

However, it should be stressed that this 
psychometric consideration is not unique to 
this particular study. Any study which uses 
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TABLE 2 


ANALYSIS OF TURNOVER RATES AS PREDICTED BY AN 
INTERACTION OF IMPORTANCE AND SATISFACTION 























Satis- Importance of work itself 
faction Total 
with work 
itself High Low 
High 5/23 (22%) | 3/10 (30%) 8/33 
Low 3/10 (30%) | 11/23 (49%) 14/33 
Total | 8/33 14/33 22/66 











Note,—Numerators in cells are number who quit and de- 
nominators indicate total cell frequency. 


importance as a weighting measure to weight 
measures of job satisfaction will have the same 
problem associated with it. This means that 
claims for the superiority of weighted com- 
posite measures of job satisfaction are very 
likely unjustified. 

It is also possible that had this study used 
a different behavioral measure as its criterion 
the results would have been more promising 
for a multiplicative model. Termination de- 
cisions were used as the dependent variable 
because this is the one behavioral response 
which has indicated consistent and high re- 
lationships with affective responses to the job 
(Hulin, 1966, 1968; Weitz & Nuckols, 1953). 
The use of productive behavior as a dependent 
variable would have created many problems 
since the previous studies on job satisfaction 
which attempted to link affective responses to 
productive behavior have not indicated any- 
thing like a consistent or significant relation- 
ship (Brayfield & Crockett, 1955; Vroom, 
1964). 

In addition, more promising results might 
have been obtained had other prediction 
models been used to combine the satisfaction 
and importance data. The use of other ad hoc 
models (such as a multiple-regression predic- 
tion equation) would clearly have taken ad- 
vantage of the negative relationships between 
importance ratings and turnover, the negative 
relationships between satisfaction and turn- 
over, and the zero association between satis- 
faction and importance to enhance the predic- 
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tion of turnover. However, with only 48 Ss in 
the present analysis sample such analyses 
could have proved misleading. 

It should also be pointed out that this study 
does not stand alone. The research of Ewen 
(1967) while using a different dependent 
variable (overall job satisfaction) yielded re- 
sults no more promising for an interactive 
model of job satisfaction. It would appear that, 
every time the intuitively appealing use of 
importance in combination with satisfaction 
has been investigated in order to determine 
the validity of such a model, negative results 
have been obtained. 

In summary, it appears that the use of 
importance measures as multiplicative weights 
in a model of job satisfaction does not in fact 
yield the relationships between a composite 
measure of job satisfaction and behavior, 
or between a composite measure of job satis- 
faction and overall satisfaction which are 
stronger, more consistent, or more theoretically 
appealing. The simpler model of job satisfac- 
tion assuming unit weights for the measures 
does as well. 
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EVALUATING MORTGAGE LOAN RISK 


WILLIAM D. BUEL 
Byron Harless, Schaffer, Reid & Associates, Inc., Tampa, Florida 


Mortgage loan application blanks for matched samples of delinquent and non- 
delinquent mortgages were statistically analyzed for purposes of identifying 
items capable of discriminating between the 2 samples. 8 bits of information 
(4 application blank items and 4 ratios) differentiated delinquent from non- 
delinquent accounts and, taken in total score form, gave rise to a biserial cor- 


relation of .64 (p< .001). 


Recent efforts by psychologists have demon- 
strated the validity of weighted credit appli- 
cation blanks. McGrath (1960) reports a 
maximum biserial correlation of .58 (several 
weighting methods were compared) between 
a total score based on 24 weighted credit ap- 
plication blank items and a payment-reposses- 
sion criterion relative to automobile purchase, 
while Hassler, Myers, and Seldin (1963) dis- 
covered nine items capable of producing a 
biserial correlation of .71 when taken in total 
score form relative to differentiating “open” 
from “special handling” department store ac- 
counts. Elsewhere, Myers (1963) describes 
point-biserial correlations ranging from .26 to 
.61 for various weighting methods relative to 
differentiating paid from delinquent accounts 
and accepted from rejected applicants, re- 
spectively. 

Literature searches by the present author 
have disclosed no such efforts relative to 
mortgage loans. In fact, the “lore” of the 
mortgage-lending industry suggests that risk 
evaluation would be relatively difficult be- 
cause payment on one’s home looms more im- 
portant than payment on less significant items, 
and therefore elicits a different set of pay- 
ment motives, motives alleged to cause mort- 
gage installment payments where “other” 
periodic remittances might be neglected. 
While such motives may operate, the present 
study exhibits validity comparable to the 
validity of credit scoring systems designed for 
other purposes. 


CRITERION 


So that payment records would be availa- 
ble for parallel samples of delinquent and 
nondelinquent mortgage loan recipients, this 
study employed historic data, data arising 
from the mortgage application blanks of indi- 


viduals to whom mortgage loans had been ex- 
tended for purposes of home construction or 
purchase—judgments about delinquency—non- 
delinquency could be made only on the basis 
of real experience. 

Examination of the organization’s delin- 
quent files suggested an artificially dichoto- 
mous criterion.t Delinquency was therefore 
defined as 1 or more month’s failure to pay, 
whereas nondelinquency suggested no pay- 
ment failures. While such stringent definitions 
make differences difficult to discover, they 
were forced upon the investigator by sample- 
size constraints, that is, no usable data could 
be spared. 


MeEtTHOD 


The mortgage loan application blanks to be studied 
were drawn from files, by date of loan issuance, for 
the period February 1960 to April 1966. Once 150 
mortgage loan application blanks representing delin- 
quent accounts had been selected, 150 representative 
of nondelinquents were chosen by searching the files 
immediately following each of the delinquent accounts, 
selecting only those blanks which could be matched 
with a delinquent account blank on date (within 30 
days), purpose of application (construction or pur- 
chase), and present address (town). Accordingly, 
these matching variables served as their own control 
and are, across both samples, not necessary of sepa- 
rate statistical treatment. 

After selection of 150 delinquent and 150 nonde- 
linquent accounts, every third pair of files (50 de- 
linquent and 50 nondelinquent) was removed so as 
to provide for a 100-account cross-validation group 
which paralleled the 200-account item-analysis sam- 
ple (100 delinquent and 100 nondelinquent accounts) 
to be used in the analysis of the information on the 
mortgage loan application blank. Also, with reference 
to the delinquent group, the item-analysis and cross- 
validation samples were matched on number of 
months of delinquency. 


1 Appreciation is extended to Warren E. Backert 
and James E. Dirst of the Home Building and Loan 
Association of Aurora, Illinois for permission to pub- 
lish this study. 
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Thirty-one quantifiable or categorizable pieces of 
information were available from each mortgage loan 
application blank, as follows: amount of monthly 
payment, amount of loan, address, possession of tele- 
phone, availability of pledge, length of mortgage in 
months, purpose of mortgage, purchase price, down 
payment, total value of property (home and lot com- 
bined), value of lot, value of building, husband’s 
occupation, husband’s employer, husband’s number 
of months of service, husband’s salary per month, 
marital status, wife’s employer, wife’s months of 
service, wife’s salary per month, other income (hus- 
band and wife), total salary per month (husband 
and wife), total income per month (husband and 
wife), age of husband, age of wife, number of chil- 
dren, total dependents other than self (including 
wife whether or not she was employed), value of other 
monthly payments, balance owed on those monthly 
payments, number of other accounts, and a 4-point 
credit rating from the local credit bureau.2 These 
31 bits of information were analyzed, via tetrachoric 
coefficients of correlation,? to determine their power 
of differentiation between the delinquent and non- 
delinquent accounts in the item-analysis sample. 

In addition to the analysis of the items of informa- 
tion described above, 21 of them, either because of 
their quantitative nature or by logical dictate, lent 
themselves to inclusion in ratios with other items. 
Accordingly, a 21 21 paired-comparison matrix 
was prepared to facilitate consideration of all possible 
combinations. While 210 ratios could have been de- 
[W(N — 1)] 

Zz 

of certain of them. For example, the ratio “number 
of children” to “total dependents” is almost meaning- 
less since it breaks down into a series of constants 
for mortgagees with like numbers of children and de- 
pendents, Further, the intercorrelation of these two 
values is .9904, indicating overlap and nonindepen- 
dence. Such examples are not infrequent, and judg- 
ment dictated that only 79 ratios be calculated and 
tested for their significance of differentiation. 


veloped , logic suggested the elimination 


RESULTS 


Four items of mortgage loan application 
blank information and four ratios differenti- 


2Some items of information (husband’s occupa- 
tion and husband’s employer are examples) were 
analyzed in such fashion as to reflect qualitative 
rather than quantitative differences, that is, to re- 
flect differences in occupation or employer relative 
to goodness or poorness of payment behavior. Un- 
fortunately, such categories were numerous and fre- 
quencies within them became meaninglessly small 
and useless. 

8’ Tetrachoric coefficients of correlation were em- 
ployed in all item and ratio analyses because both 
item and criterion data had been artificially dichoto- 
mized by comparing the frequencies of response for 
a particular interval of item data (the one with the 
largest delinquent-nondelinquent response difference) 
against all other intervals of item data. 
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ated delinquent from nondelinquent accounts 
at the .05 level of significance or better (V = 
200). Those eight items and their tetrachoric 
correlation coefficients follow: wife employed 
(+.23), number of children (—.27), number 
of other accounts (—.28), credit rating 
(+.45), monthly mortgage payments/age of 
husband (—.30), amount of loan/total income 
(—.24), age of husband/number of depen- 
dents other than self (+.30), and age of hus- 
band/number of other accounts (+.25). 

Each of these items and ratios was assigned 
a set of differential weights ranging from 10 
to 0 within each item, weights which while 
strictly intended for use with qualitative data 
were chosen for the sake of homogeneity of 
method across both qualitative and quantita- 
tive items (Wherry, 1944). Employing the 
eight weighted items as a scoring key for use 
with 100 mortgage loan application blanks 
held apart from the item analysis and cor- 
relating the total scores so derived with the 
delinquency—nondelinquency dichotomy gave 
rise to a biserial correlation of .64, significant 
at the .001 level of confidence. Figure 1 pre- 
sents the data from which this correlation 
arises. 

Further evidence of validity is connoted by 
the fact that the 50-person nondelinquent 
holdout group achieved a mean score of 
53.24, whereas the 50-person delinquent hold- 
out group achieved a mean score of only 
37.84, a difference of 15.40. Especially in light 
of the total score standard deviation (15.06 
with NM = 100), the implications of such a 
mean score difference may be seen. Also, with 
regard to Figure 1, it is interesting to note 
that at several points in the distribution 40% 
more nondelinquents than delinquents equaled 
or exceeded a particular mortgage loan appli- 
cation blank score. The most dramatic evi- 
dence of this sort relates to a score of 50, 
a point at or above which 64% of the non- 
delinquent group scored—only 24% of the 
delinquent group achieved this level. 


DiIscussION 


While the personnel evaluator is most usu- 
ally concerned with positive discrimination 
(identifying individuals with good perform- 
ance potential), the mortgage loan evaluator 
is most interested in discriminating at the 
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DELINQUENT ACCOUNTS 


Mortgage Percent Scoring 
Application At or Above 
Blank Score Score Interval Frequency 





70+ 

65-69 
40-64 
55-59 
50-54 
45-49 
40- 44 
35-39 
30-34 


25.2129 
20-24 @60000 
15-19 0 @ 
10-14 
BOW) 


0-4 


401 


NON-DELINQUENT ACCOUNTS 


Percent Scoring 
At or Above 
Scere Interval 


Mortgoge 
Application 
Blank Score 


Frequency 





660606606 70+ 
9000666 65-69 
6066 60 - 64 
660666 JOEY, 
06666606 6 50 - 54 
66 6 6 45-49 
6 6 6 6 40 - 44 
900000 6 SS roo, 
666 30 - 34 
Zou e9, 

20 - 24 

Onealy 

10 - 14 

ag 

0-4 


Fic. 1. Results of applying mortgage application blank scoring key to holdout group (NV = 100). 


low end of the scale (identifying the indi- 
vidual with poor payment potential). Accept- 
ing this inverted emphasis, an examination of 
Figure 1 suggests that rejecting individuals 
achieving mortgage loan application blank 
scores of 34 or below would result in eliminat- 
ing individuals most of whom would be poor 
risks. Lowering screening standards just one 
score interval (rejecting individuals scoring 
29 or below) has even more profound effects 
in terms of the percentage of known delin- 
quents who would have been rejected. 

As is obvious, delinquency and nondelin- 
quency do not comprise opposing cells in a 
dichotomy. Rather, they are the qualitative 
extremes of a continuum which can be con- 
ceived of in quantitative terms. Perhaps the 
truth of the matter is that the items discussed 
here are differentiating with regard to “late- 
ness” of payment (quality), while the cri- 


terion defines lateness in terms of number of 
months of delinquency (quantity). Suffice it 
to say that at the upper extreme of the late- 
ness continuum, lateness evolves into risk for 
the loan-granting institution—risk expressed 
as the cost associated with efforts at collec- 
tion of back payments, indirect expenses re- 
lated to the clerical handling costs of delin- 
quent accounts, legal action, and so forth. 
The typical mortgage-lending institution is 
interested in the mortgagee’s “success” in 
maintaining a periodic payment schedule, and 
the items found valid in this study may be 
viewed as contributors to or inhibitors of such 
success. Considering the signs of the item cor- 
relations cited earlier, payment success would 
seem enhanced by a working wife, a good 
credit rating (a history of payment success), 
and relatively great age of husband in rela- 
tion to numbers of individuals dependent upon 
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him and number of other accounts for which 
he is liable. On the other hand, negative im- 
plications accrue to large numbers of children 
and large numbers of accounts other than 
mortgage, and to relatively great size of 
monthly mortgage payments relative to hus- 
band’s age and relatively great size of mort- 
gage loan in relation to family’s total income. 

As will be noted, age of husband entered 
into the calculation of various scorable ratios, 
even though age of husband did not in itself 
differentiate (this failure is true of all bits of 
information employed in the scorable ratios). 
While there is no question that these ratios 
will correlate with one another (they are 
functions of a common variable), the distri- 
butions arising from them were sufficiently 
different to warrant inclusion in the final key. 
In fact, previous research in the life insurance 
industry (Ferguson, 1967) has furthered the 
concept of “economic maturity” as a predic- 
tor of life insurance sales success, and it may 
be that somewhat the same sort of phenome- 
non is causing age-related ratios to differenti- 
ate mortgage-payment behavior, even though 
age itself is not significant for differentiation. 

As mentioned earlier, it was not possible to 
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delete accounts of 1 month’s delinquency be- 
cause of sample-size considerations. Accord- 
ingly, such were treated the same as truly de- 
linquent accounts (delinquencies in excess of 
1 year were not uncommon). This practice 
has the effect of masking differences, since 1 
month’s delinquency may represent oversight 
rather than true delinquency. As a result, the 
coefficients, differences, and percentages re- 
ported here may be underestimates of the 
practical and statistical significance of the 
scoring key. 
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WORD LEGIBILITY AS A FUNCTION OF LETTER LEGIBILITY, 


WITH WORD SIZE, WORD FAMILIARITY, AND 
RESOLUTION AS PARAMETERS 
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To determine the relationship between letter legibility and word legibility, in- 
dividual characters in context were compared with isolated characters. The 
words included in the stimulus material varied in familiarity and in length. 
2 independent variables were introduced to degrade legibility: character size 
and resolution. Characters of 4 heights were reproduced at each of 20 vertical- 
horizontal resolution combinations. Word legibility was found to increase with 
resolution and with character size. Word length, over the range investigated, 
proved to be unimportant, but word familiarity was an important determinant 
of legibility. Familiar words were found to be about as legible as individual 
letters, with a product-moment correlation coefficient of .98. Unfamiliar words 
were generally less legible than individual letters, with a coefficient of .94. 
If the degree of familiarity of words is known, word legibility can be predicted 


with reasonable accuracy on the basis of letter legibility. 


In efforts to measure legibility, many differ- 
ent criterion measures have been applied. 
Viewing distance and speed of perception have 
been the most widely used threshold mea- 
sures; speed and accuracy of reading are often 
the basis for suprathreshold measurements. 
Words and nonsense syllables have been com- 
pared in investigations of legibility, with the 
general conclusion that meaning and famili- 
arity contribute to identification. Howes and 
Solomon (1951), studying the influence of 
familiarity on speed of recognition, found the 
visual duration threshold (measured tachisto- 
scopically) to be a roughly linear function of 
word frequency based on the Thorndike-Lorge 
word count. The product-moment correla- 
tions between log word frequency and visual 
exposure duration ranged from —.76 to —.83. 

In subsequent experiments by Solomon and 
Postman (1952), pronounceable nonsense 
words were introduced in order to control the 
“frequency of prior usage.” When the rela- 
tionship between tachistoscopic recognition 
threshold and frequency was measured, the 
recognition thresholds were found to vary in- 
versely with word familiarity (as it had been 
built into the experiment). Howes (1954) 
interpreted the Solomon-Postman results in 
terms of response emission probability rather 
than exposure duration threshold. In his view, 
word frequency in a large general word count 
is best interpreted as the average probability 


of emission by the S (which Howes calls the 
“base probability”). Consequently, he argues, 
a short exposure will suffice for any word 
whose base probability is almost as great as 
its probability of occurrence in the set of 
words whose legibility is being measured. 

Since legibility can be defined in so many 
different ways, any new study should clarify 
its particular approach as precisely as possi- 
ble at the outset. Legibility is defined here 
as the average percentage of correctly iden- 
tified letters or words. Legibility is recognized 
as depending on variables in both the stimu- 
lus and the observer. Typically, stimulus ma- 
terial may vary in respect to character size, 
type style, height-width ratio of characters, 
stroke width, color, paper surface, and con- 
trast. Variables which include the user are, 
for example, viewing angle and distance, 
visual exposure duration, illumination, and 
visual acuity. There are additional influences, 
not so readily defined and measured, which 
are implicit in any attempt to measure legi- 
bility. For instance, the S is assumed to 
be familiar with the alphabet from which 
the test characters are chosen; otherwise the 
idea of recognition or identification would be 
meaningless. 

Recently, Saltz (1967) has called attention 
to fundamental problems in the way word 
frequency—and consequently, word familiar- 
ity—has been measured. He points out that 
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the Thorndike-Lorge word count disregards 
word meaning in counting frequency, making 
it impossible to distinguish the two potential 
influences on legibility. A single spelling 
(“bear,” for example) may represent two 
meanings, yet its occurrences are counted as 
if it were a single word. Familiarity, fre- 
quency, and meaning overlap in ways which 
blur the significance of the numbers used to 
represent each factor. 

Whatever the proper interpretation may 
be, the cumulative evidence indicates that 
readers recognize frequent or familiar words 
more quickly than words rare to them, and 
meaningful words more quickly than nonsense 
syllables. When distance, rather than speed of 
perception, is the criterion measure, this con- 
tinues to hold true. Taylor (1934) found 
meaningful words to be readable at a greater 
distance (approximately 50% greater) than 
nonsense words, whether the printing was 
black or white. Tinker (1963), investigating 
differences between uppercase and lowercase 
letters, found less than a 6% difference in 
distance, but the difference favored meaning- 
ful words for both cases. It might be noted 
that Tinker’s nonsense words were his mean- 
ingful words spelled backward; for example, 
“evoba” comes from “above.” 

The redundancy in individual letters con- 
tributes some degree of familiarity to any 
combination of letters, whether it is a mean- 
ingful word or a nonsense syllable. Tinker 
(1963) has shown that the upper halves of 
words are more legibile than the lower halves, 
and the work of Crooks (1959) gives some 
indication of the amount of degradation indi- 
vidual forms (other than words) can suffer 
and still be recognizable. In a sense, the pres- 
ent study attempted to determine how much 
character degradation can occur before legi- 
bility is affected; the main objective, however, 
was to quantify the relationship between indi- 
vidual letter legibility and word legibility. 

In planning the experiment reported here, 
it was assumed that lowercase letters are more 
legible in the context of intelligible words 
than separately, character by character. This 
assumption is in keeping with work reported 
by Tinker (1963) which measured legibility 
by such criteria as fixation frequency, per- 
ception time, and perception distance. Infor- 
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mation already available (from Erdmann & 
Neal, 1966) had determined the resolution 
needed to make individual characters legible 
(as required in parts lists, for example, and 
other arbitrary groupings). That resolution 
was assumed to establish the maximum reso- 
lution requirement for all conditions. There- 
fore, if systematic trends were found between 
word legibility and resolution, the correspond- 
ing relationship between word and letter legi- 
bility could be quantified, and future ques- 
tions about word legibility could be answered 
by research on individual characters. 

On the basis of these considerations, the 
present experiment was designed to determine 
the legibility of words as a function of hori- 
zontal and vertical resolution and character 
size, much as individual character legibility 
was determined by Erdmann and Neal 
(1966). Since other legibility studies (includ- 
ing those cited) suggest the need for more 
precise characterization of words, two poten- 
tially important variables—word familiarity 
and word length—were introduced as addi- 
tional experimental parameters. 


MertTHOD 


The approach taken was to (a) design a test docu- 
ment which incorporated certain differences (in char- 
acter height, word length, and word familiarity) ; 
(b) reproduce the document, adding systematic dif- 
ferences in vertical and horizontal resolution; (c) 
submit the resulting stimulus material to Ss to meas- 
ure legibility; and (d) relate the legibility data thus 
derived to the physical variables under investigation. 
The procedure followed made it possible to start 
with a relatively small number of words, include 
them all in the original document, and then re- 
arrange the material systematically after printing to 
prevent interaction between the content and the ex- 
perimental parameters (word length and familiarity). 

The content includes 144 familiar and 144 un- 
familiar words selected from Thorndike and Lorge 
(1944) to represent a certain range of word lengths 
(see Figure 1). The “familiar” are among the 1000 
words most common in English; the “unfamiliar” 
appear less often than once in a million words, and 
more often than once in 4 million. The row of indi- 
vidual characters below each set of words is made 
up of 17 letters in random order, with a, e, and o 
repeated. The whole document is in lowercase Dual 
Gothic type, selected for its relatively uniform stroke 
width and its simplicity. 

Where the criterion measure is legibility, defined 
as the percentage of correctly identified letters or 
words, degradation variables are often introduced to 
control some aspect of the experimental situation— 
viewing time or distance, for example, or resolution, 
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Fic. 1. Test document for word legibility study (one of four sessions). 


size, or contrast in the stimulus material. In the work 
reported here, character size and resolution were 
varied, but high contrast was maintained through- 
out the printing process. In Sections B, C, and D of 
the test document, character heights were 40, 60, 
and 80 mils, respectively. In Section A (shown in 
Figure 1), characters were 100 mils in height, corre- 
sponding approximately to the 12-point type familiar 
in typescript and elsewhere. (The point system is 
standard in typography, but does not specify charac- 
ter size exactly. Character height is the better basis 
for direct comparisons of the effects of character 
size.) 

The test document, mounted on the drum of a 
digital facsimile device for scanning and printing, 
was reproduced at 20 different combinations of hori- 
zontal and vertical resolution. The device scanned 
across the document in narrow bands, sampling 
periodically to determine if the document at suc- 
cessive points was black or white. The smaller the 
size of the scanning aperture and the more frequent 
the sampling, the greater the resolution. The resolu- 
tion obtained can also be described, as it is more 
commonly, by the number of samples or elements 
per inch on the output copy. One can consider the 
scanned document to be covered by a grid, with 
each square in the grid representing a resolution 
element and the dimensions of the grid being the 
shape of the scanning aperture. Figure 2a shows how 
such a grid might reproduce one character on a docu- 
ment page. The character outlined represents the 
printed copy being scanned, and the shaded blocks 
show how the character would be “reported” by 
five successive scan lines. Figure 2b is an actual 
photograph of a character before and after digital 
facsimile reproduction. 

Five different horizontal resolutions and four dif- 
ferent vertical resolutions were used in this study. 
The size of the scanned element (square of the grid) 
for each of the 20 combinations is shown in Table 
1. From the 20 copies reproduced, four test booklets 
were assembled so that each group of words (Sec- 
tion A, B, C, or D on the master document) was 


presented on five consecutive pages at the lowest 
vertical resolution in combination with each of the 
five (increasing) horizontal resolutions. The next 
five pages represented the next larger vertical resolu- 
tion at each horizontal resolution (for the same 
group of words) and so on. Sections A, B, C, and D 
were presented in that order in the first test book- 
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Fic. 2a. Digital facsimile “grid” showing black/ 
white decision for each element area. 
























































Fic. 2b. Printed character before and after digital 
facsimile reproduction at a resolution of 150 x 100 
elements per inch (20 X Magnification). 
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TABLE 1 


RESOLUTION COMBINATIONS (IN SQUARE MILs) 
ComMPpARED IN Worp LeEcrpriity Stupy 


R. L. ERpDMANN AND A. S. NEAL 


TABLE 2 


LEGIBILITY AS AFFECTED BY WorpD LENGTH AND 
FAMILIARITY FOR Two CHARACTER HEIGHTS 

















Horizontal resolution 
Vertical (elements/inch) 
resolution 
(elements/inch) 
75 100 125 150 200 
100 133 100 80 67 50 
125 107 80 64 53 40 
150 89 67 BS) 44 33 
200 67 50 40 33 De) 








let, in another sequence in the second booklet, etc. 
(Four of the 24 possible sequences of word groups 
were selected at random.) 

The procedure permitted subsequent evaluation 
of the effects of 20 element areas and independent 
evaluation of the effects of horizontal and vertical 
resolution. Since the page order within each booklet 
was in the order of increasing resolution, the S$ 
would be expected to identify a larger percentage 
of words on each successive page. 

The same words were read at all five horizontal 
resolutions while vertical resolution was held con- 
stant to eliminate confounding with word form and 
familiarity. This required modifying the method of 
limits (by using only an increasing series of hori- 
zontal resolutions) to prevent memorization of the 
stimulus material from an earlier page. This design 
was also practical in view of the limited time and 
resources available to produce the stimulus materials 
and conduct the experiment. Pilot experimentation 
prior to the main experiment indicated negligible 
practice effects when only the ascending series was 
used. 

The Ss for the experiment were 72 college students 
with 20/20 vision, natural or corrected, who volun- 
teered for the task and received a fixed compensation. 
Each S was tested separately in a well-lighted booth. 
He was told that the object of the study was to 
determine how printing resolution was related to the 
legibility of letters and words. He was then asked to 
read aloud as many of the words and letters as pos- 
sible in one of the four test booklets, page by page 
from front to back. If he could not pronounce a 
word, he was allowed to spell it, but the word was 
not considered identified unless every letter was 
named correctly (i... each word was considered as 
a unit). Proceeding at his own pace, each S spent 
approximately 1 hr. at the task. The “answers” re- 
corded on coded sheets by the E were subsequently 
transferred to punched cards, and legibility results 
were calculated by means of programs written for the 
IBM 1620 computer. 


RESULTS 


From the accumulated data, the computer 
calculated the percentage of correctly iden- 








Average percent legibility 
Word : é 
Tetcih 40-mil characters 60-mil characters 
Familiar | Unfamiliar | Familiar | Unfamiliar 

3 letters 43.66 18.01 
4 letters 84.31 70.07 
5 letters 40.83 16.80 
6 letters 89.81 64.21 
7letters}| 43.03 25.62 
8 letters 91.46 77.48 





tified words for each character size as a func- 
tion of word length, familiarity, and vertical 
and horizontal resolution. Most of the 20 
resolution combinations produced 100% legi- 
bility for words with 100-mil and 80-mil char- 
acters. The ranges were 93-100% and 70- 
100%, respectively. For 60-mil words the 
lower limit dropped sharply, the range being 
30-100%; for 40-mil words, legibility ranged 
0-100%. 

This preliminary process of elimination 
made it apparent that the more detailed analy- 
sis should concentrate on the shorter char- 
acters, since the range of legibility for the 
taller was so limited. An analysis of variance 
was therefore performed for the 40-mil and 
60-mil words to determine the effects (if any) 
of word length and familiarity. The results, 
summarized in Table 2, show statistically sig- 
nificant differences between familiar and un- 
familiar words. For word length, on the other 
hand, the F ratio indicated negligible effects 
(see Table 3). The data were therefore com- 
bined across all word lengths for the remain- 
der of the analysis. 

The three remaining categories—letters, 
familiar words, and unfamiliar words—were 
compared as shown in Figure 3, with vertical 
and horizontal resolution combined to make 
element area the unit of resolution. (Subse- 
quent interpretations of the data took into 
account the possible effect of the proportions 
of that unit, and considered vertical and 
horizontal resolution separately.) As in the 
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TABLE 3 


ANALYSIS OF VARIANCE DuE TO Worp LENGTH 
AND FAMILIARITY FOR Two CHARACTER 
HEIGHTS 





Item | df 





MS | F ratio 





40-mil characters 











Familiarity (F) 1 15,000.59 14.46* 
Word length (L) 2 310.85 30 
Bex 2 190.61 18 
Error 114 1,037.56 = 
60-mil characters 

Familiarity (F) 1 9,655.29 15.54* 
Word length (L) 2 723.93 LY 
FXL 2 440.99 ey 
Error 114 621.38 — 





Note.—Critical ratio for F1jsi00 = 6.90; for Fe2s100 it is 4.82. 


*p < .01. 


Erdmann and Neal (1966) study of individual 
characters, legibility systematically increased 
with character height and with resolution (i.e., 
with smaller element areas). As to the relative 
legibility of words and letters, and the effect 
of familiarity, the results showed familiar 
words and individual letters to be consistently 





LEGIBILITY (PERCENT) 














25 40 55 70 85 100 115 130 
ELEMENT AREA (MILS2) 


mrmre FAMILIAR WORDS 
UNFAMILIAR WORDS 
meewnr= LETTERS 





Fic. 3. Legibility of familiar and unfamiliar words, 
and of letters, at four character heights. 
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Fic. 4. Legibility of 60-mil characters, showing sepa- 
rate effects of horizontal and vertical resolution. 


more legible than unfamiliar words, at all 
character heights. 

The independent effects of vertical and 
horizontal resolution were plotted in a repre- 
sentative graph for one character height (60 
mils). As that graph (Figure 4) shows, the 
legibility of both familiar and unfamiliar 
words increased steadily with improved resolu- 
tion in either direction. Again, familiar words 
were consistently the more legible. Another 
approach (see Figure 5) assumed a goal of 
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Fic. 5. Horizontal and vertical resolution required 
for 90% legibility with 60-mil characters. 
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90% legibility with 60-mil characters, and 
plotted the corresponding resolutions. Famil- 
iar words reached that legibility level at the 
lowest resolution; individual letters required 
greater resolution, and unfamiliar words, still 
greater. 

The results of the experiment can be seen 
most graphically in Figure 6, where all the 
data points are plotted for legibility levels 
below 100%. The open circles represent av- 
erage legibility for familiar words (by word 
length) and black dots represent unfamiliar 
words. At points below the dashed line, word 
legibility is higher than letter legibility. The 
solid curves which enclose the data points are 
labeled to summarize the data trends. Word 
legibility varied considerably in some cases in 
relation to a given level of letter legibility. 
The extreme case—a range of more than 50% 
—was observed at about 40% letter legibility. 
Word legibility varies less, of course, near the 
O and 100% levels of letter legibility. The 
product-moment correlations for the relation- 
ship between letter and word legibility are: 
.98 for familiar words, .94 for unfamiliar 
words, and .93 for words regardless of degree 
of familiarity. 

These correlations depend in part upon the 
particular experimental conditions. If data 
had been collected only at points where the 
legibility was 0 or 100%, the correlations 
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Fic. 6. Word legibility versus character legibility, 
as affected by word familiarity. (Open dots represent 
average legibility for familiar words; closed dots, for 
unfamiliar words.) 
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would, of course, be unity. In practice, how- 
ever, the data points for the correlations show 
variability at each legibility level. Thus one 
would expect that for any reasonable distri- 
bution of conditions the trends indicated in 
this study would be verified; namely, the 
legibility of individual letters is more highly 
correlated with the legibility of familiar words 
than with that of unfamiliar words. 


DiscuSssION AND CONCLUSIONS 


Up to a point, the word legibility experi- 
ment serves to corroborate the findings of the 
related character legibility study (Erdmann & 
Neal, 1966). The results are comparable with 
respect to the effect of vertical and horizontal 
resolution on legibility for selected character 
heights. As expected, the taller the characters 
in use, the more legible the word at any given 
resolution. Correspondingly, for a given char- 
acter height, word legibility increased with 
increasing resolution. 

The specific purpose of the present study 
was served by the results which relate word 
legibility to character legibility, in the inter- 
est of predicting the former from the latter 
(which may be more readily measured). Pre- 
vious work (Tinker, 1932, 1963) has led to 
the conclusion that words in general are more 
legible than individual lowercase letters. The 
present experiment showed this to be true 
only for familiar words at high levels of legi- 
bility. If a word was familiar to him, the S$ 
needed only a few legible letters to enable 
him to identify it. If it was unfamiliar, more 
of the letters needed to be legible to make 
the word legible. At the extreme, when S$ 
spelled out unfamiliar words, word legibility 
was found to be lower on the average than 
for individual letters, since no partial credit 
was allowed for words. 

The results are of course dependent upon 
the definition of word legibility which con- 
siders a word as a unit. A word is either 
recognized or not, and no credit is given un- 
less all the characters in the word are iden- 
tified. This definition follows the convention 
set by Tinker (1932) and employed in cur- 
rent legibility research (Kinney & Showman, 
1967; Shurtleff, 1967). 

If characters were grouped into words but 
credit was given for individual characters, one 
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would expect familiar words to score some- 
what higher than these groups of random 
characters. Longer familiar words might score 
higher than shorter words up to some limit, 
beyond which the identification of a few let- 
ters no longer insured identification of the 
word (even though length was not important 
when words were scored as a unit). 

With the same scoring scheme (i.e., credit 
for characters), unfamiliar words should be 
approximately equal to random letters in 
legibility. Very unfamiliar words would proba- 
bly require the identification of every char- 
acter (except where knowledge of the common 
sequences of letters in English words offered 
an advantage). The important thing to note 
is that even where the measurement process 
differs, the legibility of words remains a func- 
tion of word familiarity. 

In conclusion, word familiarity was found 
to have considerable influence on unit word 
legibility, whereas word length had little effect. 
For familiar material, the average legibility 
of words can be expected to be equal to or 
higher than the average legibility of indi- 
vidual letters. However, for unfamiliar ma- 
terial (one might think of words in a foreign 
language, or technical information outside 
one’s own field), word legibility can be ex- 
pected to be less than the average legibility 
of individual letters. With correlation coeffi- 
cients ranging from .98 for familiar words to 
.93 for all words, it is apparent that knowledge 
of letter legibility is helpful in predicting 
word legibility, and knowledge of word famili- 
arity further increases the accuracy of such 
predictions. 
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EFFECT OF LANE-CLOSURE SIGNALS UPON DRIVER 
DECISION MAKING AND TRAFFIC FLOW * 


KENNETH PERCHONOK ? anp PAUL M. HURST 
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A decision-theoretic model was applied to driver behavior on an urban express- 
way. The model permits inferential measurement of “responsiveness versus 
confusion” and “risk-taking predisposition.” These variables, as well as 2 direct 
measures (“risk taking” and “hazard”), were measured in a field study of 
forced merging from a blocked lane. Resultant indexes were compared under 2 
experimentally manipulated methods of lane closure: signal closure (SC) and 
conventional closure (CC). The SC method, which provides earlier warning of 
the closure, was superior to CC in terms of lower hazard and greater responsive- 
ness versus confusion. There were also improvements in several traffic-flow 


characteristics. 


The purpose of this study was to opera- 
tionally define variables immediately involved 
in, and resulting from, the decision-making 
processes of drivers in a merging situation, 
to measure the changes in these variables 
under differential traffic-control systems, and 
hence to use these measured changes to evalu- 
ate those systems. The investigation proceeded 
in the context of stimulus-dependent response 
elicitation mediated by a decision-making 
process. 

Briefly, the model for decision making 
states that a gap of size g will be accepted 
if it is larger than a driver’s threshold, G, the 
smallest acceptable gap. It follows that P, 
(A), the probability of accepting a gap of size 
g, is equal to the probability, P(g > G), that 
g is above threshold, and that, as g grows 
larger, P, (A)=P(g>G) increases from 
0 to 1. It has been argued by Hurst (1964) 
that as drivers become better able to per- 
ceive and interpret environmental information 
their evaluations of incurring risk in gap ac- 
ceptance/rejection become better, and their 
behavior becomes more consistent with re- 
spect to the immediate environment, and that 
this consistency can be measured by 7, the 
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correlation ratio between gap “size” and ac- 
ceptance/rejection. Perchonok (1964) showed 
that this increased consistency is essentially 
equivalent to an increase in the slope of P,(A) 
for 0< P,(A)< 1. Conversely, as drivers 
become less responsive to the relevant environ- 
ment (they behave in a more confused man- 
ner), the response curve flattens out and the 
value of 7 is diminished. Thus 7, as an index 
of “responsiveness versus confusion,’ mea 
sures the ability of drivers to cope with their 
respective environments. 

In addition to changes in the degree of 
responsiveness, driver behavior also reflects 
the amount of risk that drivers are willing 
to tolerate. That is, if G, the gap-acceptance 
threshold, is lowered, P(g > G) increases and 
P,(A), the probability of acceptance, in- 
creases. Thus, the gap-acceptance threshold 
reflects “risk-taking predisposition.” The ef- 
fect of change in driver responsiveness and 
risk-taking predisposition is illustrated for a 
hypothesized example in Figure 1, where, in 
comparison to Curve A, B shows decreased 
responsiveness and C shows a lowered thresh- 
old or increased risk-taking predisposition. 

It can be seen that either type of change in 
the response curve is likely to reflect the 
probability of accepting a small gap, which 
is taken as our definition of “risk taking.” 
That is, not only would a decreased threshold 
yield an increased probability of accepting a 
small gap, but decreased responsiveness would 
also tend toward that outcome. 

Finally, a fourth measure was employed as 
an index of resultant “hazard.” The measure 
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is given simply by the number of small gaps 
accepted; the greater the number of small 
gaps accepted, the greater the net hazard to 
drivers of merging vehicles and to those in the 
adjacent lane. It should be noted that al- 
though hazard is a function of risk taking, it 
is also a function of the distribution of avail- 
able gap sizes—if very few small gaps are 
presented, hazard must be low even if the 
probability of accepting a small gap, when 
presented, is high. 

In order to pursue this work most directly, 
it was decided to establish the “most relevant 
stimulus” with respect to gap-acceptance be- 
havior. Such a measure of gap size was 
selected on empirical grounds from a list of 
27 algebraic combinations of variables de- 
scribing the kinematics of the merging vehicle 
and the vehicles forming the gap (Hurst, 
Perchonok, & Seguin, 1968). 


METHOD 
Data Collection 


Data were recorded on 35-mm. film using an 
elevated camera overlooking a three-lane section of 
the John C. Lodge Freeway in Detroit, Michigan. 
The observation area was approximately 350 ft. 
long; the lane closure was established by a mainte- 
nance vehicle located in the medial lane at the down- 
stream end of that area. Data collection started at 
2:15 p.m. on two successive afternoons, one for each 
type of closure system. 

Under one system, conventional closure (CC), the 
only warning given to drivers was supplied by the 
truck and a wooden panel, supporting flashing lights, 
attached to the rear of the truck. Under signal 
closure (SC) additional warning was provided by an 
overhead sign approximately 1,200 ft. upstream of 
the truck. This sign displayed electrically lighted sig- 
nals consisting of a red “X” for the closed lane and 
a green arrow for each of the open lanes, In addi- 
tion, a traffic engineer operated a suggested speed 
indicator, at the same location, following the same 
procedures used in any normally occurring lane 
closure. 

The data reduction involved the transformation 
of pictorial information for each Y, sec. into quan- 
titative form via the Pictorial Data Transducer sys- 
tem developed by the Institute For Research, State 
College, Pennsylvania. This system involves a semi- 
automated film-scanning device, from which vehicu- 
lar positional data on IBM cards are obtained, and 
a battery of computer programs to organize this out- 
put into an analyzable format. A more complete de- 
scription of the system is given by Seguin and 
Perchonok (1965); the details of the transducer it- 
self are discussed by Hicks (1964). 
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GAP SIZE. 

Fic. 1. Three hypothetical response curves illus- 
trating differential responsiveness and_ risk-taking 
predisposition. 


Definitions and Formulas 


“Gap size’ was measured by a decision point 
which was defined as that time, to the nearest 4 
sec., at which the longitudinal locations of the rear 
of the lead vehicle defining the gap and the front of 
the subject vehicle in the medial (closed) lane were 
coincident. For the purposes of this study only 
those gap occurrences were included for which the 
lead vehicle had a higher velocity than the subject 
vehicle. Gap size as determined by Hurst et al. 
(1968) was measured as: 


gap size = [Hr/Vr] — Tr-s 


where Hy =the distance (ft.) between the lead and 
following vehicles, minus the length of the subject 
vehicle; Vr—speed (ft/sec) of the following ve- 
hicle; and Tr-s=the minimum time (sec.) re- 
quired by the following vehicle to match the velocity 
of the subject vehicle assuming maximum decelera- 
tion with a coefficient of friction of .6, and a 
response time of .75 sec. 

“Responsiveness versus confusion” was measured 
by », the correlation ratio, which is the square root 
of the proportion of acceptance/rejection behavior 
attributable to gap size. A higher value of 7 implies 
greater responsiveness and, therefore, less confusion. 

“Risk-taking predisposition” was measured by G, 
the mean value of the gap-acceptance threshold as 
computed by transforming the acceptance curve to a 
frequency distribution of thresholds and computing 
the mean as suggested by Spearman (Guilford, 1954). 

A “small gap” was defined as any gap whose size 
was less than the G computed from the pooled data. 
It was, then, any gap which the average driver 
would consider unacceptable. 

“Risk taking” was defined as the number of small 
gaps accepted divided by the number of small gaps 
presented. It was, thus, an estimate of the probabil- 
ity that a small gap would be accepted. 

“Hazard” was defined as the number of small gaps 
accepted. 


RESULTS 


Volumes under the two closure systems were 
essentially equivalent: under signal closure 
(SC) the volume was 67.1 vehicles/min, while 
under conventional closure (CC) the volume 
was 67.7 vehicles/min. However, relative 


TABLE 1 


LANE VOLUMES FOR SIGNAL CLosuRE (SC) AND 
CONVENTIONAL CLosuRE (CC) 





Volumes in Proportion of 








vehicles/min volume by lane 

Lane 
SC CE SC Ce 
Shoulder 31.8 30.5 AT 45 
Center 30.6 25.8 46 38 
Medial 4.7 11.4 07 alg 
Total 67.1 67.7 1.00 1.00 





lane volumes did change as a function of 
closure (Table 1). Table 1 further explicates 
the change in relative lane volumes in terms 
of the probabilities of entering the observation 
area in each of the three lanes. 

Chi-square tests showed the probability 
that a vehicle entering the observation area 
did so in the medial lane was significantly 
(p < .002) lower under SC and that the 
probability of entering in the shoulder lane 
was not significantly changed. Of the vehicles 
not entering the medial lane, the likelihood of 
entering the center, as opposed to shoulder, 
lane increased under SC but not significantly 
(p = .17). The relative lane volumes leaving 
the observation area were quite similar under 
the two closure systems. 

Other changes in traffic characteristics that 
were noted are shown in Table 2. The prob- 
abilities of these results occurring if chance 
alone had been in operation were sufficiently 
small to allow rejection of the hypotheses of 
“no difference.” 

The curves in Figure 2 are the empirical 
response curves under the two systems of 
closure. The two vertical lines represent the 
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GAP SIZE 


Fic. 2. Proportion of gap acceptances as a function 
of gap size for two types of closure, 
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TABLE 2 


THREE TRAFFIC-FLOW CHARACTERISTICS FOR SIGNAL 
CLosuRE (SC) AND CONVENTIONAL CLosuRE (CC) 





Characteristic SGCp | CCalew 
Median speed of medial lane vehicles} 
at decision point (ft/sec) 18.7 | 14.0] .058 
Median number of vehicles in 
observation area 16.0 | 17.0} .042 
Median waiting time from area en- 
trance to gap acceptance (sec.) 6.0] 9.0} .012» 





a Significance level using one-tailed Mann-Whitney U test 
that observations under one system were stochastically higher 
than those under the other system (Siegel, 1956). 

b Two-tailed test. 


The values of the decision-making correlates 
for SC and CC along with significance levels 
for respective changes are given in Table 3. 
All differences, except that for risk taking, 
had small probabilities of chance occurrence 
and were therefore accepted as “real” dif- 
ferences. 


DIscUSsSION AND CONCLUSIONS 


The similarity of overall volumes seems to 
indicate that observed differences are reason- 
ably attributable to the type of closure, as 
opposed to the nature of the traffic. The 
relative effect of SC upon lane volumes was 
as expected: the early warning of lane closure 
was successful in inducing drivers to leave the 
medial lane causing a concomitant increase of 
volume in the center lane. Thus, the signal 
system was successful with respect to its 
primary function. 

Additional benefits included a decrease in 
the slowing of subject vehicles and a shorter 
waiting time before leaving the closed lane. 
The significant reduction of vehicle density in 
the observation area implies that vehicle 
velocities tended to be higher, and that po- 
tential capacity was increased. Thus, the ef- 
fectiveness of the SC apparently extended to 
a general improvement in traffic flow. 

The decreased confusion during SC is 
graphically exhibited in Figure 2 which ex- 
hibits a tendency toward more appropriate 
behavior than under CC; namely, the likeli- 
hood of accepting very small gaps and of re- 
jecting large gaps is lower under SC than CC. 
It should be noted that this phenomenon 
obtains even in the face of a changed thresh- 


DRIVER DECISION MAKING AND TRAFFIC FLOW 
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TABLE 3 


DeciIsIoON-MAKING CORRELATES FOR SIGNAL CLosuURE (SC) AND CONVENTIONAL CLosuRE (CC) 














Variable SC 
Responsiveness versus confusion (n) .890 
Risk-taking predisposition (@) — 1.204 
Rice taking (= small gaps pou 43 

no. small gaps 
Hazard (no. small gaps accepted) 3 





CE Test Pp 
714 Wilcoxon® .063 
—.428 Wilcoxon? 024 
30 Fisher? .808 
9 Chi-square .050 








a The Wilcoxon matched-pairs signed-rank test was used to detect stochastic differences. The directions of change in ‘‘re- 
sponsiveness versus confusion” and in ‘‘hazard’’ were predicted; thus one-tailed hypotheses were tested. 


b Two-tailed. 


old. It can be inferred that although drivers 
under SC exhibited a greater risk-taking pre- 
disposition—a willingness to accept smaller 
gaps—their lower level of “confusion” per- 
mitted them to better behave in correspond- 
ence with the environment, with the result 
that there were no significant differences in 
actual risk taking by individual drivers. 
Thus, the effect of SC in reducing “confusion” 
paid off. The early alerting to lane closure ap- 
parently allowed drivers to better assimilate 
information about the environment and thus 
to respond more appropriately to it. 

The increased risk-taking predisposition 
under SC was unexpected. An explanation, al- 
though ad hoc, is available. It had been ob- 
served via the television surveillance system 
in Detroit that some drivers ignore the lane- 
closure signal and remain in the closed lane, 
taking advantage of its reduced volume. Since 
drivers receive an earlier warning under SC, 
it seems appropriate to assume that of the 
drivers in the observation area a greater pro- 
portion were there by choice under SC as 
compared to CC. Thus, instead of interpreting 
the increased predisposition toward risk tak- 
ing as a product of the situation in which the 
driver finds himself, it seems likely that the 
early warning acts as a selection device en- 
couraging the less aggressive driver to leave 
the closed lane. Thus, the more aggressive 
driver remained to take advantage of the 
reduced medial lane volume at the possible 
cost of encountering higher risk when, as he 
ultimately must, he changes lanes. 


Finally, it can be seen that measured “haz- 
ard” was greater under CC than SC. The 
differential comparison between “risk taking” 
and “hazard” is attributable to the reduced 
medial lane volume under SC. Here, again, 
the success of the signal system is obvious. 

It may be concluded that the SC system 
provided a significant safety improvement 
over the CC system in terms of reduced ob- 
jective “hazard” in gap acceptance. The 
means whereby this was achieved were a re- 
duction in the proportion of vehicles entering 
the observation area in the closed lane and 
(by inference) a decrease in “confusion” by 
drivers of merging vehicles. 
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LEADERSHIP PERFORMANCE OF NURSING SUPERVISORS 


AT TWO ORGANIZATIONAL LEVELS * 


STANLEY M. NEALEY anp MILTON R. BLOOD 2 


University of Illinois 


The relations of leader style and leader behavior to work-group performance 
and subordinate job satisfaction have been investigated for 22 Ist-level and 
8 2nd-level nursing supervisors in a Veterans Administration Hospital of 1680 
beds. Task-oriented (low least preferred co-worker—LPC) leaders received 
higher performance ratings at the 1st level of supervision, while relationship- 
oriented (high LPC) leaders performed better at the 2nd level of supervision. 
Subordinate’s job satisfaction was positively related to leader Consideration at 
both levels of supervision. However, Initiating Structure leader behavior con- 
tributed to high subordinate job satisfaction at the 1st level of supervision but 
low subordinate job satisfaction at the 2nd level. These differences in effective 
supervisory patterns point to differences in situational leadership demands at the 


2 supervisory levels. 


The present study was designed to explore 
in a field setting the relations of leadership 
style and leadership behavior on the one hand 
to measures of work-group performance and 
work-group satisfaction on the other, and to 
compare these relations across two organiza- 
tional levels. In other words the aim is to 
discover if and how leadership demands dif- 
fer by organizational level. 

There are several reasons for hypothesizing 
that the first-level supervisor and his boss, the 
second-level supervisor, will face different 
leadership demands. For one thing, leader- 
ship theory and leadership research have for 
several decades been moving away from the 
idea of universal leadership traits and the 
“one best way” to lead. In place of these 
earlier oversimplifications, leadership effective- 
ness is now generally conceded to involve an 
interaction between the leader’s characteris- 
tics, his behavior, the nature of the followers, 
and the characteristics of the particular lead- 


1 This study was conducted under Contract NR 
177-472, Nonr 1835(36) (Fred E. Fiedler, principal 
investigator). The authors wish to express their 
appreciation to Otto Schaefer, hospital director, and 
to Mark Ardis, chief of staff, for permission to con- 
duct the study. They are indebted to Roy Hamlin, 
research psychologist, for liaison and critical reading 
of the manuscript, to Fred E. Fiedler for guidance 
and support, and to the entire nursing service for 
their help and cooperation throughout the study. The 
authors particularly wish to thank James Catney, 
chief of nursing, for his active participation at every 
stage of the study. 

2 Now at University of California, Berkeley. 


ership situation, including the nature of the 
task and the organizational setting. Terman 
(1904) initially suggested this approach, and 
studies by Carter, Haythorn, Shriver, and 
Lanzetta (1951), Hemphill (1949), Stogdill 
(1948), Flanagan (1949), and many others 
(see Gibb, 1954) have led to its general ac- 
ceptance. The interactional conception of 
leadership is the basis for Fiedler’s (1964, 
1967) model of leadership effectiveness and 
figured prominently in reviews of research on 
supervision and _ productivity by Dubin 
(1965) and Sales (1966). The point is that 
a number of situational variables, for in- 
stance, follower characteristics, leader power, 
and group task, may vary markedly by 
organizational level. Leader behavior, in order 
to be effective, may need to vary accordingly. 

A number of writers in industrial adminis- 
tration, among them Barnard (1950), Niles 
(1949), and Pfiffmer and Sherwood (1960), 
have hypothesized that management func- 
tions do vary from level to level within organi- 
zations. Georgopoulos and Mann (1962) 
found that technical skills become less im- 
portant as managers move upward in organi- 
zations. Argyris (1964) and Katz and Kahn 
(1966) have expressed similar views. 

A partially contrasting perspective on the 
above question of differential demands by 
management level is provided by several 
studies in which managers at different organi- 
zational levels have expressed their opinions 
about what qualities lead to success (see 
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Fleishman, 1953; Pellegrin & Coates, 1957; 
Porter & Henry, 1964). These studies all 
found some differences by level, but these dif- 
ferences were not striking. On the whole, the 
managers surveyed did not feel that effective 
patterns of leadership attributes differ greatly 
by organizational level. Related studies by 
Rosen (1961) and by Triandis (1959a, 
1959b) reinforce this conclusion. 

The opinions of industrial administration 
scholars and those of managers thus seem in 
only partial agreement that organizational 
level is an important variable in determining 
which pattern of leadership behavior will be 
most effective. Whatever may be the eventual 
empirical resolution of these differences, if we 
are willing to entertain the notion that the 
leadership demands on top managers may be 
different in kind as well as degree from the 
leadership demands at lower levels, then we 
must face the possibility that experience at 
lower levels may have minimal value, or may 
even be disadvantageous, to effective perform- 
ance at higher levels in the organization. 


METHOD 
Research Site and Subjects 


This study was conducted in a Veterans Admin- 
istration hospital of 1,680 beds. The Ss were nursing 
personnel at four levels in the nursing service as 
follows: 248 nursing assistants, 43 staff nurses, 22 
head nurses, and 8 unit supervising nurses. All Ss 
except nursing assistants were RNs. 

The aspect of the study which is of concern here 
is a comparison of certain leadership variables across 
two levels of the nursing staff. For this purpose head 
nurses will be considered as first-level supervisors 
and will be referred to as such from here on. Each 
head nurse supervised directly three to eight nursing 
assistants as well as one or two staff nurses. The 
hospital was organized into six units, each with 
roughly equivalent functions. The nursing function 
in each unit was under the supervision of a unit 
supervising nurse who reported directly to the hos- 
pital’s chief of nursing. An evening supervising nurse 
and a night supervising nurse had responsibility for 
the nursing function for the whole hospital during 
their respective shifts. For purposes of analysis, these 
two have been lumped with the six unit supervisors 
and these eight nurses will hereafter be referred to 
as second-level supervisors. 


Predictor Variables 


Two predictor variables have been used in the 
present study. The first was a measure of leadership 
style, specifically Fiedler’s (1964) measure of the 
leader’s esteem for his least preferred co-worker 
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(LPC). The S is asked to think of the person with 
whom he has had the most difficulty in working. 
Then he describes this person by means of 17 evalua- 
tive bipolar adjective scales. A high LPC score indi- 
cates that S is relatively accepting of his least pre- 
ferred co-worker (even though he was hard to work 
with). Fiedler (1967) hypothesized that S with a 
high LPC score is generally oriented toward good 
group relations when in a leadership role. A low LPC 
score indicates that S has low esteem for the person 
with whom he cannot work well. Fiedler (1967) hy- 
pothesized that S with a low LPC score is generally 
oriented toward task accomplishment when in a 
leadership role. 

The second predictor variable was a measure of 
leader behavior as described by the leader’s sub- 
ordinates. Twenty-four items (the 12 highest loading 
and purest loading items on each dimension) were 
used from the Leadership Behavior Description 
Questionnaire (LBDQ), developed at Ohio State 
University (see Stogdill & Coons, 1957, pp. 108— 
109). It yields two scores. The first, Consideration, 
indicates the degree to which the leader exhibits a 
human-relations orientation toward his subordinates, 
while the second, Initiating Structure, indicates the 
degree to which the leader performs the management 
functions of planning, organizing, controlling, and 
pressing subordinates for production. 


Criterion Variables 


Two types of criterion variables were employed. 
The first type consisted of ratings of the perform- 
ance of organizational subparts. These ratings were 
made by superiors one level above the supervisor of 
the subpart in question. Thus, the performance of 
the second-level supervisors was rated by the chief 
of nursing and by his two principal assistants. The 
performance of first-level supervisors was rated by 
second-level supervisors. It is important to note that 
these ratings involved the performance of the whole 
subpart of the organization rather than just the 
leader of that subpart. Interest was in organizational 
performance rather than leader ratings. These per- 
formance ratings were made on 4-point scales and 
involved three areas of nursing performance as fol- 
lows: (1) physical patient care; (2) information 
about patients, the extent to which work-group 
members knew patients and their individual treat- 
ment schedules; and (3) human relations skill, skill 
in handling difficult emotional or social situations, 
understanding the why of patient behavior. A 
fourth rating, also on a 4-point scale, was a global 
rating of job performance. The 4 points on these 
scales were labeled, “much above average,” “above 
average,” “about average,” and “a little below aver- 
age” (among those subordinates rated). 

A second type of criterion was the mean job sat- 
isfaction of the immediate subordinates of each 
supervisor. Thus the subordinate job-satisfaction 
score of second-level supervisors consisted of the 
mean job satisfaction of the head nurses and staff 
nurses who reported to them. Likewise the subordi- 
nate job-satisfaction score of a first-level supervisor 
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TABLE 1 


CORRELATIONS BETWEEN LEADERS’ LEAST PRE- 
FERRED CO-WORKER SCORE AND RATED 
PERFORMANCE CRITERIA AT Two 
LEVELS OF SUPERVISION 





First- Second- 
level level 
Performance criteria aoe ae p(ri—re) 
(N=21) | (N=8) 
ry r2 
Physical patient care —.395 .584 <.05 
Information about patients —.373 -646 <.05 
Human relations —.063 .817* <1 05 
Global performance —,219 .790* <.05 





Note.—Throughout the article, two-tailed p values have been 
used, 
*5<.05. 


consisted of the mean job satisfaction of the nurs- 
ing assistants under her. Job satisfaction was mea- 
sured by means of the Job Descriptive Index (JDI) 
developed at Cornell University (Smith, Kendall, & 
Hulin, 1968). This index yields satisfaction scores 
in five areas: the work itself, supervision, people, 
pay, and promotional opportunities. 

Several other variables are included in the data 
analyses of this study. These include: (a) years of 
service as a registered nurse, (b) intelligence as 
measured by the Wonderlic Personnel Test (Wonder- 
lic, 1959), (c) a rating of leadership power on which 
the chief of nursing rated second-level supervisors 
who in turn rated first-level supervisors, and (d) a 
measure of the leader’s esteem for her immediate 
subordinates as a group. The leadership power rating 
consisted of nine items. Five items measured the su- 
pervisor’s “expert power” in doing the work, for ex- 
ample, how well the supervisor knows the work rela- 
tive to her subordinates, the amount of knowledge 
and skill the supervisor has, and her ability to give 
explicit and detailed guidance to subordinates. Four 
items measured the supervisor’s “delegated power,” for 
example, her degree of authority to reward or punish 
subordinates, her influence with superiors in having 
recommendations accepted, and her power to set 


TABLE 2 


CORRELATIONS BETWEEN LEADERS’ LEAST PRE- 
FERRED CO-WORKER SCORE AND RATED 
LEADER POWER AT Two LEVELS 
OF SUPERVISION 








First-level Sa . 
Rated leader | supervisors supervisors} p(r1—r2) 
power (NV =21) (N =8) 
T1 r» 
Expert power — 337 471 ns 
Delegated power —.224 -766* <.05 
Total power —.315 .614 <.05 


*p<.05. 
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work standards for subordinates. The expert- and 
delegated-power subscores were correlated .737 
among first-level supervisors and .777 among second- 
level supervisors. These two scores have been com- 
bined as a measure of leader power in some of our 
analyses. As shall be seen later, this rating of the 
leader’s power was highly correlated with the ratings 
of work-group performance in the present samples. 
The measure of leader’s esteem for subordinates con- 
sisted of 10 evaluative bipolar adjective scales? plus 
five questions measuring the leader’s confidence in 
and friendship for her subordinates and her evalua- 
tion of how well they do their work. 


Data Analyses 


Correlations between predictor variables and cri- 
terion variables will be reported separately for the 
two levels of supervision. Two-way analyses of 
variance were performed to determine if pairs of 
predictor variables interact to determine work-group 
performance and job satisfaction. One set of 
ANOVAs were performed using the leader’s LPC 
score and the leader’s esteem for subordinates as inde- 
pendent variables. Another set of ANOVAs involved 
the leader-behavior variables of Consideration and 
Initiating Structure as independent variables. These 
ANOVAs produced a number of significant main ef- 
fects at both supervisory levels, but significant inter- 
actions occurred only at the first level of supervision. 
ANOVA results will be reported only for these sig- 
nificant interactions. In other words, correlations will 
be reported rather than ANOVA main effects. 


RESULTS 
LPC Scores as Predictors 


The correlations between the leadership 
style measure (LPC) and the four ratings 
of performance by superiors are shown in 
Table 1. As can be seen the correlations differ 
in sign at the two supervisory levels. In gen- 
eral the high LPC (relations oriented) sec- 
ond-level supervisors received better ratings 
from superiors, while at the first level of 
supervision low LPC (task oriented) nurses 
were judged the better supervisors. However, 
none of the correlations at the first level of 
supervision quite reaches the two-tailed sig- 
nificance levels used for all significance tests 
throughout this paper. Column 3 of Table 1 
shows that the correlations between LPC and 
performance were significantly different at 
the two levels of supervision. 


3 These 10 items are identical to Fiedler’s (1967, 
Appendix A) “group atmosphere” scale. The part- 
whole correlation between the sum of these 10 items 
and the sum of the total 15 items was .958 among 
the first-level supervisors. 
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Ratings of the leadership power enjoyed by 
supervisors at both levels were made by su- 
periors just as were the ratings of perform- 
ance criteria. Raters were instructed to rate 
the expert and delegated power of the super- 
visor herself, but to rate the whole work 
group on the performance criteria. Despite 
this different emphasis, the power and cri- 
terion ratings turned out to be highly corre- 
lated at both supervisory levels. The power 
total score and the global rating of perform- 
ance were correlated .884 and .856 (p< .01 
in both cases) at the first and second levels, 
respectively, of supervision. Therefore, the 
power ratings are treated as another indica- 
tion of the excellence of job performance as 
seen by superiors. Table 2 shows the corre- 
lations between leader LPC scores and rated 
leadership power at the two levels. Again 
high LPC (relation oriented) second-level 
supervisors receive good ratings, and again 
LPC predicts in different directions at the 
two levels. 

The correlations between leadership style 
(LPC) and the job satisfaction of subordi- 
nates are shown in Table 3. The LPC score 
of supervisors did not correlate significantly 
with any area of subordinate job satisfaction 
or with the sum of the five job-satisfaction 
scores. 


Relations between Predictors 


Before moving on to the correlations be- 
tween leader behavior, as perceived by sub- 
ordinates, and the criterion variables, the 
relations among the two types of leadership 
predictor variables should be reported. Table 
4 shows the correlations between the leader- 
ship style measure (LPC) and the leader- 
behavior dimensions Consideration and Ini- 
tiating Structure. None of the correlations in 
Table 4 approaches significance. Apparently 
these two types of predictor variables were 
quite independent. Readers who may have 
fallen into the habit of interpreting high LPC 
scores as indicating high Consideration and 
low LPC scores as indicating high Initiating 
Structure should take note of this finding. 


LBDQ Scores as Predictors 


Descriptions of leader behavior as_per- 
ceived by subordinates failed to predict rat- 
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TABLE 3 


CORRELATIONS BETWEEN LEADERS’ LEAST PREFERRED 
CO-WORKER SCORE AND SUBORDINATE JOB 
SATISFACTION AT Two LEVELS 
OF SUPERVISION 











First-level oe : 
Job-satisfaction | supervisors : 
ee) (N =21) supervisors} p(r1—r2) 
(r1) EL 
(r2) 
Work — .2738 439 ns 
Supervision —.275 340 ns 
People — .288 —.407 ns 
Pay .208 094 ns 
Promotion —.116 —.475 ns 
Total satisfaction | —.199 326 ns 








8 None of the correlations in this table is significant at the 
-05 level. 
ings of performance by superiors. Table 5 
shows the correlations between the LBDQ 
scores Initiating Structure and Consideration 
on the one hand and rated performance and 
rated power on the other. Only one of the 
correlations is significant. Ratings of ex- 
pert power were correlated .494, p< .05, 
with Initiating Structure scores among first- 
level supervisors. One might conclude very 
cautiously that first-level supervisors rated 
high in Initiating Structure by their subordi- 
nates are well thought of by their superiors. 
Furthermore, the significance values shown in 
Column 7 of Table 5 lead to the conclusion 
that among first-level supervisors Initiating 
Structure behavior was valued more by su- 
periors than was Consideration behavior.* 


4 These differences were tested by means of Hotel- 
ling’s t test for the significance of the difference 
between 7yz and fzz when computed from the same 
population. (See Walker & Lev, 1953, p. 256.) 


TABLE 4 


CORRELATIONS BETWEEN LEADERS’ LEAST PREFERRED 
CO-WORKER SCORE AND LEADER BEHAVIOR AT 
Two LEVELS oF SUPERVISION 








First- Second- 
level level 
Leader behavior super- super- 
dimension visors visors b(r1—12) 
(N=21) | (VN=8) 
TL To 

Initiating Structure | —.167 els ns 
Consideration —.163 .418 ns 
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TABLE 5 


CORRELATIONS BETWEEN LEADER BEHAVIOR AND SUPERIORS’ RATINGS OF PERFORMANCE 
AND POWER AT Two LEVELS OF SUPERVISION 



































First-level Second-level 
supervisors supervisors Across levels Within levels 
(N = 22) (NV =8) 
Superiors’ ratings 
Initiating | Consider- | Initiating | Consider- 
Structure} ation | Structure} ation p(ni—rs) | p(rea—ra) | p(ri—re) | p(ra—ra) 
ry re rs r4 
Physical patient care} .107 elit 150 .012 ns ns ns ns 
Information about 
patients .240 S13 .090 306 ns ns nS ns 
Human relations .010 103 057 302 ns ns ns ns 
Global performance 413 077 Low —.015 ns ns <.058 ns 
Expert power 494% 162 —.144 .409 ns ns <.05 ns 
Delegated power 098 — 22 201 157 ns ns <.05 ns 
Total power 362 — .009 — .017 333 ns ns <.05 ns 








a This significance level may seem surprising since 7 =.413, df =20, is not significantly different from zero. The difference be- 
tween 7 =.413 and ry =.077 is significant because Consideration and Initiating Structure are correlated .704 in this sample. 
*D <.05: 


While leader behavior as rated by subordi- 
nates was only slightly correlated with per- 
formance ratings by superiors, it was closely 
related to subordinate job satisfaction. Table 
6 shows the correlations between LBDQ 
scores and job satisfaction as measured by 
the JDI. At the first level of supervision both 
Consideration and Initiating Structure are 
positively related to all five aspects of sub- 
ordinates’ job satisfaction and to the job- 
satisfaction sum. All but two of these rela- 
tions are significant beyond the .05 level. 


As Column 7 of Table 6 shows, Considera- 
tion was correlated with subordinate job sat- 
isfaction more highly than was Initiating 
Structure. Clearly, nursing assistants posi- 
tively valued both Consideration and Initiat- 
ing Structure behavior on the part of the 
first-level supervisor, but they valued the 
former more than the latter. At the second 
level of supervision the situation appeared 
somewhat different. Table 6 shows that Con- 
sideration was on the whole positively related 
to the job satisfaction of subordinates, but 


TABLE 6 


CORRELATIONS BETWEEN LEADER BEHAVIOR AND SUBORDINATE JOB SATISFACTION 
AT Two LEVELS OF SUPERVISION 











First-level 


Second-level 


supervisors supervisors Across levels Within levels 
(N= 22) (NV =8) 
Job-satisfaction 
area Md se ; aed 
Initiating | Consider- | Initiating | Consider- 
Structure} ation | Structure} ation bini—rs) | p(r2a—rs4) | P(ri—re) | p(rs—r14) 
ry 12 13 14 

Work .520* Ode .087 306 ns ns <.05 ns 
Supervision ei} Ons rail .820* <.01 ns <.05 <.05 
People FO BLO 494. —.144 ns <.05 ns ns 
Pay .256 .524* = 037 263 ns ns ns ns 
Promotion 348 nO 2iae —.669 .289 <.05 ns ns ns 
Total satisfaction .501* .809** —.170 624 ns ns <.01 ns 
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that Initiating Structure scores of second-level 
supervisors tended to be negatively related to 
the job satisfaction of their subordinates. The 
correlations between the leader-behavior di- 
mensions and satisfaction with supervision 
were not only significantly different from 
zero, in opposite directions, but were signifi- 
cantly different from one another (p < .05). 
In this situation it appeared the first-level 
supervisors wanted their own superior to 
evidence Consideration but zot Initiating 
Structure behavior. This is in contrast to the 
first-level supervisor’s own subordinates, the 
nursing assistants, who valued both Consid- 
eration and Initiating Structure. 

The above findings involving LBDQ scores 
should be interpreted in the light of the rela- 
tions between the Consideration and Initiat- 
ing Structure dimensions themselves. Among 
second-level supervisors these two dimensions 
were correlated —.570, while at the first level 
of supervision they were correlated +.704, 
p < .01. These two dimensions of the LBDQ 
are, of course, designed to be orthogonal. The 
12 items in each scale were selected because 
of their orthogonal factor loadings based on 
an industrial population as reported by Stog- 
dill and Coons (1957, pp. 108-109). While 
Stogdill and Coons report (p. 110) that their 
dimension scores intercorrelated only —.02 
in an additional sample of 122 foremen, the 
results of this study indicate a much lower 
degree of factor independence. As a check the 
LBDQ scores from the total sample of 251 
Ss who had completed the LBDQ were factor 
analyzed. The principal axis factor analysis 
even without rotation strongly supported the 
two-dimensional factor structure and a priori 
item scoring of the present data. Nevertheless, 
the high intercorrelations between the Consid- 
eration and Initiating Structure item com- 
posites in the samples of this study and the 
fact that these intercorrelations differed in 
direction from one organization level to an- 
other should be kept in mind. The authors 
would further recommend that the factor in- 
dependence of the LBDQ dimensions be 
checked as a matter of course in future studies 
using this instrument, and that these checks 
be carried out separately for groups of Ss in 
separate situations or experimental treat- 
ments. 
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Fic. 1. Significant ANOVA interaction at the first 
level of supervision. The leaders’ least preferred co- 
worker (LPC) score and esteem for subordinates 
interact to determine satisfaction with supervision. 


As previously noted, two-way analyses of 
variance were performed to see if the leader- 
ship predictor variables would interact signifi- 
cantly to yield differences on the criterion 
variables. For the most part they did not, but 
at the first level of supervision there occurred 
a set of significant interactions that are rather 
revealing. These analyses featured as inde- 
pendent variables the leadership style measure 
(LPC) and the leader’s esteem for her sub- 
ordinates (these two variables were corre- 
lateds— 003 aVi—"21 ) 2 

The first-level leader’s LPC score and her 
esteem for her subordinates interacted to pro- 
duce differences in her subordinates’ satis- 
faction with supervision (fF = 5.255, df= 
1/18, p < .05) and total job satisfaction (F 
=40.992,,0/ =1/18, (p)<205). These inter= 
actions are charted in Figures 1 and 2. As 
can be seen, first-level supervisors with high 
LPC scores and high esteem for their sub- 
ordinates stand out as having highly dissatis- 
fied subordinates. This finding was partially 
explained by the fact that the same two inde- 
pendent variables (LPC and esteem for sub- 
ordinates) interacted to produce significant 
differences in the Consideration ratings of 
these first-level supervisors (F = 5.166, df = 
1/18, p < .05). This interaction is charted in 
Figure 3. Nurses with high LPC scores and 
high esteem for subordinates were rated by 
those subordinates as lowest in Consideration 
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Fic. 2. Significant ANOVA interaction at the 


first level of supervision. The leaders’ least preferred 
co-worker (LPC) score and esteem for subordinates 
interact to determine total job satisfaction. 


behavior. Although the interaction did not 
reach significance, this same group of nurses 
was also lowest in Initiating Structure be- 
havior. In other words, these high LPC first- 
level supervisors who like and trusted their 
subordinates were seen by them as generally 
“inactive” in leadership behavior. Their sub- 
ordinates in turn were highly dissatisfied with 
their supervision and with their total jobs. 


Relations between Criteria 


At each of the two supervisory levels the 
correlations were computed between the five 
aspects of job satisfaction of subordinates and 
the four ratings of performance made by 
superiors. None of the 20 correlations at 
either level reached statistical significance. 
It is interesting to note, however, that 18 out 
of 20 of the correlations at the first level of 
supervision are positive, while 13 out of 20 
are negative at the second level of supervision. 
A x? test of the frequency of plus and minus 
correlations at the two supervisory levels 
yielded® y7i=111 2.90 Std [r=No ipr<e OOLIE 
should be pointed out that the 40 correlations 
on which this x? is based are not independent 
since there are substantial positive correla- 
tions within the JDI subscores and within the 
performance rating areas. The resulting y? is 
thus difficult to interpret. Conservatism 
compels us to stop short of the tempting con- 
clusion that the role expectations sent from 
superior and subordinate are compatible in 
the case of the first-level supervisor but con- 
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Fic. 3. Significant ANOVA interaction at the first 
level of supervision. The leaders’ least preferred co- 
worker (LPC) score and esteem for subordinates 
interact to determine Consideration. 


tradictory in the case of the second-level 
supervisor; and that here, contrary to the 
usual situation, the second-level supervisor 
is really the “man in the middle” caught be- 
tween conflicting demands from superiors and 
subordinates. 


DISCUSSION 


The present study was based on the single 
simple hypothesis that effective leadership 
style and leadership behavior patterns would 
differ across supervisory levels. This hypothe- 
sis was confirmed. In general, good perform- 
ance ratings by superiors went to high LPC 
(relations oriented) second-level supervisors 
and to low LPC (task oriented) first-level 
supervisors (see Tables 1 and 2). Taking 
subordinate job satisfaction as a leader goal, 
it was found that Consideration behavior on 
the part of the leader was positively valued 
by subordinates at both supervisory levels. 
Initiating Structure behavior was also posi- 
tively valued by subordinates at the first level 
of supervision, but was negatively valued at 
the second level of supervision. 

A post hoc interpretation of the job-satis- 
faction findings comes readily to hand. The 
subordinates of second-level supervisors were 
RNs and shared the same professional qualifi- 
cations as their leader. They liked her to be 
considerate but they did not want her to 
structure their jobs for them. On the other 
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hand, the subordinates of the first-level su- 
pervisors looked to them for both considera- 
tion and structure because they did not have 
the professional qualifications to handle psy- 
chiatric patients alone. Their leader was 
professionally qualified and they looked to 
her for structure. This suggests that the 
amount of Initiating Structure behavior a 
subordinate welcomes from a superior is 
partly a function of the difference between 
them in the degree of task expertise they 
possess. 

Since Initiating Structure and Considera- 
tion were highly positively correlated at the 
first level of supervision one may think in 
terms of “active” leaders and ‘inactive’ lead- 
ers at this level. The inactive first-level super- 
visors tended to be relations oriented (high 
LPC) and to have high esteem for their sub- 
ordinates. This finding fits well with Fiedler’s 
(1967, Chapter II) recent prediction that 
high LPC leaders who feel satisfied with their 
interpersonal relations with group members 
will sit back and take it easy; that is, they 
will be relatively inactive in dealing with 
the group. The remarkable thing in this case 
however is that the high esteem these first- 
level leaders felt for their subordinates was 
negatively reciprocated. Since the subordi- 
nate’s satisfaction with the leader and the 
leader’s esteem for the subordinate are two 
sides of the same relational interface one 
would expect them to be significantly posi- 
tively related. 

The present findings lend some confirma- 
tion to claims that different organizational 
levels will place different demands on leaders. 
While the authors do not expect the particu- 
lar patterns discovered in the present study 
to generalize very far, they would expect dif- 
ferences in leadership demands to occur when- 
ever the leadership situation differs sharply 
from level to level. In particular, two aspects 
of the present sample should suggest cau- 
tion against generalizing these findings di- 
rectly to other populations. First, one must 
consider that a nursing service may be a 
special kind of organization. Second, moving 
up through the levels of a hierarchy, n be- 
comes smaller at each level. In this case, at 
the second level dropped to 8. A correlation in 
such a small sample can be strongly influ- 
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enced by one or two individuals. Therefore, 
skepticism is appropriate regarding reliability 
of the second-level analyses. 

The present findings suggest several organi- 
zational management dilemmas and leave 
some unanswered questions. In selecting first- 
level supervising nurses, for instance, task- 
oriented (low LPC) applicants would seem to 
be preferable because in this situation they 
are generally active in leadership behavior. 
In selecting second-level supervisors, however, 
high LPC (relations-oriented) applicants may 
have a better chance of success. Most organi- 
zations as a matter of policy try to fill 
vacancies at upper levels of supervision with 
personnel who have demonstrated good per- 
formance at lower levels. In the present or- 
ganization, this policy would lead to the 
promotion of low LPC first-level supervisors 
to second-level supervisory positions. If these 
nurses carry their high-activity leadership 
pattern up with them, they stand to alienate 
their new subordinates by showing too much 
Initiating Structure behavior. This problem 
could be compounded if only low LPC nurses 
are originally selected for supervisory posi- 
tions at the first level. In that case, there 
would be no high LPC nurses available for 
promotion to second-level positions. 

On the other hand, low LPC nurses, if 
promoted to second-level positions, might 
change their old pattern of showing high con- 
sideration and high structure in response to 
new situational conditions. Further research 
is of course needed to determine the extent to 
which upward mobile managers can perceive 
changes in their new situation as opposed to 
the old. What steps can be taken to ease this 
transition are still largely matters of conjec- 
ture. 
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RELATIONSHIP OF MEMORY OPTIMISM TO WORK 
COMPETENCY AND PERSONALITY VARIABLES 


H. MELTZER anv D. LUDWIG! 


Washington University 


The purpose of this study was to investigate the relationship of memory 
dynamics and work competency to personality variables and _ personified 
values. The source material used was the life and work stories of 143 workers 
which included their reports on outstanding pleasant and unpleasant experi- 
ences, responses to questions concerning personified values, and ratings on 
their competency as workers. In general, statistically significant differences 
were found in favor of memory optimists also being rated as more competent 
on the job, having a higher level of autonomy, and rating higher in inter- 
personal competence. Workers who had high economic values were less often 
memory optimists, less competent on the job, and tended to be older. 


The purpose of this study was to investi- 
gate two problems related to personality 
variables—memory dynamics and work com- 
petency. In the area of memory dynamics the 
field is relatively barren as far as workers as 
Ss are concerned. Meltzer and Ludwig (1967) 
found that, although younger workers are sig- 
nificantly more often memory optimists than 
older workers, each age group expressed sig- 
nificantly more pleasant memories than un- 
pleasant. These findings were interpreted to 
suggest that personality factors rather than 
the age factor as such are influencing memory 
dynamics involved. In the only study previ- 
ously published on college students, Purcell 
(1952) found a relationship between person- 
ality factors and security feelings as mea- 
sured by the Maslow Security-Insecurity Test 
and discovered that features of childhood and 
adult memories alike were also shown to be 
significantly related to security scores. This 
he interpreted as an initial advance toward 
detailing personality and memory functions. 
An earlier investigator, Waldfogel (1948), on 
the basis of a lengthy study of the affective 


1An earlier version of this article was presented 
at the meeting of the Midwestern Psychological As- 
sociation, Chicago, May 1967. 
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characteristic of childhood memories, came to 
the conclusion that “more work should be 
done in the dynamics of memory optimism 
and memory pessimism and on personality 
differences between memory optimists and 
memory pessimists [p. 31].”’ One of the chief 
purposes of the present study was to try to 
satisfy the implied research need indicated in 
these studies by investigating the relationship 
of memory dynamics to personality variables 
in workers. 

Another important phase of this study was 
to investigate the relationship of work compe- 
tency to memory dynamics and personality 
variables. There is extensive literature on the 
relationship of work competency to job sat- 
isfaction and personality variables. Since the 
authors’ concern in this phase of the study 
was to investigate work competency to person- 
ality variables, there will be no review of the 
literature on job satisfaction and performance. 
Instead, interested readers may refer to the 
oft-quoted review by Brayfield and Crockett 
(1955), Triandis’s (1959) critique, and the 
comprehensive chapter devoted to this topic 
by Vroom (1964), particularly pages 181- 
187. One of the earlier studies concerned 
with personality variables rather than job 
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satisfaction is that of Georgopoulos (1957) in 
which he reported that workers who perceive 
that effective performance is a path to their 
goals are more productive. The stronger the 
need, the more will this relationship hold. In 
a study of blue-collar workers, Friedlander 
(1966) found no difference between high and 
low performance. He found that low perform- 
ers found recognition through advancement to 
be more important and that high performers 
attached more importance to opportunities 
for self-actualization in white-collar govern- 
ment workers, but he found no significant dif- 
ferences in blue-collar workers. Guion and 
Gottier (1965) found very little indication of 
a significant relationship between personality 
test results and prediction of performance. In 
this present study an attempt was made to 
investigate work competency as rated by in- 
terviewers with two personality variables, 
autonomy and interpersonal competence as 
well as memory dynamics. Also explored was 
the relationship of personified values to mem- 
ory optimism and work competency. In the 
light of the foregoing studies, as well as the 
authors’ previous study, three hypotheses are 
made. These can be expressed as follows: 

1. Memory optimists will generally be more 
adequate as workers, more autonomous, and 
will rate higher in interpersonal competence. 

2. People who rated as more competent 
workers will tend to be memory optimists, 
more autonomous, rate higher in interpersonal 
competence, and typically will be younger. 

3. Workers who are more economic minded 
as compared to those who are more political 
minded (in the Allport-Vernon-Lindzey sense) 
will less often be memory optimists, rate less 
competent on the job, and will tend to be 
older. 

Since the nature of the study will be more 
clearly understood in the light of knowledge 
of the source material and Ss used, as well 
as the industrial setting of the community in 
which the study was made, next will be pre- 
sented the method and procedures where this 
material is described. 


METHOD 
Subjects and Setting 


The Ss used in this study were 143 workers in a 
paper converting industry located in a stable college 
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community in upstate New York. This paper mill 
is the only plant of any size in the community and 
obtaining a job there at that time was considered 
by a good many of the people living in the com- 
munity as the beginning of establishment. For the 
most part the workers were semiskilled and skilled, 
with large families and strong feelings about their 
positions with the company, jealousy over promo- 
tions of others, and pride in their own advance. 
The workers ranged in age from 19 to 78 and repre- 
sented about one-third of all of the workers in the 
plant. The 143 were selected from the larger group 
because they were the only ones interviewed for 
whom data concerning memories of pleasant and 
unpleasant experiences as well as personified values 
were available. 

The source material used was based on initial 
interviews with workers who were taken over by the 
new company. The interviewers were representatives 
of the new company and the attitudes toward them 
were extremely favorable because of dissatisfaction 
with the old company. The interviewers, in addition 
to the senior author, were graduates of Washington 
University with a minimum of 4 yr. of interviewing 
experience. The interview was structured in the 
following sequence (Meltzer, 1950): 


1. Directed questioning about work, family, per- 
sonal story, and present conditions or problems. 

2. Recall of the most outstanding pleasant experi- 
ence with question, “And now tell me what comes 
to your mind when you think of the most outstand- 
ing pleasant experience of your life.” 

3. Recall of other outstanding pleasant experiences. 

4. Recall of the most outstanding unpleasant 
experience. 

5. Recall of other outstanding unpleasant experi- 
ences. 

6. Questions concerning personified values. 

(a) “Who is the greatest person that ever 
lived? Why?” 
(b) “Who is the greatest person living? Why ?” 

7. Questions concerning personified ideal or aspira- 
tion: “Of all the people you’ve ever seen, heard about 


or read about, whom would you rather be like? 


Why ?” 


Memory optimism was determined by predomi- 
nance of pleasantness over unpleasantness in the 
experiences given by the worker, memory pessimism 
was determined by the predominance of unpleasant- 
ness over pleasantness, and indifferentism by an equal 
number of responses. 

Work competency was based on the rating by the 
interviewers at the end of the interview. The inter- 
viewers had available for use the ratings of the 
respective supervisors. Scores for various ratings 
werevas follows:\1 for Dsi2, D-+-;\3, ©; 4, Gao mer 
OB oes ge AGecweAGiss 

Autonomy was determined on the basis of per- 
sonal and work histories given by the worker in the 
interview, and a content analysis. The worker was 
given a plus (+) in autonomy if the histories 
contained statements indicating adequate security and 
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adjustment; minus (—) indicated insecurity and mal- 
adjustment. A zero (0) was given if no clearer assess- 
ment could be made. A second judge made inde- 
pendent ratings of the same factors with the resulting 
correlation of .6 between the two. (A 3X3 con- 
tingency table was used with the maximum cor- 
relation being .82.) 

Interpersonal competency was also determined by 
a content analysis. A worker was given a + in 
interpersonal competence if the histories contained 
statements indicating that the worker got along well 
with others; if there were statements indicating a 
history of poor relations with others, a — was 
given; and a O was given if no clearer judgment 
could be made. The rating of the second judge cor- 
related .64 in this measure. (A 3 X 3 contingency 
table was used with the maximum correlation 
being .82.) 

Personified values were based on reactions to 
personifications of greatest ever lived, greatest living, 
and model.? The predominant value was assigned to 
each S, and of the seven values assigned only the 
four with sufficient frequency are reported. The 
ratings of the second judge correlated .68 for the 
two most frequent values. 


RESULTS 


Relationship of Memory Optimism to 
Autonomy and Interpersonal 
Competence 


The workers were divided into two groups 
according to the memory optimists index; 
those with an index of +2 and above were 
put in the upper group (more optimistic). 
Those with an index of +1 and below were 
put in a lower group (less optimistic). The 
same procedure was followed in dividing the 
group for autonomy. Those who rated + were 
considered in the upper group here considered 
and those who rated — were put in the lower 
group. Those who scored 0 were not con- 
sidered. (Since the middle group was there- 
fore excluded in the calculations of the chi- 
square values for both the autonomy and 
interpersonal adequacy ratings, the total NV 
is somewhat less than 143 in these cases.) 

Table 1 gives a 2 X 2 comparison between 
the optimism and autonomy of the workers. 
The chi-square value of 10.97 (p < .001) and 
the corresponding phi coefficient of .35 indi- 
cate that those who have greater memory 
optimism also tend to be more autonomous or 
tend to be more secure and well adjusted. 

2 For details of procedure in the use of projective 


interviewing of which this is a phase, see Meltzer 
(1950). 
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TABLE 1 
RELATIONSHIP OF Memory Optimism TO 
PERSONALITY VARIABLES 
Group 
Optimism 
Factor group Upper Lower $ x 
factor | factor 
n n 
Autonomy* Upper 24 12 ealhateriaee 
Lower 18 39 35 | 10.97 
Interpersonal Upper 27 i9 | 02 66 
competency? Lower SY) HET | a 
—— | ————__—— 
aN = 93. 
bN =111. 
*¥p < .001. 


The workers were divided into two groups 
according to interpersonal competency. Those 
who rated + were placed in the upper group 
and those who rated — were placed in the 
lower group. A 2 X 2 comparison of memory 
optimism and interpersonal competency is 
given in Table 1. The nonsignificant chi- 
square value of .66 and the correspond- 
ing phi coefficient of .02 must be inter- 
preted as failing to support any relationship 
between memory optimism and interpersonal 
competence. 


Relationship of Work Competency to 
Autonomy, Interpersonal Competency, 
and Age 


To compare these variables the workers 
were divided into upper and lower groups for 
each factor. For the work competency vari- 
able, those with a C+ and above were put 
in the upper group, and those with C and 
below were put into the lower group. For the 
age variable, those above 40 were put into 
the upper group and those below 40 were put 
into the lower group. The divisions for the 
other factors are the same as given in Table 1. 
The comparison of all of these variables was 
made in a 2 X 2 fashion and the results are 
given in Table 2. Chi-square values were all 
significant and show that the more competent 
workers also tend to be more often memory 
optimists (¢ = .19, p < .05), tend to be more 
autonomous (¢ = .40, p < .001), tend to be 
more competent in interpersonal relations 
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(¢ = .23, p< .01), and tend to be younger 
(oe 20 ote One 

All in all, what the results reported in 
Table 2 indicate is that the more competent 
people rated higher in performance (work 
competence), also were more autonomous, 
rated higher in interpersonal competence, 
were younger, and were more likely to be 
memory optimists. 


Memory Optimism and Personified Values 


After the group of workers was rated ac- 
cording to dominant personified values in 
their lives, it was found that economic and 
political values classified nearly all of the 
workers. With one exception, then, the sta- 
tistical treatment of the data employed these 
two divisions. The same divisions into upper 
and lower groups were made for the other 
variables as were given in the previous tables. 
The comparison of these variables was also 
made in a 2 X 2 fashion and the results are 
given in Table 3. This table shows that those 
workers who were classified as being more eco- 
nomic minded as compared to those who were 
more political minded were less often memory 
optimists (not significantly so), were rated 
as less competent on the job (¢=.19, 
p< .01), and tended to be older (¢ = .20, 


TABLE 2 


RELATIONSHIP OF WORK COMPETENCY TO 
PERSONALITY VARIABLES 









































Work Group 
compe- 
oe tency | Upper| Lower $ x? 
group | factor | factor 
n n 
Autonomy® Upper 32 17 
Lower ln 33 .40 14°505* 
Interpersonal Upper 36 25 931 5.93% 
competency? | Lower 18 32 ‘ 
Age® Upper | 32 42 
Lower |, 44: |. 25 ol) oe 
Memory Upper 38 36 a 
optimism® Lower 20 49 oO Sel) 
aN = 93. 
bN = 111. 
oN = 143. 
*p < .05. 
** p < .001. 
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TABLE 3 


Memory OPTIMISM AND PERSONIFIED VALUES 






































Group 
Factor Value Upper | Lower b x2 
factor | factor 
n n 
Optimism® | Economic 26 38 
Political 25 32 ns 
Work Economic 25 39 3 
compe- Political 33 24 .19 | 4.28 
tency® 
Age® Economic 40 23 
Political 25 |33: |seeseme 
Age> Theoretical 1 5 “ 
Religious 6 0 .84 | 8.57 
aN = 121 
bN = 12 
*p <.05 
kD < 01 


p< .01). This table also shows that those 
few workers who were more religious minded 
as compared to those who showed more theo- 
retical interests tended to be older (p < .01). 
The phi coefficient for this comparison is .84 
and the small number of cases needs to be 
taken into account for interpretation of this 
figure. 


DISCUSSION 


The purpose of this paper was to study the 
relationship between memory dynamics, per- 
sonality, and performance variables in a field- 
study situation with a group of 143 workers 
as Ss. The memory dynamic under considera- 
tion was the degree of memory optimism, the 
performance variable was the competency 
rating of the worker, and the personality 
variables included autonomy, interpersonal 
adequacy, and the value system of the 
workers. The age factor was also considered. 

Looking first at memory optimism, a num- 
ber of factors seem to be correlated with the 
degree of such optimism. The workers more 
optimistic about their past experiences also 
tend to be judged as more adequate on the 
job and more autonomous than do the less 
optimistic workers. It would seem from this 
that the success on the job for the workers 
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of this community is a very important factor 
in their memory optimism—or at least the two 
go together. The role of this factory in the 
community would seem to bear this out, for 
getting a job at this factory is something 
looked forward to by many in the area. The 
relation between autonomy and optimism is 
also a logical one and would seem to indicate 
a positive outlook on the part of these work- 
ers. In other words, those who feel more 
secure and adjusted can view their past posi- 
tively and with less bitterness—thus showing 
memory optimism. This is in accord with 
Purcell’s (1952) finding that relative frequen- 
cies of joy and fear memories are significantly 
related to security feelings. 

Looking at the factors related to the rated 
job adequacy of the workers, some interesting 
relationships are also present. Those who are 
rated higher on job adequacy also tend to be 
more autonomous and interpersonally compe- 
tent, tend to be younger, and tend to be more 
politically than economically oriented. Aside 
from actual ability, which was held constant 
in this study, there would seem to be a syn- 
drome for the adequate worker. Good adjust- 
ment and the ability to get along well with 
others would certainly enhance the productiv- 
ity of the worker, as this and many other 
studies bring out. As far as the younger 
worker’s being more productive or more ade- 
quate, this would probably be a factor of the 
type of work required in this factory. The 
skills required to run the machines are not 
very complex, so there would not be an in- 
crease in job adequacy with more practice, 
whereas actual motor skills have been shown to 
decrease with age. The value system for the 
adequate worker is also an interesting finding. 
It would seem that the better worker is not 
working just for the money nor is he con- 
cerned primarily with increased pay (eco- 
nomic value), but is more interested in de- 
veloping his skills and looking forward to 
advancement (political or power value). Such 
a person would also tend to look for ways 
of improving production and gaining prestige 
in the eyes of his superiors rather than con- 
centrating on or complaining about his salary. 
This work attitude would tend to favorably 
influence the competency rating of the worker. 

These findings remind one of statements 
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made by Dobzhansky (1967) in an article in 
Science on “Changing Man”: 


Optimists believe that ours is the best of all pos- 
sible worlds. And pessimists are those who fear that 
the optimists are right. This is a flippant, but valid 
statement of truth. Optimism is often a result of 
ignorance of cold and unwelcome realities. There is, 
however, another kind of optimism, which is pes- 
simism surmounted. The world is far from perfect, 
but it is not unalterable. I am tempted to call this 
evolutionary optimism [p. 409]. 


What he calls “evolutionary optimism,” 
Meltzer and Ludwig (1967) have called 
realistic optimism. They reported responses 
of some people who gave no unpleasant ex- 
periences whatever, but whose life stories in- 
dicate them to be eternal or wishful optimists 
(who have repressed unpleasantness) rather 
than realistic optimists. Realistic optimists 
face unpleasantness as well as pleasantness 
as realities they react to and profit from. It 
is not surprising, therefore, to find a positive 
relationship between memory optimism and 
work competence as well as work competence 
and personality variables. 

Looking specifically at the factors related 
to the value system of the workers, we have 
already seen that the political value is related 
to job adequacy. There are also significant 
differences in values according to the age of 
the workers. The older workers tend to be 
more economic and religious minded, whereas 
the younger workers tend to be more political 
and theoretical minded. Looking first at the 
older workers, it would seem that the threat 
of dependence that comes with the approach 
of retirement brings about a greater emphasis 
on the monetary value. Also the reality of 
death and the end of the existence in this 
world seems to bring about a greater emphasis 
on the religious values of life. The younger 
workers, with a life ahead of them, tend to 
look more for power and prestige, and with 
the threat of death more remote tend to 
concentrate on contemporary living rather 
than on religious values. Thus, in general, it 
would seem from this study that, by the 
use of a more clinical approach in the inter- 
viewing process, knowledge concerning the 
nature of memory optimists in relationship 
to performance ratings and personality vari- 
ables and knowledge concerning the differen- 
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tial characteristics of more competent workers 
are obtainable. It must be added, however, 
that such a “critical approach” does add 
elements of subjectivity to the study. 
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RELATIONSHIP OF KUDER PREFERENCE RECORD 


VERIFICATION SCORES TO ADJUSTMENT: 
IMPLICATIONS FOR VOCATIONAL DEVELOPMENT THEORY 


FRANK A. NUGENT 


Western Washington State College 


High school males with inappropriately low Verification (V) scores on the 
Kuder Vocational Preference Record (KPR) were compared to high school 
males with acceptable V scores on the California Psychological Inventory (CPI) 
to investigate whether personal conflicts might be contributing to low V scores. 
Intellectual capacity and testing conditions were controlled. Incorrectly 
marked or incomplete inventories were eliminated. The final N was 256 boys 
with acceptable V scores and 83 with inappropriate ones. Students with 
acceptable V scores showed significantly higher mean scores on 15 of the 18 
CPI scales. Thus, the V score might be considered 1 index of personal and 
social maturity. Assumptions that vocational choice is an integral part of total 
personality development and that vocational maturity relates to total maturity 


received some support. 


The purpose of this investigation was to ex- 
plore the relationship of the Verification (V) 
score of the Kuder Vocational Preference 
Record (KPR) to adjustment. 

Kuder developed the V scale as a check on 
the confidence that can be given to a stu- 
dent’s answers on the interest inventory. In 
the KPR manual he discusses possible reasons 
for V scores which fall outside the range 
(38-44) which he has conceived of as ac- 
ceptable for meaningful interpretation of in- 
terest profiles (Kuder, 1957). Scores above 44 
he points out are usually a result of not 
understanding directions since 44 is the high- 
est possible score. The reasons given for 
scores below 38, however, are varied and 
less clear cut. The explanations for low devi- 
ance include not only the possibility of the 
student marking the answer sheet incorrectly, 
but also the possibilities of reading difficulty, 
disturbing test conditions, physical illness or 
emotional upset, careless or insincere re- 
sponses, or an atypical interest pattern. 

Students with V scores below the accept- 
able range, who were tested under adequate 
conditions, who were technically accurate in 
test taking, and who had sufficient intelligence 
to comprehend the vocabulary of the inven- 
tory, are the Ss of this study. The focus of 
this investigation then is whether emotional 
factors are contributing to the inappropriately 
low V scores. 


Support for the contention that a low V 
score is an indication of emotional conflict 
is limited and predominantly conjectural. 
Kuder’s speculations about the relationship 
of the low V score to emotional factors have 
been echoed by some counselors operating on 
clinical hunches and observed behavior of 
counselors, but no studies relating personality 
characteristics or adjustment process to V 
scores of the Kuder profile are in the litera- 
ture. 

However, related assumptions that voca- 
tional choice is an implementation of the 
self-concept and an integral part of total 
personality development and that vocational 
maturity relates to total maturity have been 
important concepts of vocational development 
theory during the past decade (Super, 1957). 
If these assumptions are valid, then it is 
conceivable that high school boys who could 
not meaningfully respond to the choices of 
the KPR might be reflecting confusions of 
self-image due to distorted perceptions of 
themselves. Also, it is possible that conflicts 
with others significant in the environment 
might have led to inner tensions which ren- 
dered them less personally or socially mature 
than the average high school boy. In these 
contexts the V score of the Kuder might be 
considered one index of personal and social 
maturity. 

Based upon Kuder’s conjectures, the clini- 
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cal hunches of counselors, and the assump- 
tions of vocational development theory, the 
following hypothesis was formulated: High 
school boys with V scores on the Kuder 
Vocational Preference Record within the ac- 
ceptable range (V-AR) will show more favor- 
able scores on a personality inventory than 
will high school boys with V scores not in the 
acceptable range (V-NAR). 


METHOD 


Kuder Vocational Preference Records (KPR) and 
California Psychological Inventories (CPI) were 
given in a 4-yr. high school to all males except those 
in special education classes. The town has a popu- 
lation of approximately 10,000 and is situated in 
the northwest region of the United States. The in- 
vestigator supervised all the test taking to assure 
optimal testing conditions. 

After eliminating individuals with IQs below 90, 
with incorrectly taken inventories, and with dis- 
simulated CPI profiles (Gough, 1964), the final NV 
was 256 boys with V—AR scores and 83 with V-NAR 
scores. 

Mean scores on each of the 18 subscales of the 
CPI were obtained for both groups, and ¢ tests 
for significance of mean differences were obtained. 


RESULTS 


In Table 1 the frequency distribution of 
these scores is shown. As can be seen from 
Table 2, high school boys with V—AR scores 
had significantly higher mean scores on 15 
of the CPI subscales when compared to high 
school boys with V—NAR scores. The hy- 
pothesis was given considerable support. 

Generally, on the basis of these signifi- 
cantly different mean scores, individuals with 
V-AR scores when compared to individuals 


TABLE 1 
FREQUENCY DISTRIBUTION OF V-NAR KPR Scores 








Score f 
16-17 2 
18-19 0 
20-21 1 
22-23 2 
24-25 4 
26-27 5 
28-29 3 
30-31 9 
32-33 10 
34-35 15 
36-37 32 
Total 83 
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TABLE 2 


COMPARISON OF SCORES ON THE CPI BETWEEN HIGH 
ScHoot Boys wita V-AR KupeErR ScorRES AND 
THOsE with V-NAR KupEr Scores 

















Scale Hordern g} CPT SD a 
results | scores 
Measures of poise, ascendancy, and self-assurance 
Dominance V-AR 45.42 | 12.45 | —2.91** 
V-NAR | 40.94 | 11.19 
Capacity V-AR 42.06 | 10.63 | —3.39** 
for Status V-NAR | 37.41 | 11.48 
Sociability V-AR 47.29 | 10.57 | —4.07** 
V-NAR | 41.94 | 9.83 
Social Presence V-AR 51.78 | 10.74 | —1.48 
V-NAR | 49.76 | 11.00 
Self-Acceptance V-AR 52.52 | 10.56 | —183* 
V-NAR | 50.16 | 9.20 
Sense of V-AR 40.69 | 13.28 | —5.78** 
Well-Being V-NAR | 30.87 | 13.90 





Measures of socialization, maturity, and responsibility 























Responsibility V-AR_ | 43.78 | 10.49 | —7.36** 
V-NAR | 33.37 | 11.86 
Socialization V-AR 50.94 | 10.49 | —7.51** 
V-NAR | 40.46 | 12.57 
Self-Control V-AR 41.08 | 10.73 | —3.02** 
V-NAR | 36.96 | 10.81 
Tolerance V-AR_ | 41.73 | 11.48 | —5.57** 
V-NAR | 33.86 | 10.20 
Good Impression | V-AR_ | 39.83 | 9.35 | —2.07* 
V-NAR | 37.33 | 10.03 
Communality V-AR_ | 50.96 | 11.32 | —7.55** 
V-NAR | 38.30 | 17.89 
Measures of achievement potential and 
intellectual efficiency 
Achievement via | V-AR 42.08 | 11.40 | —6.53** 
Conformance V-NAR | 32.60 | 11.71 
Achievement via | V-AR 45.83 | 10.73 | —3.95** 
Independence | V-NAR | 40.54 | 10.10 
Intellectual V-AR_ | 42.83 | 11.91 | —7.55** 
Efficiency V-NAR | 31.05 | 13.55 





Measures of intellectual and interest modes 


Psychological- V-AR 48.82 | 9.90 | —2.68** 
Mindedness V-NAR | 45.49 | 9.56 
Flexibility V-AR 52.03 | 10.18 54 
V-NAR | 52.72 | 10.15 
Femininity V-AR 48.17 | 9.22] —1.31 
V-NAR | 46.70 | 7.71 


Note.—V-NAR = Verification scores not within acceptable 
range; N = 83; V-AR = Verification scores within acceptable 
range, N = 256, 

2 ecu test was used since prediction was directional. 

0. 


DD <.01. 


KUDER PREFERENCE RECORD AND ADJUSTMENT 


with V—NAR scores tend to be more self- 
confident, self-assured, and _ self-accepting 
(measures of poise, ascendancy, and self-as- 
surance); more responsible and more emo- 
tionally and socially mature (measures of 
socialization, maturity, and responsibility) ; 
more adequately motivated, intellectually 
efficient, and resourceful (measures of 
achievement potential and intellectual effi- 
ciency); and more appropriately responsive 
to the demands and needs of others (Psycho- 
logical-Mindedness). 


DIscUSSION 


Certain conjectures arise concerning stu- 
dents with V-NAR scores on the KPR when 
considered within the context of vocational 
development theory. Perhaps some of these 
students were subtly rebelling against the 
choice making demanded by the self-inven- 
tory as part of a general negative reaction or 
apathy to what they consider unreasonable or 
unattainable expectations from the environ- 
ment. Possibly with some students an impasse 
in decision making occurred because of unre- 
solved conflicts between their own inclinations 
and contradictory demands of parents and 
other authority figures. Such conflicts could 
contribute to self-doubts and confused self- 
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perceptions with concomitant immobilization 
in making choices. 

One further speculation is that some of 
these socially immature and personally in- 
effective students might not have had the op- 
portunity to explore their interests or the 
world of work. In these situations, lack of 
experience or nalveté could be contributing to 
low V—-NAR scores. 

Counselors should be alert to the possibility 
that for some individuals low V-NAR Kuder 
scores may indicate pervasive personal prob- 
lems of which difficulty of vocational interest 
choice is a part. If the broader areas of con- 
flict are resolved, these individuals might 
then be able to evaluate their interests more 
realistically or might feel freer to system- 
atically explore occupational fields consistent 
with their interests and abilities. 
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An instrument was developed to measure managerial attitudes and behavior in 
communication-type interaction episodes. This self-recording form produced 
data on both managers’ interaction activities during their working day and on 
how they felt about each episode that took place. The results showed that the 
managers studied were more favorable to self-initiated interactions than to 
other-initiated interactions. The data also showed that managers tended to 
evaluate the interactions they had with their superiors more positively than 
those that they had with their subordinates. The implications of these findings 
for communications within organizations were discussed, and the usefulness of 


the self-recording instrument for future research studies was considered. 


There have been an increasing number of 
studies during the last 10 years that have 
focused upon managers and the managerial 
job (see, e.g., Vroom’s 1965 review). Yet only 
a relatively small proportion of these investi- 
gations have studied the actual behavior of 
managers on their jobs. As Dubin (1962) has 
pointed out, “It is one of the curiosities of 
the literature on managerial behavior that 
we know almost nothing about it in any de- 
tail [p. 12].” The few studies that have 
been made of managerial behavior (e.g., 
Burns, 1954; Carlson, 1951; Davis, 1953; 
Dubin & Spray, 1964; Martin, 1959; Shartle, 
1956) do provide some information about 
what managers actually do on their jobs, but 
they fail to enhance our knowledge of the psy- 
chological dimensions of such behavior. That 
is, they supply little or no data on why man- 
agers behave in a certain manner or on how 
managers react to different behavior episodes. 

The main purpose of the present study was 
to explore the possibility of gathering data 
that will link certain kinds of managerial at- 
titudes to specific behavioral episodes. In 
short, it was expected that if attitudes can be 
more closely linked to behavior then both 
the attitude data and the behavior data will 
become more meaningful. 

Previous studies of managerial behavior 
have shown that managers spend a high pro- 
portion of their time talking to other people. 
Burns (1954) reports, for example, that man- 


agers spend 80% of all their time at work 
talking to others. This suggests that the study 
of the interaction episodes which take place 
during a manager’s workday may be a fruitful 
place to begin studying some of the sub- 
stantive issues concerned with managers’ job 
attitudes. Furthermore, the study of these 
episodes also should contribute to an under- 
standing of the communication process in or- 
ganizations, particularly with respect to the 
questions of why certain communication pat- 
terns exist and why communication problems 
develop. 

There are at least two important dimen- 
sions to consider in studying managers’ reac- 
tions to interaction episodes. The first of 
these has to do with who initiates the inter- 
action. Psychologically, there would seem to 
be a significant difference between a situation 
where a manager initiates an interaction and 
one where he has the interaction initiated for 
him by someone else. In the former situation 
the manager is likely to be in control of the 
situation and the interaction is likely to be 
held to satisfy his needs. In the latter case 
just the opposite may be expected to occur, 
and the manager is likely to be in a situation 
where he has less control and where the meet- 
ing is designed to fit someone else’s needs. If, 
as is suggested by the research on small 
groups (Collins & Guetzkow, 1964), people 
generally feel more positively about those in- 
teractions that satisfy their needs and in 
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which they exercise control, then self-initiated 
interactions should be more positively evalu- 
ated than other-initiated interactions. 

A second relevant dimension in determining 
a manager’s reaction to an interaction episode 
concerns the position level of the other per- 
sons involved in the interaction. However, it 
is not immediately obvious as to whether an 
individual should typically feel most posi- 
tively about an interaction with his subordi- 
nates or with his superiors. Arguments can be 
advanced for either prediction. When he talks 
to a subordinate, a manager is clearly in a 
higher status and higher power position, and 
this could put him in control of the situation 
and thereby create conditions that may lead 
him to feel positively about the interaction. 
A study by Hurwitz, Zander, and Hymovitch 
(1953), however, indicates that high-status 
and high-power individuals may tend to feel 
less positively about interactions than will 
low-status and low-power individuals. This 
study found that, after interactions, low- 
power individuals liked high-power individ- 
uals better than high-power individuals liked 
lower-power individuals. This suggests that 
superiors interacting with their subordinates 
might evaluate the episode less positively 
than would the subordinates, despite the fact 
that the former have more control in the 
situation. 

In summary, the purpose of this study was 
both to develop and test an instrument for 
measuring managers’ reactions to specific in- 
teraction behavior episodes, and to collect 
data that are intended to answer two substan- 
tive questions about managers’ reactions to 
these interaction episodes: Does the initiator 
of an interaction feel more positively about 
the episode than does the recipient? Does the 
superior or the subordinate feel more posi- 
tively about an interaction episode between 
the two? 


MertTHOD 
Research Instrument 


Each respondent was asked to complete a number 
of self-recording forms. As can be seen from Figure 
1, the form asks for factual information about a 
behavior episode as well as for the manager’s atti- 
tudes toward the episode. The form itself is adapted 
from one used earlier by Burns (1954). It asks for 
some of the same factual information (some of 
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which was not used in the present study) as the 
form developed by Burns, but it also includes atti- 
tude scales. This addition was crucial in terms of 
the present study since no previous studies have at- 
tempted to measure managers’ attitudes immedi- 
ately after an episode. This was of course done so 
that the attitudes could be directly related to specific 
episodes. The semantic-differential-type scales in- 
cluded were designed to tap the managers’ global 
evaluation of the episodes. 

The instructions for using the forms asked the 
manager to fill out a new form “at the conclusion of 
each behavioral episode that occurs during the work- 
ing day.” The term “behavioral episode” was defined 
as “any situation that has an integrity of its own” 
(e.g., a meeting, a telephone conversation, etc.). The 
manager was told that “at the end of the day, you 
should have a number of completed forms that, 
when considered together, will give an accurate pic- 
ture of how you spent your time during the day.” 
The instructions went on to explain each of the indi- 
vidual questions on the form, It was pointed out 
that in answering the first five questions it might be 
necessary to check more than one alternative. The 
managers were cautioned to check the words “su- 
periors, subordinates, or peers” only when the in- 
teraction involved someone who was actually a 
member of their organization. Finally, for Item 6, 
the attitude item, respondents were instructed to 
“rate the episode on each of the five scales provided.” 


Sam ple 


The sample included 105 middle and lower level 
managers from five organizations. Thirty-four of 
these managers were middle level and 71 of them 
were lower level managers. 

The lower level managers were on the lowest level 
of management in their organizations and were gen- 
erally the first line of supervision; in some cases in- 
dividuals in this classification had few supervisory 
duties but were considered management individuals 
by the organization employing them. The middle 
level managers were in positions above the first level 
of supervision, but below the vice-presidential, com- 
pany officer, or major departmental head level. 

One of the five organizations studied was a large 
manufacturing plant with over 3,000 employees. 
Fifty of the managers studied came from this organi- 
zation. The other 55 managers studied were em- 
ployed by four local social service agencies. Included 
among the sample of social service managers were 
the directors and managers of a YMCA a visiting 
nurse service a girl scout council and a city depart- 
ment of health. The 105 managers studied repre- 
sented all the managers employed by the social 
service organizations and a random sample of about 
40% of the lower and middle level managers em- 
ployed by the manufacturing firm. A comparison 
between the demographic characteristics of the sam- 
ple and the demographic characteristics of other 
samples of middle and lower level managers (e.g., 
Porter, 1961, 1962) revealed good comparability with 
but one exception. Thirty-one of the managers in 
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TIME 
FROM TO 
1. TYPE OF CONTACT Ze 


Letter or Memo 
Discussion in Group 
Discussion with one 
individual 
Telephone 

Records or Reports 
Other 


TYPE OF ACTIVITY 4. 


Production/Operations 
Sales/Distribution 
Personnel/ Indust rial 
Relations 

Public Relations/ 
Advertising 

General Administration 
Research and Development 
Engineering 
Finance/Accounting 
Other 


PURPOSE OF EPISODE 


Information or Advice 


Giving 
Receiving 


Creating 


ATTITUDE TOWARD EPISODE 


Valuable 
Dissatisfying 3s Dai Paces ec 


- 


Boring 2) Seca mes 


Precise Smashe syy co aesmncms 


Challenging SRS Sis easier 


Fic. 1. Copy 


the present sample were females, while none of the 
other samples contained female managers. The fe- 
male managers all worked for the social service 
organizations. Because of this and because of certain 
organizational differences between the manufacturing 
firm and the social service organizations, it was de- 
cided to analyze much of the data from these groups 
separately in order to determine if comparable find- 
ings would appear in two rather diverse populations. 


Procedure 


Meetings with the managers in each organization 
were held at the beginning of the workday on Mon- 
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OTHERS INVOLVED 


POSITION OF OTHERS INVOLVED 


Superiors 

Subordinates 

Peers 

Outsiders in Upper Management 
Outsiders in Middle Management 
Outsiders in Lower Management 
Other Outsiders 

None 


CONTACT INITIATED BY 


Self 
Other 


Instructions or Decisions 


Worthless 
Satisfying 
Interesting 
Vague 

Not Challenging 


Self-Recording Form. 


day. All 105 managers who attended the meetings 
agreed to participate in the study. A researcher first 
gave each participant a copy of the instructions for 
completing the self-recording form and a sample 
form. He then went over the instructions carefully 
with the managers and pointed out that the research- 
ers were particularly interested in getting a form 
completed for each behavioral episode. 

Questions were then invited from the managers 
with respect to the instructions given. Most of the 
questions were concerned with when it was appropri- 
ate to fill out a self-recording form. A number of 
specific incidents were discussed as examples. The re- 
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searcher continually stressed that he wanted to be 
sure that the completed forms would give a good pic- 
ture of the interaction patterns of each manager dur- 
ing the week in question. (It had been arbitrarily 
decided in advance that, because of their frequency 
and highly specialized nature, managers’ contacts 
with their secretaries would not normally be in- 
cluded.) 

Each manager was then given 150 forms and five 
stamped envelopes addressed to the researchers at 
Yale University. The managers were told that they 
were to complete the self-recording forms for 1 wk. 
and that they were to begin as soon as the meeting 
ended. They were also told that at the end of each 
day they should put the completed forms in an 
envelope and mail them directly to the University. 
Thus, each manager was expected to mail five 
groups of completed forms to the University, one for 
each day of the study. This particular procedure was 
adopted for several reasons. First, it was hoped that 
this would emphasize to the managers the im- 
portance of completing the forms immediately after 
the episodes occurred. It also provided the research- 
ers with a record of when the episode took place. 
Finally, it served to emphasize that this was not a 
company-sponsored project and that the company 
was not trying to exercise surveillance over their 
activities. 


RESULTS 


A total of 4,390 completed self-recording 
forms were obtained. The mean number com- 
pleted (for 5 working days) by managers in 
the social service organization was 43.5, and 
the mean for managers in the manufacturing 
company was 39.9. Every manager completed 
at least 7 forms; the most that any manager 
completed was 112.1 Table 1 shows the 
frequency of the different types of contacts 
in the two samples. As can be seen from the 
table, the managers in both samples reported 
a large proportion of their episodes involved 
verbal interaction with others. Only 11% of 
the episodes of the managers in the manufac- 


1 The relatively wide range in the number of com- 
pleted forms returned for the 5-day period raises 
the question of whether some respondents were 
neglecting to report some of their episodes. One 
way to check this, of course, would be to have 
observers on hand in the work situation to record 
the factual occurrence of interaction episodes. This 
was not feasible in the present investigation and 
ordinarily would not be feasible in most such investi- 
gations if any very large number of respondents were 
included in the sample. In the present study other 
available evidence—the fact that most managers’ 
forms accounted for their total time during the 
working day—indicates that a high percentage of 
episodes were being reported. 
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TABLE 1 


PERCENTAGE OF Types OF CONTACTS REPORTED 











% manufactur- 


Type of contact ing company 


% social service 





Groups 33 14 
Individual 40 41 
Telephone 14 28 
Letter 6 8 
Records 5 10 





turing company and 18% of the episodes of 
the social service organization managers did 
not involve verbal interaction. 

Of the total episodes reported for both 
groups, 74.9% involved contact with some- 
one within the managers’ own organizations. 
Table 2 shows the percentage of these within- 
own-organization contacts that were with 
superiors, peers, and subordinates. The man- 
agers reported that they generally tend to 
interact most frequently with their subordi- 
nates. This is not surprising since there are 
undoubtedly more subordinates for these 
managers to interact with than there are su- 
periors. Burns’ (1954) data also show this 
general tendency. 

Table 3 shows the percentage of the con- 
tacts that are initiated by self, both, and 
others. The data show a relatively even bal- 
ance between self and others, and they show 
that very few contacts are seen as being 
jointly initiated. Thus, in the analysis of the 
attitude effects of who initiates an interaction 
it was decided to focus on self- and other-ini- 
tiated contacts. Additional analyses of the 
data (not shown in the table) indicated that 
the episodes reported were about equally 
spread out during the day from 8 A.M. to 4 


TABLE 2 


PERCENTAGE OF WITHIN-ORGANIZATION CONTACTS 
THAT INVOLVED SUPERIORS, PEERS, AND 
SUBORDINATES 








Y manufactur- 





Contact ing company % social service 
Superiors 24.9 DileD 
Peers 43.2 OLE, 
Subordinates 60.2 58.3 





Note.—These percentages add up to more than 100 because 
more than one type of individual can be engaged in each 
interaction, 
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TABLE 3 
PERCENTAGE OF SELF-, OTHER-, AND BoTH- 
INITIATED CONTACTS 
O7 ‘ 
ane 7o manufacturing} ,, : : 
Initiator company % social service 
Self 50 50 
Both 3 3 
Other 42 47 


P.M, (except for the 12:00-1:00 lunch hour 
period). 

The mean attitude scores toward all the 
episodes each manager reported were com- 
puted for him (Table 4). In this analysis the 
episodes were divided according to who ini- 
tiated the episode and who was involved in it. 
Then the mean of each attitude scale was ob- 
tained for all the episodes reported by a man- 
ager. Finally, the scores for each of the man- 
agers were combined, and the mean attitude 
scores for all participants were obtained. 
Thus, the data presented in Table 4 consist 
of means which were computed from the indi- 
vidual manager’s mean scores. Table 4 also 
reports a composite attitude toward each 
episode, computed by combining the data 
from the five attitude scales. Using the mean 
scores for each manager as the basic data in 
the study has the conservative effect of mak- 
ing N (105) the number of managers rather 
than the number of episodes reported. It also 
allows the responses of each manager to be 
weighted equally regardless of how many 
forms the manager completed. 
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TABLE 4 


Mean AttirupE SCORES FOR EACH OF THE FIVE 
SCALES: INITIATOR AND LEVEL OF 
PERSON CONTACTED 








Contact 

Attitude scale j|Initiator| — 7 
- : Sub- 

Superior) Peer ordinate 
Valuable- Self 6.21 6.12 5.96 
worthless Other 5.87 5.88 5.82 
Satisfying— Self 5.86 5.43 5,93 
dissatisfying Other 5.31 5.46 Jot 
Interesting— Self 5.48 5.47 Doe 
boring Other 5.38 5.41 5.18 
Precise—vague Self 5.76 D0 5.69 
Other Doi) 5.20 5.43 
Challenging—not Self 4.79 4.90 4.59 
challenging Other 4.99 4.75 4.58 
Composite for | Self 5.59 O52 5.43 
five scales Other 5.46 5.38 S27 

















Note.—Higher scores indicate more favorable responses. 
Standard deviations for the means reported in this table typi- 
cally were close to a value of .90. 


The data in Table 4 show, as expected, a 
general tendency for self-initiated contacts to 
be more highly evaluated than other-initiated 
contacts. This tendency was tested for statis- 
tical significance with a sign test (Table 5}. 
For each manager a comparison was made 
between his average scores for his self-initi- 
ated and his other-initiated contacts. The 
results in Table 5 confirm the fact that in 
both samples there is a general tendency 
for the managers to feel more positively 
toward episodes they initiated than toward 


TABLE 5 


NuMBER oF MANAGERS WHO ARE More POSITIVE TOWARD SELF-INITIATED 
THAN TOWARD OTHER-INITIATED INTERACTIONS 























Manufacturing Social service Total 
: company organizations om 
Attitude scale 
=I ad 0 p ste = 0 p St _ 0 p 
Valuable—worthless 36 11 3 01 34 19 2 10 70 30 5 OL 
Satisfying—dissatisfying 31 17 2 .10 oy 22 1 63 39 3 O1 
Interesting—boring 27 19 4 28 24 3 55 43 7 
Precise-vague 29 16 5 LOM oS 15 u O01 67 ot a O01 
Challenging—not challenging 21 26 3 26 ei 2 47 53 5 
Composite for five scales 34 15 1 O01 33 20 2 eLOMING7 35 | 3 01 





Note.—Plus sign indicates more favorable response to self-initiated episodes. Sign tests were two-tailed. 

















MAanacers’ ATTITUDES TOWARD COMMUNICATION EPISODES 437 
TABLE 6 
NuMBER OF MANAGERS FOR WuHomM TuHerR ATTITUDE ScoRES ARE More Positive 
FOR INTERACTIONS WITH THEIR SUPERIOR THAN WITH THEIR SUBORDINATES 
Manufacturing Social service 
company organizations Total 
Attitude scale 7 
es ~--— 

= = 0 p =f 7 0 p ata = 0 | p 
Valuable—-worthless oe 20 8 29 iS 6 05 56 oo 14 05 
Satisfying—dissatisfying 28 20 7 29 14 7 .05 i 34 14 05 
Interesting—boring 33 16 6 05 28 16 6 10 61 32 1 01 

Precise-vague 29 18 8 21 24 5 50 42 13 
Challenging—not challenging 31 wi 7 .10 er 17 6 58 34 13 | 05 
Composite for five scales 36 13 O01 27 20 3 63 33 9 | O01 














Note.—Plus sign indicates more favorable response to interactions with superiors. Sign tests were two-tailed. 


episodes initiated for them by others. The 
only dimension that deviates slightly from 
this tendency is the challenging—not challeng- 
ing one. However, for the total sample, the 
overall effect is highly significant as is shown 
by the results for the five-scale composite 
analysis. 

The data in Table 4 also showed that there 
is a general tendency for managers to feel 
more positively toward contacts that they 
have with their superiors than toward con- 
tacts they have with their subordinates. The 
peers’ contacts seem to fall between the su- 
perior and subordinate contacts. This ten- 
dency likewise was tested for significance with 
a sign test (Table 6). A comparison was 
made for each manager between his mean 
attitude toward all contacts he reported that 
involved his superiors and all those that in- 
volved his subordinates. As can be seen from 
Table 6, a significant majority of the mana- 
gers reported more favorable attitudes toward 
the contacts that they had with their superiors 
than the contacts they had with their sub- 
ordinates. Overall, this tendency appears to 
hold for both samples and for all of the atti- 
tudes studied. 

If the findings that self-initiated communi- 
cations and communications with superiors 
are evaluated highly are combined, this would 
suggest that the most positive attitudes man- 
agers hold should be toward self-initiated con- 
tacts with their superiors. On the other hand, 
the least positive should be toward other- 
initiated contacts with their subordinates. The 


data in Table 4 showed that this was the 
case. There was a significant difference be- 
tween the mean composite score for self-ini- 
tiated contacts with superiors (7 = 5.59) and 
the mean composite score for other-initiated 
contacts with subordinates (# = 5.27) (= 
Sen = .OL)! 

Table 7 presents the composite attitude 
scores (the mean for the five scales) for all 
the “dyadic interactions” that could be found 
in the data. Here, self-recording forms were 
included only where it was clear that forms 
had been obtained from both a superior and 
his subordinate who were reporting on a com- 
mon episode. Thus, in these data, the re- 
ports of both people taking part in the same 
interaction could be compared. The previous 
data, of course, involved looking at the aver- 
age reaction of managers to many different 
kinds of contacts (i.e., groups, dyads, etc.) 


TABLE 7 


Mean AttirupE Scores FOR Dyapic INTERACTIONS: 
INITIATOR AND LEVEL OF OTHER PERSON CONTACTED 





Initiator 
Evaluator = |——_———————_| Grand 
Self Other 
Subordinate 5.78% 5.77 5.78 
Superior SH 5.078 5.42 
Grand M 5.74 5.45 


Note.—Higher scores indicate more favorable attitudes. 
an = 20. 
by = 24. 
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while the dyadic data allow for the compari- 
son of the attitudes of both participants to a 
specific interaction. As can be seen from Table 
7, the initiator of the interaction feels more 
positively toward it than does the recipient, 
and the subordinate in the interaction feels 
more positively toward it than does the su- 
perior. Sign tests were computed between the 
attitude responses of each pair of participants 
and these trends proved to be statistically sig- 
nificant (p< .05). These results, therefore, 
serve to confirm the results reported above 
for the larger analysis which involved all in- 
teraction episodes by the total sample of 
managers. 


DISCUSSION 


Even though this study was essentially ex- 
ploratory, the results have both substantive 
and methodological implications. Relative to 
the former, they provide information for 
analyzing communications in organizations 
and for explaining certain job-satisfaction pat- 
terns that appear in organizations. From a 
methodological point of view, they are sug- 
gestive of the fact that meaningful data about 
attitudes toward specific behavior episodes 
can be obtained by the use of a self-recording 
form of the type employed in this study. 

The finding that managers tend to evaluate 
episodes they initiate more highly than epi- 
sodes initiated for them was anticipated. It 
would seem to follow from three factors. 
First, the initiator of any contact usually has 
the most control in the situation. Second, the 
initiator normally does not institute a contact 
unless he feels that a worthwhile issue that is 
important to him needs to be discussed. Third, 
the initiator normally should feel more re- 
sponsible for its taking place and might, 
therefore, be less likely to describe it nega- 
tively. 

The finding that managers tend to evaluate 
contacts with their superiors more highly than 
contacts with their subordinates is under- 
standable, even though it was_ specifically 
predicted. Part of the reason for this finding 
appears to be the fact that managers have 
relatively fewer contacts with their superiors 
compared to subordinates. Thus, a superior 
contact is relatively more “unusual” than a 
subordinate contact. In addition, the superior 
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has reward power over his subordinates. Thus, 
a superior-subordinate interaction is likely 
to be a more significant event for a sub- 
ordinate than for a superior. Apparently, as 
was suggested by the earlier study of Hur- 
witz et al. (1953), this factor is significant 
enough to overcome any tendency that supe- 
riors might have to evaluate the interaction 
more positively because of their superior con- 
trol and power in the situation. 

This finding also fits with some of the prob- 
lems that are often discussed where superior- 
subordinate communication is involved. Haire 
(1956) and others have pointed out that, 
when talking with their superiors, subordinates 
often tend to overinterpret what is being said 
because they place too much importance on 
everything that is said. On the other hand, 
superiors may tend to do the opposite. The 
data from the present study suggest that this 
is in fact a real problem and that superiors 
do indeed tend to attach less significance to 
superior-subordinate interactions than do sub- 
ordinates. 

Given the results of the present study, it is 
not surprising that superiors often claim that 
they do not know what is going on in their 
organizations, because subordinates fail to 
communicate adequately. The results show 
that the least valued interactions are those 
where a superior evaluates an episode in 
which his subordinate comes to him for a 
discussion. If, as seems likely, the superior 
communicates any of these reactions to his 
subordinates, then it will not be long before 
the subordinate will “learn” to reduce his 
initiation of these interactions. In effect, the 
subordinate is not likely to be reinforced (suf- 
ficiently) for initiating communications and 
may very well decide that it is not worth com- 
ing to his boss with information, since his 
boss is not really interested. 

Turning to methodological implications, the 
data seem to show that the self-recording form 
approach used in the present study is a useful 
way to measure managers’ behavior and their 
reactions to the behavior. As evidenced by the 
large number of forms that the managers 
were willing to complete, and by the results 
obtained, it appears that managers did take 
the study seriously and completed them con- 
scientiously. Many took the trouble to write 
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brief explanatory notes where they felt the 
categories on the form did not capture the 
essence of the contact. Some managers ex- 
pressed the view that filling out the forms was 
an important learning experience for them, 
since the forms helped give an overview of 
the day’s work. From a researcher’s point of 
view, it was gratifying to find that the forms 
could be completed relatively quickly and 
easily and without seriously interrupting or 
distorting the day’s typical work process. This 
approach also made it possible to measure the 
attitudes and behavior of a relatively large 
number of managers for at least a week’s time. 
It would be difficult if not impossible to get 
comparable amounts of data by observation or 
any of the other commonly used approaches 
for gathering this type of behavioral data. 
This instrument could also play an im- 
portant role in developing what have been 
called “psychological” job descriptions. The 
typical job description provides a relatively 
poor behavioral description of the nature of a 
job. The self-recording form makes it pos- 
sible not only to collect better descriptive 
data but also to get at some of the feeling and 
attitude dimensions of the jobs. It can pin- 
point some of the psychologically difficult sit- 
uations and interactions in the job, and, most 
importantly, the collection of such forms 
across a large number and wide range of in- 
teractions facilitates the ijinking of such in- 
teractions to organizational factors. Future 
studies might also look at the behavior and 
attitude patterns of managers who tend to be 
described as being effective and ineffective 
performers. In addition, actual behavior pat- 
terns might be related to the overall job satis- 
faction of the managers. Thus, a significant 
aspect of the present exploratory study is that 
it demonstrates the utility of the self-recording 
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form and thereby suggests a number of future 
studies that can profitably be done in the 
realm of organizational psychology. 
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EMOTIONAL MEANING OF FOUR TYPOGRAPHICAL VARIABLES’ 


ALBERT J. KASTL anv IRVIN L. CHILD 


Yale University 


The influence of 4 typeface variables (angular versus curved, bold versus light, 
simple versus ornate, serif versus sans-serif) on judgments of emotional mean- 
ing was studied in 40 college students. In addition to specific conclusions for 
each mood, it was found that, in general, moods such as sprightly, sparkling, 
dreamy, and soaring tend to be matched to curved, light, ornate, and perhaps 
sans-serif type; while moods such as sad, dignified, and dramatic are matched 


to angular, bold, and perhaps serif type. 


Typefaces used in books or advertisements 
seem often to be chosen to emphasize or con- 
vey certain moods. This research was con- 
cerned with some of the physical attributes 
of type which apparently make mood com- 
munication possible. 

The fact that certain typefaces can convey 
moods has long been established (Berliner, 
1920; Burt, Cooper, & Martin, 1955; Davis 
& Smith, 1933; Ovink, 1938; Poffenberger & 
Franken, 1923; Schiller, 1935); among these 
earlier studies only those by Ovink and Davis 
and Smith make generalizations beyond indi- 
vidual typefaces. Ovink concluded that the 
light-bold dimension was primarily respon- 
sible for the emotional meaning of typefaces. 
Davis and Smith maintained that several ad- 
ditional factors were also responsible: size, 
condensation, slant (italic versus roman), and 
style of type (a complex group of factors dis- 
tinguishing one family of type from another, 
as the difference between Caslon and Chelten- 
ham type). Their pioneering experiment faced 
the complexity of the problem and represented 
a significant advance; yet it has several lim- 
itations: 


1 The authors are indebted to Stephen Dwoskin for 
technical advice on typography and to John D. Car- 
roll for assistance in the statistical analysis. The in- 
vestigation was carried out during the tenure by 
Albert J. Kastl of a National Science Foundation 
predoctoral fellowship. A more detailed statement of 
results, a classified list of the typefaces, and sam- 
ples of their appearance have been deposited with the 
American Documentation Institute. Order Document 
No. 10004 from ADI _ Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress, Washington, D. C. 20540. Remit in advance 
$1.25 for photocopies or $1.25 for microfilm and 
make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 


(a) The categories employed are those tra- 
ditionally used by typographers. It may be 
questioned whether these categories are suf- 
ficiently distinct to ensure that laymen re- 
spond to those features of type which deter- 
mine traditional categorization. 

(6) The results are of uncertain statistical 
significance. 

(c) Only 13 typefaces were used, so that 
the categories are represented very poorly. 
(At the extreme, only one type represents the 
entire category of “expanded.”’) 

A recent study by Tannenbaum, Jacobson, 
and Norris (1964) proceeds from criticism 
similar to ours of earlier work, but is primarily 
concerned with type characteristics different 
from those we consider. We learned of their 
work only after completing ours, and so could 
not design the present study in relation to 
theirs. Later, their findings will be compared 
with our own. 

We chose for examination the following di- 
mensions: 

A. Angular versus Curved. For simple lines, 
an experiment long ago (Poffenberger & Bar- 
rows, 1924) supported the expectation that 
curves readily suggest quiet and gentleness; 
angles, harshness and seriousness. We expected 
a similar relationship with typefaces, both 
because of this experimental finding and be- 
cause of the general use of this dimension in 
visual art. 

B. Bold versus Light. This is the dimension 
stressed by Ovink (1938). Also, artistic usage 
suggests an important role for this variable. 

C. Simple versus Ornate. This typeface di- 
mension seems not to have been studied by 
psychologists, but its general role in art (cf. 
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Barry, 1957 and Fischer, 1961) suggested its 
inclusion here. 

D. Serif versus Sans-Serif. This dimension, 
too, has not yet been systematically studied 
for its possible emotional meaning. Serifs 
partly fill the gaps between characters and tie 
up the bulk of the word in a horizontal direc- 
tion. An obvious hypothesis is that serifs will 
suggest forceful and stable moods. 

The limitations indicated earlier were met 
by (a) having Ss themselves judge the physi- 
cal attributes of the typefaces; (b) using ap- 
propriate techniques to evaluate the statistical 
significance of the effects; (c) employing a 
relatively large number of types incorporated 
in a factorial design. Thirty-two types were 
used. They fall in two sets, each set com- 
prising variation in three of the four dimen- 
sions. Types 1-16 varied in Dimensions A, B, 
and C; Types 17-32 varied in Dimensions A, 
B, and D. In each instance the three dimen- 
sions were combined in an orthogonal design; 
each dimension had two levels, and each com- 
bination of conditions was represented by two 
different typefaces. Thus the same types con- 
stitute 32 samples of type for studying Di- 
mensions A and B, and 16 each for studying 
Dimensions C and D. 


MeEtTHOD 


Subjects 


The Ss were 40 students in introductory psychol- 
ogy, 20 men at Yale University and 11 men and 9 
women at The City College, New York (CCNY). 


Procedure 


Preliminary testing was done to insure the physical 
validity of the dimensions. Eleven judges were asked 
to categorize each of about 40 types we had selected 
for possible use. Approximately 20 were classified as 
‘angular or curved, bold or light, and simple or or- 
nate, and another 20 as angular or curved, bold or 
light, and serif or sans-serif. When disagreement ap- 
peared, the experimenters substituted new types un- 
til adequate agreement was reached (at least 9 of 
11 judges concurring with our initial categorization). 
Thirty-two typefaces, 16 of each group of 20, were 
finally selected and made into slides. Each slide con- 
tained in a single typeface, and always in upper- 
case, the first 16 letters of the alphabet arranged in 
four rows and four columns. The letters were of 
fairly uniform height, but since shape varies between 
and within typefaces, width could not be standard- 
ized. The experimental procedure at Yale and CCNY 
was essentially the same. Twenty Ss were divided 
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into four groups distinguished only by the order in 
which they judged various moods. To each group all 
32 slides were shown eight times, once for judging 
each of eight moods. (Two random orders of pres- 
entation of the 32 slides were employed alternately, 
ie., random order 1 was shown first, then random 
order 2, then 1 again, etc.) Each slide was shown for 
8-10 sec., with a 3-min, rest period after the third 
and sixth showing of the entire set of slides. Group 
I judged the moods in the order a, c, e, g, b, d, f, h; 
Group II in the order c, e, g, a, d, f, h, b; Group III 
in the order e, g, a, c, f, h, b, d; and Group IV in 
the order g, a, c, e, h, b, d, f. Each of the moods is 
defined by a set of related adjectives; the sets em- 
ployed are those used by Hevner (1935) and are 
presented in Table 3. In the text, we will refer to 
each set of mood words by the word which appears 
first in that set, both in Table 3 and in the form 
used by our Ss. We always intend reference, how- 
ever, to the entire set. Moods a, c, e, and g were 
always judged before b, d, f, and h; and order varied 
only within each of these groupings. This arrange- 
ment seemed feasible because the similarity of the 
moods follows around in a circle so that the two 
groupings are as a whole highly similar to each other. 
It does not appear to have affected the reliability of 
the data. For Moods a, c, e, and g there are 16 sig- 
nificant main effects and 11 significant interactions, 
and for b, d, f, and h there are 16 significant main 
effects and 13 significant interactions. 

When Ss viewed a projected typeface, they would 
indicate on a scale from 0 to 5 the degree to which 
they felt it conveyed the mood represented by the 
adjective set under consideration. A rating of 5 
signified that the mood was conveyed very strongly; 
4, considerably; 3, moderately; 2, somewhat; 1, 
slightly; 0, not conveyed at all. 

The only difference between procedures at Yale 
and CCNY was that at Yale the slides were shown 
to the four groups of Ss separately; whereas at 
CCNY they were shown to all Ss at the same time, 
the groups being intermingled in the classroom so 
that no group had a better view of the screen than 
another, 

Finally, at both Yale and CCNY, the slides were 
presented again for about 15 sec. each. The Ss were 
now asked to categorize the physical features of the 
typefaces by selecting one member of each of the 
following pairs as descriptive of that particular type: 
Types 1-16 were evaluated on the dimensions angu- 
lar-curved, bold—light, simple-ornate; 17-32 on the 
dimensions angular-curved, bold-light, serif-sans-serif. 
Fourteen of the Yale Ss and 12 of the CCNY Ss 
participated in this part of the experiment. (Due to 
a procedural error, judgments of the Yale group on 
the simple-ornate and _ serif-sans-serif dimensions 
were not obtained.) 


RESULTS AND DiscuSSION 


If Ss give different mood ratings to type- 
faces of various categories, are they respond- 
ing to the stimulus features intended to pro- 
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vide the basis for discrimination? An affirma- 
tive answer received advance support from 
the fact that preliminary judges agreed well 
with the experimenters’ initial categorization. 
More pertinent, however, would be direct evi- 
dence that Ss themselves are able to discrimi- 
nate on the intended basis. It was for this 
purpose that at the end of the experiment 
the Ss were asked to classify the typefaces. 
The results, in relation to what we intended, 
are indicated in Table 1. For example, as may 
be seen in the top line of the table, for 7 of 
the 16 types classified in advance as bold, 
over 95% of the Ss participating in this part 
of the experiment also classified them as bold; 
for 3 of the types classified in advance as 
light, over 95% of the Ss classified them as 
light; and so on. The other lines indicate the 
number of typefaces for which each lesser de- 
gree of agreement was found. Though agree- 
ment was not perfect, Ss taken as a whole 
differentiated consistently between types in- 
tended to be bold and those intended to be 
light, and between types intended to be serif 
and those intended to be sans-serif. On the 
other two dimensions, some individual types 
are not perfectly discriminated from all those 
intended to contrast with them; yet for each 
dimension the two sets of typefaces taken as 
a whole are very clearly differentiated. 

To illustrate the statistical analysis of the 
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mood ratings we will use the judgments made 
of the first 16 types for Adjective Group a. 
These 16 typefaces varied in Dimensions A, 
B, and C. The effects of these dimensions on 
Adjective Group a were studied in a five-way 
analysis of variance (AX BXCXEXS). 
(E represents the college variable, Yale or 
CCNY; and S, individual Ss). In Table 2 each 
of the main effects and interactions was com- 
pared with a term measuring its variation 
among Ss in order to obtain an F ratio (e.g., F 
= A/AS). Thus for the data from the first 16 
types, an F was obtained for the A effect 
(23.86, p< .001), for the B effect (24.08, 
p < .001), and for the C effect (50.06, p< 
.001). Similarly, for the second 16 types, an F 
was obtained for the A effect (25.03, p< 
.001), for the B effect (48.86, p < .001), and 
for the D effect (8.13, us). 

The large size of these effects deserves em- 
phasis. On Adjective Group a (sprightly), the 
8 bold types among the first 16 had an aver- 
age rating of 1.63, and the light types 2.41. 
An average rating of 1.82 was assigned to the 
angular types and 2.22 to the curved. For 
simple types the average was 1.57 and for or- 
nate types it was 2.47. These are large dif- 
ferences; though the scale went from 0 to 5, 
the mean of all Ss’ ratings on Mood a varied 
among typefaces only from .5 to 3.3, with a 
standard deviation of .78. 


TABLE 1 


CORRESPONDENCE BETWEEN SUBJECTS’ JUDGMENTS OF PHYSICAL QUALITIES OF TYPEFACES 
AND THE EXPERIMENTERS’ PRIOR CLASSIFICATION 








Percentage of Ss"} 
whose judgments} 
fagreed with Es’ 


No. typefaces yielding each percentage of agreement, 
for typefaces originally classified by Hs as 








prior classification ; 
Bold Light Angular 

100 7 3 3 
90 3 8 8 
80 3 ey 3 
70 1 2 
60 1 1 
50 1 
40 1 
30 
20 1 
10 

0 














Curved Simple Ornate Serif Sans-Serif 
2 5 2 6 4 
6 2 3 2 3 
1 
3 
1 1 
2D 
2 1 1 
1 

















Note.—N = 26 for each of the first 4 columns of data and 12 for each of the last 4. Percentages have been rounded to the 


nearest multiple of 10. 
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TABLE 2 
F Tests SHOWING THE INFLUENCE OF THE TYPEFACE VARIABLES (AND THEIR 
INTERACTIONS) ON THE MATCHING OF Moop-ADJECTIVE SETS 
Mood-adjective sets 
Typeface variables 
a | b | c | d | e | f | g | h 
Single variables 
Angular-Curved (A) | 23.86** | 39.78** Pie EO .O8 5:33) 22.04** 7.84 2.83 
2o.03tem \erloeso ss 1.79 5.19 4.81 13 Te Sie Assos 
Bold-Light (B) 24.08** | 14.00* TOU Sona oe. Clare eS Oteee ie 5106 155325" 6.85 
AStSON TOU Re ml 1OIS0 = s\n 3500" 5 | 221085". al2:25% SATO ORemn eel Zr0 0% 
Simple-Ornate (C) | 50.06** | 95.04** O20 mem tS eo Ls S32 .96 112, 19.69** 
Serif-Sans-Serif (D)| 8.13 9.66* 18.46** Salo) 3.02 1.26 Dae ph 0.11 
Interactions 
AXB 18.59** 5.28 Sd.o2meal 400" 2 2.26 1.19 13.86* 
Sail 3.08 1.20 46 5.10 Sou 3.48 70 
Aw C 1.05 7.65 23 48 4.61 Selon 3.48 9.04* 
AXD DSO ma eeoOLOoe 11.81* 96 19 17.34** 16.74** 7.41 
BEX TG 28.61** | 14.40* Sale eC ee 12.85* Seo 13.61* 5.85 
BXD 1.97 16.01** 1.69 19.62** 3.59 38.17** 11.45* 02 
EX SCAB SAO 46 1.90 .09 ARS 2a .06 3.66 47 1.06 
AXBXD S055 6.49 4.78 9.66* 42 .64 .19 | 87 











Note.—When results indicate that mood rating tends to be higher for angular, bold, simple, or serif type, the F is in roman. 
When the mood rating tends to be higher for curved, light, ornate, or sans-serif type, the F is in italics. 


pb <.01. 
** > < .001. 


Table 3 provides a verbal summary of the 
dimensions and interactions which influence 
communication of each mood. The dimensions 
are represented in each instance by that end 
of the dimension which makes for communica- 
tion of the particular mood. For interactions, 
a dimension is represented by that end whose 
presence enhances the main effect. Only those 
dimensions are included for which one of two 
Fs for the main effect was found significant 
at the .001 level (or both at the .01 level) or 
the single F significant at the .01 level. We 
will now supplement the table by summarizing 
another way, reviewing the main effects and 
the interactions of each of the typeface di- 
mensions. 

Curvilinearity clearly tends to convey the 
moods expressed by the five Adjective Sets 


a, b, c, d, and h (sprightly, sparkling, dreamy, 
calm, and soaring). Angularity, on the other 
hand, conveys the mood of Adjective Set f 
(dignified). Thus there is considerable agree- 
ment between the Poffenberger and Barrows 
(1924) study on the feeling tone of lines and 
our findings about the effect of angularity and 
curvature in typography. 

Light typefaces tended to be rated high by 
our Ss on the moods represented by Adjec- 
tive Sets a, b, c, and d (sprightly, sparkling, 
dreamy, and calm). Bold typefaces were rated 
high on the moods expressed by Adjective Sets 
e, f, and g (sad, dignified, and dramatic). 
These findings seem to agree well with the 
Ovink and Davis and Smith studies to the 
extent that we can properly match our moods 
with their somewhat different concepts, Ovink 
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TABLE 3 


TYPEFACE CHARACTERISTICS SIGNIFICANTLY MATCHED 
TO Sets OF Moop-ApDJECTIVES, AND 
SIGNIFICANT INTERACTIONS 








Characteristics entering into 


Set of mood-adjectives — 








Main effects Interactions 
a*—sprightly, cheerful, 1. curved (light,* sans-serif*) 
bright, vivacious, 2. light (simple, curved) 

happy, gay, joyous 3. ornate (bold) 

















b—sparkling, playful, 1. curved (sans-serif*) 
merry, jovial, humor- 2. light (simple,* sans-serif) 
ous, whimsical, fanci- 3. ornate (bold) 
ful, quaint 4. sans-serif | (curved,** light) 
c—dreamy, dainty, deli- | 1. curved (light*) 
cate, light, graceful, 2. light (simple, curved) 
lyrical, poetic, leisurely,| 3. ornate (bold) 
gentle 4. sans-serif | (curved) 
db—calm, serene, sooth- 1. curved (light*) 
ing, tender, longing, 2. light (simple, curved, serif) 
romantic, plaintive, 3. ornate (bold*) 
sentimental 
e—sad, pathetic, mourn- | 1. bold (simple*) 
ful, melancholy, de- 
pressing, gloomy, 
heavy, tragic 
f—dignified, spiritual, 1. angular (simple) 
solemn, sober, serious, | 2. bold (serif*) 


grave, earnest, lofty, 
philosophical 





_ 


(simple,* serif) 
(curved,* bold*) 


g—dramatic, forceful, . bold 
vigorous, martial, 2. serif 
ponderous, emphatic, 
majestic, exalting 





h—soaring, triumphant, 1. curved 
elated, exciting, im- 2. ornate 
petuous, restless, stir- 
ring, spirited 


(angular) 








Note.—When no asterisk appears in the Interactions 
column, the significant main effect (e.g., on the third line of the 
table, ornate) is separately significant at each end of the inter- 
acting variable (i.e., bold and light) at least at the .05 level, 
though more marked at the end named (i.e., bold). 

a The triple interaction for Mood Group a suggests that for 
serif type the effect of boldness is enhanced by angular type and 
for sans-serif type the effect of boldness is enhanced by curved 
type. However, for both serif and sans-serif type the effect of 
lightness is enhanced by curved type. 

6 The first triple interaction for Mood Group d suggests that 
the effect of brightness is enhanced in curved types for both 
simple and ornate types. For ornate type the effect of boldness 
is enhanced in curved types; for simple types there is a slight 
but quite insignificant trend suggesting that the effect of bright- 
ness is enhanced in curved type, for serif type only. The second 
triple interaction suggests that for serif type the effect of bold- 
ness is enhanced by angular type; for sans-serif type the effect 
of boldness is enhanced by curved type. 

* The main effect remains separately significant only for the 
end of the interacting variable named in the Interactions 
column; with the opposite level of the interacting variable 
the main effect of the first variable is not significant at even the 
.05 level, though it is still in the same direction. 

** The significant effect of sans-serif type is maintained only 
for curved type. For angular type the effect is significantly re- 
versed (p < .05). 


noted that (1) the qualities of luxury and re- 
finement are related to low pressure (light 
type), (2) soberness and economy to rather 
high pressure (bold type), and (3) force and 
strength to high pressure (bold type). This 
relationship between adjectives and type cor- 
responds to our Adjective Sets c, f, and g, on 
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which we found light (c), bold (f), and bold 
(g) types, respectively, to be rated high. 
Davis and Smith noted that the qualities of 
delicateness and beauty were related to non- 
bold (medium) type, while strength was re- 
lated to bold type. This relationship corre- 
sponds to our Adjective Sets c and g, on which 
our Ss gave high ratings to light and bold 
types, respectively. 

Examination of the interaction of this di- 
mension with the others suggests that (a) 
significant main effects of boldness are sig- 
nificant only if the type is simple rather than 
ornate, and (0) significant main effects of 
lightness are generally enhanced if the type 
is simple rather than ornate. The fact that the 
main effects of both lightness and boldness are 
greater in simple than in ornate type suggests 
that the present results have some generality 
for standard typography, since simple type- 
faces are usually employed. 

Considering the simple-ornate dimension, 
we find that ornateness is judged to convey 
the moods of Adjective Sets a, b, c, d, and h 
(sprightly, sparkling, dreamy, calm, and soar- 
ing). There are no significant main effects in 
which simple type is rated higher on expres- 
sion of a mood. Examination of the interac- 
tions of this dimension with the others shows 
that although simple type shows no significant 
main effects, it does enhance several main 
effects of other dimensions; ornateness never 
does this. This interaction indicates—to state 
more generally a point anticipated in the 
previous paragraph—that while simplicity by 
itself does not suggest a particular mood, it 
can render more conspicuous other distinctive 
features which do. Ornateness, on the other 
hand, apparently is so striking and distinctive 
in its emotional meaning that it diminishes 
the effects of all other typeface features simul- 
taneously present. 

Studying the serif—sans-serif dimension, we 
see that sans-serif type tends most strongly 
to suggest the moods of Adjective Sets b and 
c (sparkling and dreamy), and serif type the 
mood of Adjective Set g (dramatic). Although 
there are only these three significant main 
effects for this dimension, there are seven 
cases of significant interaction with main 
effects of other dimensions. For Adjective Sets 
a, b, and c (sprightly, sparkling, dreamy), 
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sans-serif type either enhances other main 
effects or is a significant main effect; for Sets 
d, f, and g (calm, dignified, dramatic), serif 
type either enhances the main effects or is 
itself a significant main effect. Comparing 
these two groupings of moods, we see that the 
first suggests freedom and high spirits; the 
second, restraint and seriousness. Apparently 
the serifs, by providing a horizontal linkage, 
facilitate communication of stability and em- 
phasis as was originally hypothesized, and 
their absence has an opposite effect. 

The main effects we have reported are not 
greatly limited by the interactions. In only 
two instances is a significant main effect of 
one variable reversed for one level of a second 
variable: For Adjective Group b (sparkling), 
the significant main effect of sans-serif is actu- 
ally found only if the type is curved; if it is 
angular, serif types are instead rated higher 
on this mood, and this tendency is itself sta- 
tistically significant (p < .05). For Adjective 
Group g (dramatic) the significant main effect 
of curved type obtained with the second 16 
typefaces is actually found only if the type 
has serifs; here a slight reversal not approach- 
ing significance appears if the type is sans- 
serif. 

When we look beyond these two exceptions, 
we find that the interactions do not cast doubt 
on the general validity of the main effects. 
For the 12 interactions marked with an as- 
terisk in Table 3, the main effect of one vari- 
able is separately significant (p < .05) at only 
one level of the second variable, yet at the 
other level the effect still runs in the same 
direction. For the 16 interactions without as- 
terisks in Table 3, the significant effect of the 
first variable remains separately significant 
(p < .05) for each level of the second, though 
the interaction shows the effect to be sig- 
nificantly larger for one level than for the 
other. Thus in a total of 28 cases of signifi- 
cant interaction the direction of the main 
effect is maintained for both levels of the 
second variable and in 2 it is not—evidence 
that the main effects are generally consistent 
in direction. 

There were three significant triple inter- 
actions; only one of them (in Adjective Set 
d) affected the findings presented in Table 3. 
(This interaction indicates that only for serif 
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types is the effect of lightness enhanced by 
curved types.) The fact that 13 of 16 triple 
interactions were insignificant suggests that 
the main effects and first-order interactions 
are on the whole maintained from level to 
level of the third variable. 

Interactions of the college variable (Yale 
or CCNY) with the main effects and interac- 
tions presented in Table 3 were also com- 
puted. There were 88 such interactions; only 
4 of them are significant at the 5% level, and 
of these only 1 suggests a reversal of a sig- 
nificant effect (it does so by so slight a margin 
as not to justify attention in the context of 
general agreement). The general agreement 
between colleges indicates that the findings 
are generalizable to other similar groups. 

Some of these diverse findings can be or- 
ganized into a single general scheme. Adjec- 
tive Sets a, b, and c (and often d and h) form 
a group of moods which our Ss tend to find 
well expressed by light, ornate, and possibly 
sans-serif typefaces. On the other hand, Ad- 
jective Sets f and g (and often e) form another 
group on which instead the angular, bold, and 
possibly the serifed types are rated high. The 
a, b, and c grouping consists of those adjec- 
tives which express happiness, joviality, deli- 
cacy; the f and g grouping expresses dramatic, 
forceful, serious, and philosophical moods. 
Each grouping consists of neighboring sets on 
Hevner’s mood circle, in which adjacency sig- 
nifies similarity. With fair adequacy, the two 
groupings may be characterized as suggesting 
femininity versus masculinity in connotative 
meaning. Viewed in this way, curvilinearity, 
lightness, ornateness, and absence of serifs 
convey femininity, while their opposites convey 
masculinity. This is certainly a helpful gen- 
eralization for keeping the overall tenor of our 
results in mind, and the suggestion that sexual 
meaning is involved may be of considerable 
importance. (In another context, Jones and 
Lepson, 1967, have found black to be asso- 
ciated with masculinity, gray with feminin- 
ity.) A masculine-feminine polarity provides, 
however, only a most general conception of 
results that are really much more complex. 
For example, Adjective Set h (soaring) seems 
on the whole to have more masculine than 
feminine connotation, and yet our Ss rate high 
on it typefaces with characteristics we have 
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just designated as feminine (lightness, curvi- 
linearity, and ornateness). 

The relationships we find are probably 
brought about by many processes. As our 
study provides no basis for evaluating the 
relative importance of one and another proc- 
ess, we will not present here our speculations. 

Our results may be put together with those 
of Tannenbaum et al. (1964) to support their 
conclusion that “there is a substantial basis 
for the use of typography as a code for the 
communication of connotative variations [p. 
72].” Specifically, Tannenbaum et al. provide 
evidence that connotative meaning is influ- 
enced by slant (italic versus roman), by case 
(lower versus upper), and by type family. The 
serif—sans-serif dimension entered their re- 
search post hoc, and apparently they could 
not confidently ascribe any influence to it. 
Our research supplements theirs, then, by 
demonstrating the influence of this dimension. 
In addition, we show the great influence of 
three dimensions with which they were not 
concerned: angular—curved, bold—light, and 
simple-ornate. Tannenbaum et al., on the 
other hand, make another special contribu- 
tion through comparing judgments by ama- 
teurs (comparable to our Ss) with judgments 
by professional typographers and by students 
of typography. In showing general agreement 
among these three kinds of judges, they dem- 
onstrate quite directly that the connotative 
meaning of typefaces can be utilized in com- 
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munication of intended meaning from artist 
to public. 
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This paper deals with the development of a Guttman-type scale of attitudes of 
southern industrial managers toward organized labor and the extent to which 
the results of such quantified statements differ from those of industrial managers 
outside the South. A 7-item scale was obtained. A comparison of the outcomes 
of these items with measures of attitudes of industrial managers outside the 
South indicates the presence of a number of similarities and differences in pat- 
terns of responses related to matters of the role of the union, the problem of 
union shop, collective bargaining, and the process of decision making. 


Writers in the field of industrial relations 
have long been interested in studying the atti- 
tudes held by managerial personnel toward or- 
ganized labor. Attempts to develop quantified 
statements of these attitudes have been 
launched with varying degrees of success 
(Chalmers et al., 1953-54; Derber et al., 
1960; Stagner, Chalmers, & Derber, 1958). 
Most of these attempts have been made in 
large industrial corporations in highly indus- 
trialized regions and the measures devised re- 
flected this situation. No attempts have been 
made, on the other hand, to develop quanti- 
fied measurements of managerial attitudes in 
southern industrial centers.? As a first step in 
the direction of accumulating evidence on this 
region, this paper presents the findings drawn 
from a study of managerial attitudes toward 
organized labor in one southern industrial 
center. 

The experience of the industrial South with 
labor does not conform to many of the find- 
ings of industrial relations in other regions 
(Heberle, 1948; Simpson & Norsworthy, 
1965). At least three factors are responsible 


1 Revised version of a paper read at the annual 
meeting of the Southern Sociological Society, New 
Orleans, April 1966. The paper is based on data 
collected for a doctoral dissertation submitted by the 
senior author to the Department of Sociology, 
Louisiana State University, August i965. 

2The southern region used in this paper corre- 
sponds to the Southeast of Odum (1936) in his 
Southern Regions of the United States. 


for this: First, industrialization in the South 
began later than in other areas of the nation 
with significant consequences for management- 
labor relations; second, the social structure 
of the South, especially patterns of social 
stratification and the racial composition of 
the population strongly influenced the labor 
movement in the region (Heberle, 1959; 
Vance & Demerath, 1954); and third, the 
types of industries “creeping” southward pro- 
duced few craft, white-collar, and professional 
jobs. The result has been an overrepresenta- 
tion of the lower occupational strata (Simpson 
& Norsworthy, 1965). 

To these factors one can add the fact that 
organized labor is weaker in the South than 
in other regions of the country. Both the pro- 
portion of organized labor and the rates of 
unionization of southern workers are below 
those of the nation as a whole. In 1953 the 
proportion of organized labor in the South 
was 17.1% of the total nonfarm labor force 
(compared to 32.6% for the nation). Until 
1953 the average rate of unionization of the 
southern labor force was only 57% of the na- 
tional rate. Although the South increased its 
proportion of unionized labor in recent years, 
it is still considered to be underorganized 
(Roy, 1965). 

The effects of these characteristics upon 
labor-management relations have been the 
subject of a number of studies. Heberle 
(1948) has maintained that the spread of 
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unionization in the South will result in indus- 
trial conflict different from that experienced 
in the North largely because of the tradition- 
ally paternalistic pattern of labor relations in 
the South. Confining his observations to south- 
ern textile industry, Gillman has noted that 
common customs and mutual interests so 
characteristic of this industry have created 
harmonious and sympathetic management- 
employee reliations which may be classified 
as “folksy” (Gillman, 1956). Roy (1965) an- 
alyzed changes in labor management in the 
South in terms of several factors that include 
the growth in influence of absentee-top man- 
agement groups, the shift in the type of asso- 
ciation between management and labor from 
“shirt-sleeved informality toward impersonal 
business relationships,” and the replacement 
of early benevolent paternalism by pseudo- 
paternalism and coercion. 

The foregoing studies clearly indicate that 
employee-management relationships in the 
South are markedly different from those of 
other regions but have recently undergone 
significant changes. Thus earlier forms of 
these relationships were characterized by folk- 
siness, indigenous labor force, and a common 
culture. Benevolent paternalism provided 
these relationships with a strong shield against 
the encroachment of alien ideologies, thus pre- 
serving continuous harmony and cultural soli- 
darity. More recent forms of management- 
labor relations have emerged during the past 
few decades with rapid industrial growth and 
the diversification of products. Basic to these 
new patterns is the growth of absentee owner- 
ship and “the replacement of Uncle Ben with 
college-trained officials” (Roy, 1965). Man- 
agement in the South is now forced to deal 
with labor in terms of policies originating out- 
side the region. Also, the recent industrial 
expansion has brought about a new surge of 
labor efforts to unionize workers of the region 
which has heightened labor’s awareness of its 
self-interest and accelerated the breakdown 
of paternalism. 

Differences in patterns of labor-manage- 
ment relationships between the South and the 
rest of the country are subtle and require a 
great deal of systematic analysis. In addition, 
southern industrial centers are currently ex- 
periencing a variety of types of relationships 
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due to differences in level of industrial growth, 
type of product, and geographical location 
that it would be difficult to speak of an inclu- 
sive pattern. Preliminary attempts must be 
made, however, to reveal some aspects of these 
differences so that more comprehensive studies 
could be undertaken in the future. In this 
paper the field of attitudes of industrial man- 
agers toward organized labor is explored. The 
problem of the study is twofold: (a) to assess 
the kinds of attitudes industrial managers in 
the South hold toward organized labor, and 
(5) to assess the extent to which southern 
managerial attitudes differ from those held 
by managers in other regions. The strategy 
employed to accomplish these tasks was to 
develop a Guttman-type scale of attitudes of 
industrial managers in a southern city and 
then to compare the findings with available 
scales and other indexes of attitudes held by 
management outside the South. Since many of 
the instruments used in other studies con- 
tained items referring to local situations, they 
were not suitable for research in the South. 
For this reason, an attempt was made to de- 
velop a scale which would yield the desired 
data on southern managerial attitudes while 
at the same time being suitable for regional 
comparisons. Given the importance of this 
methodological problem, the technique used 
in scaling and the technical characteristics of 
the scale will be described in some detail. 
The data were gathered from five bulk raw- 
material processing plants located in an ad- 
ministrative-industrial center in the Southern 
United States with a metropolitan area popu- 
lation of about 275,000. These corporations 
have some form of labor contract with either 
an independent labor union or a union affili- 
ated with the AFL-CIO. Seventy lower-level 
officials and 20 top management executives 
employed by these corporations were inter- 
viewed. Although separate interview schedules 
for each of the two levels of management were 
prepared and used throughout the study, 
items making up the scale presented here were 
identical for the two managerial levels. 
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Items of the Scale 


Attitudes of management toward labor 
unions were measured by seven items, each 
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containing four-response categories. These 
categories were assigned weights ranging from 
O for strongly favorable to 3 for strongly un- 
favorable. The items are as follows: 


1. Labor unions not only better their members’ 
economic and social conditions but also con- 
tribute to the welfare of society. 

2. The drive to organize workers is a legitimate 
right for labor unions, 

3. Collective bargaining is a necessary and im- 
portant function of labor unions that should be 
preserved. 

4. Labor unions are reasonable in their drive to 
recruit workers into their organizations. 

5. Labor unions’ demands regarding wages, hours 
of work, and working conditions, are in most 
cases, fair and reasonable. 

6. Labor’s demand for the union shop is a legiti- 
mate right for them. 

7. Labor unions should have a share in making 
decisions in the corporation. 


Scaling Technique 


Using the scalogram board technique (Gutt- 
man, 1950), responses of the two groups of 
managers to the above items were scaled in 
one unit. All items were dichotomized in the 
second trial. The items formed a “true scale” 
since a coefficient of reproducibility of .90 
was obtained, and the errors were randomly 
distributed (Guttman, 1950). The final order 
of the items was as follows: 1, 2, 3, 5, 6, 4, 
and 7. 

The marginal frequencies of the seven items 
are shown in Table 1. Item 7 was retained 
although it failed to meet Guttman’s criterion 
that an item should have 90 to 10% marginal 
frequencies to be included in a 10-item scale. 


TABLE 1 


MARGINAL FREQUENCIES OF THE ITEMS OF THE 
ATTITUDE SCALE OF UpPER AND LOWER 
MANAGEMENT OFFICIALS 





Marginal frequencies percent 





Item* 
Disagree Agree 
1 18 82 
2 28) 77 
3 52 48 
5 58 42 
6 61 39 
4 72 28 
7 93 7 


a Items are listed according to their final order in the scale. 
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TABLE 2 


DISTRIBUTION OF LOWER AND UprpeR MANAGEMENT 
OFFICIALS IN THE ATTITUDES SCALE TYPES 

















Perfect Scale items® Respondents 
scale 
type i{2z}a|s|o]4]: No. % 
DI SetentS tie dix ese 8414) 2.22 
II XeboX yy Meee) Xp Hh 12 13.33 
III Xe) kobe V Xe UX 22 24.44 
IV moe x x | 100) 11.11 
V ie. Ta ee 4 4.44 
VI eek 5 oo 
VII x | 26 | 28.88 
Total 90 | 100.00 











« The symbol x designates disagreement with the respective 
item. 


It was decided that marginal frequencies of 
93 to 7% are close enough to the minimum 
requirement which is, in itself, an arbitrary 
limit. 

Scores in the scale ranged from 0 to 7. Only 
one respondent scored 0 and one other respon- 
dent scored 7. Both respondents were lower 
managerial officials. Upper management offi- 
cials contributed fewer than their pro rata 
errors (11% of the total number of errors in 
the scale). Their scores, however, were dis- 
tributed in a consistent pattern. 

Seven scale types were established with 
respondents ranging from 12% in the first 
scale type (classified as having the strongest 
negative attitudes toward labor unions) to 3% 
in the seventh type (considered to have the 
strongest prounion attitudes). Table 2 shows 
the number of respondents in each scale type. 

The theoretical framework of the scale in- 
cludes the assumption that matters of agree- 
ment and disagreement between labor and 
management can be used to identify manage- 
ment feelings toward labor unions. Matters 
like the role of unionism in industrial society, 
the demands made by unions to improve 
workers’ social and economic conditions, the 
significance of collective bargaining, labor’s 








8 Stressing negative rather than positive attitudes 
toward organized labor is intentional since the paper 
is derived from a study of a set of hypotheses deal- 
ing with the impact of work orientations of man- 
agers upon their negative attitudes toward labor 
unions, 


450 


demand for the union shop, and labor partici- 
pation in making decisions in the corporation 
were considered a valid basis for assessing 
attitudes of both upper and lower manage- 
ment personnel toward labor unions. 

Polarization of attitudes and the likelihood 
of an attitudinal continuum is supported by 
the fact that the majority of the respondents 
agreed with the statement that labor unions 
have a vital role in the growth and welfare 
of modern society (Item 1), and disagreed 
with the statement that unions should have 
a share in making decisions in the corporation 
(Item 7). Between these two extremes, re- 
sponses fell into a gradient pattern with the 
number of disagreements per item decreasing 
gradually toward Item 1 and _ increasing 
toward Item 7. 

The final order of items in the scale reveals 
that respondents considered the existence of 
labor unions to be important to society. They 
also agreed that union efforts to recruit mem- 
bers are legitimate. On the issue of collective 
bargaining, however, responses leaned more 
to the negative side of the scale. Relatively 
high disagreement did not begin until re- 
spondents were asked to express their feelings 
toward the union shop. Disagreement became 
greatest (most widespread) when opinions 
were expressed about the legitimacy of labor 
to have a share in making decisions in the 
corporation.* 

Judging respondents on the basis of their 
scores in the scale, four levels of attitudes 
were established: high negative, moderately 
high negative, moderately low negative, and 
low negative. The distribution of respondents 
in these categories is shown in Table 3. The 
table shows that the majority of respondents 
are concentrated in the two middle categories 
and that the two extremes have almost equal 
numbers of cases. The authors are disposed 
to conclude that southern managers have mod- 
erately negative attitudes toward organized 
labor. 

The distribution of negative attitudes 
among managers of the five corporations is 
shown in Table 4. More than two-thirds of 


4 Using Schuessler’s Test I, it is concluded that the 
observed frequencies of the scale types exceed the 
frequencies expected by chance. See Schuessler (1961) 
for fuller details. 
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TABLE 3 


CLASSIFICATION OF MANAGEMENT BY ATTITUDES 
TOWARD LABOR UNIONS 














Attitude Score No. % 
High negative 7,6 13 14.44 
Moderately high negative} 5,4 38 42.22 
Moderately low negative ee 27 30.00 
Low negative 1,0 12 13.33 
Total 90 100.00 





the managers in Corporation A show low-nega- 
tive attitudes. On the other hand, Corpora- 
tions B, C, D, and E are characterized by 
high-negative attitudes with varying degrees 
of intensity. The striking thing about this 
distribution is the relationship it bears to size 
of the corporation. Specifically, the largest 
corporation has the least negative views while 
the “medium-sized” and small corporations 
have the strongest negative views. While this 
finding may be an artifact of the specific 
union-management relationships in the vari- 
ous organizations, the authors were unable to 
find evidence to support this view. They are 
led to conclude, then, that factors associated 
with size of the organization influence man- 
agerial attitudes toward unions. Specifically, 
the less negative views found in the largest 
corporation may reflect the fact that labor- 
management relations have reached a highly 
stabilized pattern based upon common goals 


TABLE 4 


DISTRIBUTION OF LEVELS OF NEGATIVE ATTITUDES 
IN THE FIVE CORPORATIONS 





























Attitudes toward 
labor unions 
: _| Approx. 
oe no High Low Total 

employees} negative negative 

Noses 9a | Nowlige 
A 2200 6 | 26.09; 17 | 73.91} 23 
B 720 18:95 72.00) > 7 | 28:00 REZ5 
Cc 700 12 | 66.66] 6 | 33.33] 18 
D 125 9.1956.25 | 7 | 43. 7oq eG 
E 100 Gribi 5.00) FZ) |) 25.00 8 
Total 51 39 90 
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and mutual interests, and that leadership of 
both sides has developed specialized forms of 
communication to channel differences between 
them. It is also possible that unions constitute 
less of a threat to managers of large organi- 
zations than is true in smaller corporations. 
On the other hand, it seems likely that the 
“paternalistic” pattern of labor relations dis- 
cussed by Heberle and the subsequent de- 
velopment of “pseudo paternalism” and ‘“‘co- 
ercion” suggested by Roy, is more likely to 
be a strategy employed by the middle-sized 
and smaller corporations. If this is indeed the 
case, then the strong negative views found in 
the middle-sized and small corporations may 
reflect the managerial reaction to the strong 
conflict provoked by labor-management dis- 
agreements in such a system. 


Attitudes and Organizational Commitment 


Previous studies have examined the corre- 
lation between attitudes toward labor unions 
and liking or disliking for the company in 
order to demonstrate the existence of what 
has been called a “common attitudinal cli- 
mate” among industrial workers and man- 
agers. Chalmers and his associates found that 
where industrial workers showed a liking for 
management, they expressed positive feelings 
toward unions and vice versa (Chalmers et al., 
1953-54). On the other hand, tests of such 
relationships among _ industrial managers 
showed an absence of significant association 
(Stagner et al., 1958). 

In the current study attempts were made to 
test the association between attitudes of lower- 
level managers toward labor unions and levels 
of commitment to the employing corporation. 
Organizational commitment was measured by 
responses to seven items dealing with feelings 
of managers regarding their work in the com- 
pany, with their willingness to help the com- 
pany during a crisis, and with their sense of 
identification with the corporation. The scalo- 
gram board technique was used to rank these 
items. In the final trial only five items were 
scalable with a coefficient of reproducibility of 
.90. Because the number of items was small 
and the requirements of a “true scale” were 
not fulfilled, details of the scaling procedures 
are not reported here. 
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The correlation between negative attitudes 
toward labor unions and organizational com- 
mitment indicates that the relationship is not 
significant.® Obviously this finding means that 
respondents who have high negative attitudes 
toward labor unions are not committed to 
their organization. While this may be correct, 
the authors must point out that their measure 
of organizational commitment may have influ- 
enced the outcome of the test. On the other 
hand, they take refuge in the finding of the 
Illini City study which found a “common 
attitudinal climate,” and conclude that man- 
agers who have strong negative attitudes 
toward labor tend to lack strong commitment 
to the organization. 


Discussion 


How do the attitudes of southern industrial 
managers toward labor unions compare with 
the views held by those outside the South? 
Here it is instructive to compare the findings 
of the results of two items of the scale with 
findings of a Gallup poll made public at al- 
most the same time the data of the study were 
collected (Gallup, 1965). The poll asked 2 
national sample: “In general do you approve 
or disapprove of labor unions?” In the Gallup 
sample 23% of the nonunion respondents dis- 
approved of labor unions. In the current study 
of managers 18% disagreed with the state- 
ment: labor unions “not only better their 
members’ economic and social conditions but 
also contribute to the welfare of society.” 

A second question in the Gallup poll asked: 
“Do you think a person should or should not 
be required to join a union if he works in a 
unionized factory or business?” Of the non- 
union respondents in the poll, 56% replied 
negatively. The present sample of southern 
managers elicited 61% negative replies to 
Item 6: “Labor’s demand for the union shop 
is a legitimate right for them.” 

It should be noted that the differences in 
percentages between southern managers and 
nonunion Gallup poll respondents for both 
items is rather small. In both instances the 
95% confidence interval around the southern 
managerial sample proportions includes the 
proportion from the Gallup poll. Although 


57 — —035; p(z—= —417) >.05. 
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the specific wording of the items is somewhat 
different, in both cases the meanings are simi- 
lar. Evidently southern managers do not dis- 
agree greatly with Gallup’s nonunion respon- 
dents. 

This should not be surprising, nor does it 
necessarily contradict Stouffer’s findings that 
Southerners are less tolerant than the rest of 
the nation. Stouffer (1955) found that the 
amount of schooling and type of community 
(rural-urban) of his cross-section respondents 
influenced their level of tolerance. Thus, the 
higher the educational level the more likely 
the individual is to show tolerance of non- 
conformists. Similarly, urban respondents 
tended to score relatively higher on the toler- 
ance scale than rural respondents. In the cur- 
rent study, respondents are urban residents 
and have achieved higher levels of education 
than the “average” Southerner. Thus, the rela- 
tively high proportion of southern industrial 
managers approving of labor unions may re- 
flect a tolerance that is due to their educa- 
tional achievement and urban residency.’ 

Viewed from a different perspective, the 
relatively mild reaction of southern manage- 
ment to the existence of labor unions, espe- 
cially in large corporations, can be attributed 
to the weakness of unionism in the South. It 
was pointed out earlier that southern labor 
unions have not reached the strength achieved 
by unions outside the southern region.® 

The findings of the present study are most 
meaningful when compared to similar studies 
of managerial attitudes outside the South. The 
most comparable study is one in which Stag- 
ner et al. (1958) developed an 8-item Gutt- 
man scale to measure attitudes of managers 
to labor unions and a 9-item scale of union 
attitudes toward management. The authors 
found that in most organizations ‘manage- 


6 By tolerance Stouffer means “willingness to grant 
certain rights to people whose views might be dis- 
approved, such as the right to speak, the right to 
hold certain kinds of jobs, etc.’ See Stouffer (1955) 
for details. 

7 The educational factor should probably be given 
more weight since urbanism by itself, as Stouffer 
points out, did not account for a significantly high 
level of tolerance in the southern region. 

8 The South is still considered by one union or- 
ganizer of being the “new frontier” for organized 
labor. See Conney (1961). 
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ment representatives were either moderately 
or very favorable toward the union” (Derber 
et al., 1960). They also found that managers 
were favorably disposed toward unions when 
asked to respond to more specific issues such 
as the role of the union in influencing the 
manner in which the organization was run, 
whether or not union claims were considered 
reasonable, and whether or not the amount of 
power enjoyed by union representatives was 
appropriate. 

The findings of the Stagner et al. study 
differ from those reported in this study in sev- 
eral respects. First, in contrast to the generally 
favorable attitudes of the Northern managers, 
the Southern managers had a generally nega- 
tive attitude toward organized labor. Second, 
even more pronounced differences exist be- 
tween the two groups when the specific items 
of the two studies are found. While responses 
to the item dealing with contract negotiations 
“tended to be concentrated at the more fa- 
vorable or friendly end of the scale” in the 
Stagner et al. study, responses of the South- 
ern managers to the question dealing with 
collective bargaining clearly lean toward the 
negative side of the scale. (See Table 1.) To 
the question of labor unions’ power and their 
interference in areas of production and dis- 
cipline the general pattern of responses re- 
ported in the Stagner et al. study is favorable 
although “extensive management criticism 
was apparent in many establishments.” The 
current study, on the other hand, shows a 
rather overwhelming majority of negative re- 
sponses to the question dealing with the legiti- 
macy of labor’s demands for a union shop 
and ‘“‘a share in making decisions in the cor- 
poration.” ® 


SUMMARY AND CONCLUSIONS 


In the absence of an instrument for mea- 
suring attitudes of southern managers toward 
unions, this paper has attempted to develop 





®Scale items in the Stagner et al. study have a 
great deal of resemblance to the corresponding items 
of the current study. The basic difference is that 
questions in Stagner’s study ask reaction of manager 
to relationships between themselves and the union 
which exist in actuality whereas items of the cur- 
rent study are simple declarative statements overly 
implying neither normative nor personal preference. 
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a Guttman-type scale for industrial managers 
in a southern city. The instrument developed 
met the requirements of a “true scale” as 
specified by Guttman. Perhaps one reason for 
the success of this scale is that the items pre- 
sented to the respondents were generalized 
statements. This confirms the Stagner et al. 
conclusion that generalized items tend to scale 
better than specific items. The scalability of 
the items used in this study leads the authors 
to conclude, with the Stagner et al. study, 
that a Guttman scale represents a highly re- 
liable way to ask managers “do you approve 
of the union?” (Stagner et al., 1958). 

A comparison of the characteristics of the 
scale developed on southern managers with 
those developed on populations outside the 
South revealed a number of important dif- 
ferences. It is probably correct to conclude 
that most of the differences arose when ques- 
tions focused on specific issues. The instru- 
ment itself seems to be more applicable to 
small corporations in areas not highly indus- 
trialized, and in regions where unionization 
has not dominated the modes of grouping 
employees. 

A final word is in order. The findings of 
this study must be qualified by one limita- 
tion. That is, some of our “southern” man- 
agers, especially top executives, may not be 
southern in origin. It was pointed out that 
absentee ownership has become an important 
factor in shaping labor-management relations 
in the South. Given the fact that most of the 
corporations included in this study are ab- 
sentee owned, and that the rate of geographi- 
cal mobility of industrial executives is high, 
it is possible that some of our findings may 
reflect the influence of nonsouthern managers. 
In fact, the major cleavage in southern indus- 
trial organization may involve a difference be- 
tween top and middle management with re- 
gard to their views on organized labor which 
is a function of the distinction between large 
absentee-owned corporations and smaller in- 
digenous organizations. 
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AND TEACHING MACHINE AS A FUNCTION OF 
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A study was designed to determine whether the effectiveness of programmed 
material presented by teaching machine and programmed text varies with 
Kuder interest pattern. Specifically, 30 introductory psychology students with 
high mechanical and low literary interest and 30 low mechanical and high 
literary interest students learned 2 programs. 4% of each group learned the 
programs via teaching machine while the remaining halves were instructed by 
programmed text. While no interest patterns or mode of presentation effects 
were uncovered, a significant Interests X Mode interaction appeared for the 
longer of the 2 programs. The implications of the results for educational and 


industrial training programs are discussed. 


Studies which have compared programmed 
texts and teaching machines as modes of pre- 
senting programmed material generally find 
few differences in acquisition and retention 
differentiating the two approaches (e.g., 
Eigen, 1962; Goldberg, Dawson, & Barrett, 
1964; McKeown, 1965. Also cf. Goldstein, 
1964; Goldstein & Gotkin, 1963). However, 
it is possible that, while the two modes of 
presentation might be comparable with groups 
of unselected Ss, individuals might respond 
differentially to the two procedures as a func- 
tion of personality/interest pattern differ- 
ences. In fact, a related suggestion has been 
made by Ripple, O’Reilly, Wightman, and 
Dacey (1965), who feel that personality char- 
acteristics of the learner may differentially af- 
fect learning under programmed instruction 
and conventional teaching regimens. 

A particularly intriguing hypothesis sug- 
gests that Ss may learn differentially from 
teaching machines and programmed texts as 
a function of interest differences. For ex- 
ample, Ss with high mechanical interest might 
be expected to learn better from teaching 
machines than persons with high literary in- 
terest, while the latter group might “outlearn” 
the former on material presented by pro- 


1The authors are indebted to Vi Kohorst and 
Kate Tidd for their assistance in the preparation of 
this paper. 
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grammed text. The present study was de- 
signed to determine if interest patterns reflect- 
ing literary or mechanical preferences are 
related to the differential effectiveness of text- 
and machine-presented programmed materials. 


MeETHOD 
Subjects 


The Ss were selected from 93 male volunteers 
drawn from two introductory psychology courses at 
St. Cloud State College. All of the students were 
administered the Kuder Preference Record, Form 
CH. On the basis of these scores, two groups were 
formed: Group Lni-Mio (above the total sample 
median on the Literary scale and below the total 
sample median on the Mechanical scale) and 
Group Mni-Lio (below the total sample median on 
the Literary scale and above the total sample median 
on the Mechanical scale). Those Ss who did not 
conform to these combinations of scores were elimi- 
nated from the experiment at this point. Also, 
because the statistical design employed required cell- 
to-cell proportionality, some additional Ss were ran- 
domly eliminated and the study was run with 30 Mni— 
Lio and 30 Lai-Mio Ss, half of each sample being 
assigned to a teaching machine group and half to a 
programmed text group. Minnesota Scholastic Apti- 
tude Text scores were available on 51 of the 60 Ss, 
permitting a rough evaluation of the intellectual abil- 
ities of the four resulting samples. An F test run be- 
tween the means of the four samples proved to be 
nonsignificant (F=2.09, df=3/47; M=34.0, SD 
= 14.1); it appears likely, therefore, that intellectual 
differences did not materially affect the results. 


PROGRAMMED INSTRUCTION AND MEASURED INTERESTS 45: 


Programs 


Two short linear programs were prepared es- 
pecially for the project.2 One, arranged in a fading 
format and consisting of 19 frames, was designed to 
lead to memorization of the following fictitious busi- 
ness address: 


Mr. Karl G. Lunden 

Chief of Product Development 
314 Brewster Building 

101 Hollyhock Drive 

Mesa, California 


The second program was intended to teach the prin- 
cipie underlying the digit-letter sequence: 


ORD eatin oe Ne mle dL 


This program consisted of 21 frames with over- 
lapping stimuli. Both programs required constructed 
responses. The machine presentation was imple- 
mented by means of a MTA-100 Scholar teaching 
machine. The special booklets prepared for the text 
presentation were arranged in a horizontal format. 


Procedure 


The 30 Ss each in the Lni—Mi. and the Mni—Lio 
groups were assigned randomly to either a machine 
or a text treatment condition. Upon arrival at the 
experimental room each S was exposed to printed in- 
structions regarding the use of the relevant presenta- 
tion mode. These instructions were comparable ex- 
cept for necessary references to either the machine 
or the text. While the machine program was ad- 
ministered individually, the booklet was adminis- 
tered to students in groups of about 15. The Ss were 
given all the time needed to complete the experi- 
mental programs assigned to them. (The average 
time required for the program was about 15-20 
min.) After completing the program Ss were excused 
with the explanation that they would be called back 
at the same time the following day. No description 
of the content of the second session was offered at 
this point. The next day Ss were examined for re- 
tention of the programmed material. This involved 
instructions to reproduce the business address and 
the digit-letter series learned on the preceding day. 
The dependent variable used for analysis was the 
number of individual characters correctly recalled. 
Thus, in the business address there was a total of 
88 letters and numbers, and a perfect score neces- 
sitated the recall of all characters in the correct 
order. Correspondingly, a maximum of nine charac- 
ters could be recalled in the digit-letter sequence. 


RESULTS 
Business Address Retention 


The means of the total number of business 
address characters recalled after 24 hr. are 


2 Copies of the programs are available from the 
senior author upon request. 


on 
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TABLE 1 


MEans FoR ToTAL NUMBER OF BusINEssS ADDRESS AND 
Dicit-LETTER CHARACTERS RECALLED AFTER 





24 Hr. 
Group Machine Text Total 
Lii-Mi.* 
Address 50.00 62.80 56.40 
Digit-letter 7.28 6.67 6.98 
Mui-Li.” 
Address 66.74 51.07 58.90 
Digit-letter 7.87 7.87 7.87 
Total 
Address 58.37 56.94 57.66 
Digit-letter 7.58 7.27 7.42 


Note.—n =15 for each cell. 

= Above the total sample median on the Literary scale and 
below the total sample median on the Mechanical scale. 

> Below the total sample median on the Literary scale and 
above the total sample median on the Mechanical scale. 


summarized in Table 1. Differences among 
the means of the rows and columns were 
tested for significance at the .05 level by a 
two-day analysis of variance. Table 2 sum- 
marizes this analysis. 

The implications of these data are as fol- 
lows: 


1. It is obvious that for the combined sam- 
ples there is no significant difference between 
the effectiveness of the programmed text and 
the teaching machine. 

2. When data from the programmed text 
and the teaching machine are combined there 
is no significant difference between the per- 
formances of the two interest groups. 

3. A significant interaction is observed be- 
tween interest pattern and mode of presenta- 
tion. By comparison with the programmed 
texts, the teaching machine appears to be a 
more appropriate technique for those with 
high mechanical and low literary interests 


TABLE 2 


ANALYSIS OF BUSINESS ADDRESS RETENTION 








Source SD! df MS F 
Machine—Text 30.8 1 30.8 | <1.00 
Group 93.7 1 03-7, \r<1.00 
Interaction 3038.8 1 3038.8 4.02* 
Within cells 42294.3 56 755.26 

Total 45457.6 59 





¥ ip <.055 
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while the opposite is true for those with low 
mechanical and high literary interests. 


Digit-Letter Retention 


The means for the four samples on the 
digit-letter task are also summarized in 
Table 1. 

The digit-letter retention scores were 
analyzed in the same manner as the business 
address data in Table 1. However, as neither 
the differences among marginal means nor the 
Row X Column interaction approached sig- 
nificance, the analysis is not described in a 
separate table. The Fs for interest pattern, 
mode of presentation, and interaction were 
1.50, .17, and .16, respectively, all df’s= 
1/56. 


DIscUuSsSION 


It is evident that with at least some pro- 
grams (e.g., the business address program) a 
significant interaction exists between interest 
pattern and mode of presentation. 

A sophisticated theoretical analysis is not 
needed to account for the differences observed 
between the Ly;-M), and the M);-Ly, groups. 
A student who “likes” things mechanical and 
“dislikes” printed materials might be expected 
to respond more favorably to the teaching 
machine on motivational grounds. Conversely, 
an S who “likes” books but “dislikes” ma- 
chines might be expected to exhibit an oppo- 
site effect relative to the two presentation 
modes. 

The results may have particularly broad 
implications for future development in the 
programmed instruction field. They suggest 
that training programs for groups with high 
mechanical interests (presumably factory 
workers, tradesmen, etc.) might best be 
structured for machine presentation while 
those designed for persons with stronger liter- 
ary interests (e.g., students, clerical workers) 
are probably best presented as programmed 
texts. Additional research on the relationships 
between interests and differential effectiveness 
of various types of programs seems to be very 
much in order. 

As has been noted, no interaction effect 
was observed with the digit-letter program. 
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This variable influence of interest patterns 
upon retention may be referable to specific 
differences in the two programs. First, a fun- 
damental difference in format is to be noted. 
The business address program employed a 
fading format in which the general instruc- 
tional goal is memorization. The digit-letter 
program, on the other hand, was designed to 
teach a particular principle from which the 
construction of a particular series of charac- 
ters could follow. 

Second, the difference in the number of 
characters in the two programs is to be noted. 
While a maximum of 88 correct responses 
were involved in the retention of the business 
address, the digit-letter series included only 9 
characters. Thus, the variability in recall was 
restricted in the latter program due to a much 
lower maximum “ceiling” of retention. 

In any event it would appear that with 
some programmed materials arranged in cer- 
tain formats, retention varies as a function of 
particular patterns of measured interests. The 
more specific nature of this interaction awaits 
further analysis as does the relationship be- 
tween interest and personality variables, and 
criteria other than retention. 
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This research studied the extent of individual need fulfillment of career United 
States Army officers at 3 different levels in the organizational hierarchy. The 
relative importance which the officers attach to the needs was also investigated. 
A sample of 504 officers in the ranks of captain through colonel responded to a 
mailed questionnaire. Conclusions are: (a) Perceived deficiencies in need ful- 
fillment tend to increase at successively lower levels in the organizational 
hierarchy. (b) In general, need-fulfillment deficiencies are greatest for the needs 
in the ego and self-actualization categories. (c) The deficiencies which appear 
to have the most deleterious effect on the accomplishment of Army goals are 
those associated with the opportunity to exercise authority and the comparative 
amount of recognition received in command and staff positions. 


Like many industrial organizations, the 
United States Army devotes considerable time 
and effort to the problem of recruitment and 
retention of qualified managerial (officer) 
personnel. Another problem of equal im- 
portance, which has received perhaps less at- 
tention, is that of obtaining maximum effort 
and value from those officer personnel who 
have committed themselves to a military 
career. Myers (1964) and others provide 
evidence to suggest that certain support fac- 
tors such as good working conditions, fringe 
benefits, and fair practices may reduce dis- 
satisfaction to the extent that individuals will 
stay with the organization, but that a differ- 
ent group of factors causes managers to be 
highly motivated toward their work. Myers 
believes that this latter group, which he calls 
“motivators,” includes the factors of achieve- 
ment, growth, responsibility, and recognition. 

McGregor (1957) interprets this same mo- 
tivational problem in terms of the hierarchy 
of needs developed by Maslow (1954). He 
feels that organizations continue to concern 
themselves with satisfying the physiological 
and safety needs of their employees through 
the provision of better wages, better working 
conditions, and more fringe benefits even 
after these two lower level needs have been 
quite well satisfied. The greatest mistake, Mc- 


1 Associate professor of economics and chairman, 
Industrial Relations Committee, Purdue University. 

2 Currently assigned as assistant research officer, 
Office of Military Psychology and Leadership, United 
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Gregor believes, stems from management’s 
failure to recognize that employee dissatisfac- 
tion and failure to put forth more than mini- 
mum effort derive from the lack of opportuni- 
ties to satisfy their social, ego, and self-ful- 
fillment needs. The needs contained within 
these three higher level categories, particu- 
larly the ego and self-fulfillment needs, closely 
parallel Myers’ “motivators.” 

In a series of studies of industrial manage- 
ment personnel, Porter (1961, 1962, 1963a, 
1963b, 1964) assessed the extent of per- 
ceived deficiencies in need fulfillment as a 
function of job level and of line versus staff 
type of job. Using questionnaire items classi- 
fiable into a Maslow-type need hierarchy sys- 
tem he found that the largest deficiencies 
existed in the two highest-order need cate- 
gories and that deficiencies tended to in- 
crease at each successively lower level of the 
management hierarchy. He also found that 
line managers perceived greater need fulfill- 
ment than staff managers but did not differ in 
regard to the importance they attached to 
each type need. 


PURPOSE 


The purpose of the present study was to 
examine the extent of individual need fulfill- 
ment of regular United States Army officers 
at three different levels in the organizational 
hierarchy. Deficiencies in need fulfillment 
were assessed by determining to what extent 
an incentive is provided for each need versus 
the extent to which an incentive should be 
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provided for each need. The discrepancy be- 
tween what is provided versus what should 
be provided is an indirect measure of the sat- 
isfaction or dissatisfaction with each need. An 
investigation was also made of the relative 
importance which the officers attach to the 
needs used in the study. 


MerTHOD 
Subjects 


The Ss were 504 regular Army officers who re- 
sponded to a mailed questionnaire (87.4% response 
rate). Officers attending Army service schools at 
three different levels were selected for the sample. 
The study was limited to officers who expressed their 
commitment to at least a 20-yr. Army career and to 
officers in the three “combat” branches, that is, 
Armor, Artillery, and Infantry. Officers in these 
branches experience a similar and reasonably clear- 
cut career pattern of alternating command and staff 
assignments up through the organizational hierarchy. 
Table 1 contains additional information pertaining 
to the sample. 


Questionnaire 


Using Maslow’s need hierarchy as a guide, the 
needs shown in Table 2 were selected for investi- 
gation. 

Interest in the extent of perceived deficiency in 
need fulfillment prompted the use of two similarly 
constructed questions for each need. The first ques- 
tion asked essentially, “How much opportunity do 
you have at your current rank to satisfy a particular 
need?” The second question asked, “How much op- 
portunity should you have to satisfy this same 
need?” The needs were divided into two categories 
due to the types of responses considered appropriate 
for each category. The first category consisted of 
those needs where the opportunity for fulfillment 
in the Army could be reasonably compared with the 
opportunity for fulfillment provided by other careers 
for which the officers might quality. These needs in- 
cluded financial security, association and _ lasting 
friendships with high-caliber persons, independent 


TABLE 1 
OFFICERS SAMPLED BY RANK CLASS 





Lt. Colonels/ 


Item Captains Majors Casale 

n 271 164 69 
Age 

M 27.98 35.42 44.12 

SD 2.10 2.00 ele 
Years service 

M 5.65 13.00 21.67 

SD 1.41 1.46 2.18 





Note.—WN =504. 
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TABLE 2 


SpeciFric NEEDS BY NEED CATEGORY INCORPORATED 
INTO QUESTIONNAIRE 


Specific needs investigated: 


Need category Need for 


Safety Financial security 

Social Association and lasting friendships 
with high-caliber persons 

Ego 


Self-esteem Authority 

Independent thought and action 

Prestige (from people outside the 
military) 

Recognition (credit for accomplish- 
ment) in command positions 

Recognition (credit for accomplish- 
ment) in staff positions 

Realizing full potential 

Feeling of worthwhile accomplish- 
ment 


Reputation 


Self-actualization 


thought and action, prestige, realizing full potential, 
and a feeling of worthwhile accomplishment. The 
first question for each of these needs, then, asked 
the officers to compare need fulfillment opportunities 
in the Army to other careers with responses on a 5- 
point Likert scale of less to more. The second ques- 
tion for each need was similarly constructed except 
for dropping off the lowest response (less) and add- 
ing a response to the upper end (much more). This 
was done in an effort to obtain a more normal 
distribution of responses to the second question. 
Since it was unlikely that a respondent would state 
that he should have less opportunity to satisfy a 
particular need than he is now being provided, 
identical scales for both questions would tend to 
force the answers to the second question into the 
upper end of the scale. The second category of needs 
consisted of those which had an internal orientation 
and did not lend themselves to a comparative ap- 
proach. These needs included those for authority and 
for recognition in command and staff positions. 
Questions concerning these needs were stated in 
terms of how much recognition do you and should 
you receive and how much authority do you and 
should you get to exercise, considering the amount of 
responsibility associated with the job. This dic- 
tated the use of a low-high scale with responses to 
the first question for each need of low, somewhat 
low, neither especially high nor low, somewhat high, 
and high. As with the first category of questions, the 
responses to the second question were altered by 
dropping off the lowest response (low) and adding 
a response to the upper end (very high). An open- 
ended question was used to solicit any comments the 
Ss desired to make regarding their responses to the 
two questions pertaining to each need. Finally, Ss 
were asked to rank in order of importance to them 
the specific needs under study. The questionnaire 
was pretested on 20 locally available officers. 


NEED FULFILLMENT OF ARMY OFFICERS 


RESULTS 


Perceived need-fulfillment deficiencies were 
measured by computing the difference be- 
tween responses to the two questions for each 
need. These difference scores were analyzed 
and the data are presented in two different but 
related contexts. In the first place, there is 
interest in determining the relative amount 
of dissatisfaction among the various needs 
studied. Table 3 lists the mean difference 
scores for each need by rank class. The abso- 
lute values of these scores are not as im- 
portant as the relative values from one need 
to another. Furthermore, certain trends are 
apparent when the mean difference scores 
for the same need are compared across rank 
classes. 

It can be seen from Table 3 that the cap- 
tains are most dissatisfied with the prestige 
which is accorded their rank by people out- 
side the military while the majors and colonels 
are most dissatisfied with the amount of 
authority they are permitted to exercise in 
command positions. It is interesting to note 
that the captains have three scores which are 
higher than the maximum score for the 
majors. On the other hand, the highest score 
in the group is that of the colonels’ score on 
authority. With six of the nine needs, the 
captains’ score is higher than the majors’ score 
and the majors’ score is higher than the 
colonels’ score. This suggests, of course, that 
need-fulfillment opportunities are better at 
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TABLE 3 


MEAN DIFFERENCE SCORES BY RANK CLASS AS A 
MEASURE OF THE EXTENT OF PERCEIVED 
DEFICIENCIES IN NEED FULFILLMENT 








M difference scores 


Need for 
Captains | Majors | Colonels 

Financial security LST LT 1.3780 1.3768 
Association and last- 

ing friendships with 

high-caliber persons 1085 5244 3043 
Authority 1.8044 1.6280 1.9420 
Independent thought 

and action 1.8155 1.5061 1.3188 
Prestige 1.8893 1.3561 1.3188 
Recognition (com- 

mand positions) 1.4502 1.3598 1.2029 
Recognition (staff 

positions) .8930 .6463 7391 
Realize full potential | 1.6125 1.5000 1.1884 
Feeling of worthwhile 

accomplishment 1.0959 .8049 .6667 





successively higher levels in the organiza- 
tional hierarchy. In general, the greatest de- 
ficiencies exist in the ego and self-actualiza- 
tion categories. 

Though the trends in mean difference 
scores among the rank classes have been 
noted, it was considered desirable to deter- 
mine whether there were statistically signifi- 
cant differences in the distribution of dif- 
ference scores among rank classes for each 
need. Rather than using the mean scores and 


TABLE 4 


SIGNIFICANT DIFFERENCES AMONG RANK CLASSES IN THE EXTENT OF PERCEIVED DEFICIENCIES IN 
NEED FULFILLMENT 











Kruskal-Wallis H values 





Need 
Overall Capt-Maj Maj-Col Capt-Col 
Financial security ns _ = oa 
Association and lasting friendships with 
high-caliber persons 9.5835* 4.2970* ns 8.0991* 
Authority ns = — ey 
Independent thought and action 9.3400* Ah ns 6.8548* 
Prestige 20.8152* 18.0205* ns 7.7898* 
Recognition (command positions) ns — a = 
Recognition (staff positions) ns — — = 
Realize full potential ns — — ay 
Feeling of worthwhile accomplishment 7.0665* 3.6909* ns 5.4324* 





*p<.05. 
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some parametric statistic wherein the ability 
to meet the assumptions inherent in such a 
statistic would be questionable, the Kruskal- 
Wallis H test was used. Hays (1963) believes 
that this is a good rank-order method of deter- 
mining whether a group of independent sam- 
ples comes from the same or different popula- 
tions and states that it compares very well 
with the F test from the analysis of variance. 
Hays (1963) points out that once the H has 
been computed, a correction formula should 
be applied in any case where the number of 
Ss is small or where there is a large number 
of tied observations. Since there was a sub- 
stantial number of tied observations in the 
distributions of difference scores, this correc- 
tion is applicable here. The correction serves 
to increase the size of the H, so a correction 
was applied when the matter of significance 
was in question. 

Table 4 shows the results of these computa- 
tions. An overall H was computed for each 
need and, where the H was significant, three 
additional H tests were computed to deter- 
mine the difference between any two rank 
classes. It can be seen that, for the four needs 
where the overall H tests are significant, in 
each case there are significant differences be- 
tween the captains and majors and the cap- 
tains and colonels but not between the majors 
and colonels. 

The results of the investigation of the 
relative importance which Ss attach to the 
needs used in the study are contained in the 
following hierarchy of needs going from most 
important to least important: 


1. Feeling of worthwhile accomplishment. 

2. Opportunity to realize full potential. 

3. Opportunity for independent thought 
and action. 

4, Financial security. 

5. Opportunity to associate with high- 
caliber persons. 

6. Recognition. 

7. Opportunity to exercise authority. 

8. Prestige. 


No differentiation was made between recog- 
nition in command and staff positions. Based 
on an examination of the frequency distribu- 
tion of responses for each need, the median 
response was selected as the most meaningful 
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index of central tendency for the purpose of 
determining the rank order. For simplicity, 
only the rank order based on the entire sam- 
ple is presented. When the rank classes were 
taken separately, there were slight and gener- 
ally insignificant differences in the rank order 
for majors and colonels. The greatest con- 
sensus among all Ss existed for the two most 
important and two least important needs. 


DISCUSSION 


The Army, like the industrial organiza- 
tions studied by Porter, has apparently cre- 
ated an environment in which the opportuni- 
ties for higher level need fulfillment are 
greater at successively higher levels in the 
organizational hierarchy. Such an environ- 
ment is conducive to continued growth and 
development of managerial (officer) person- 
nel, and it would appear that guidance of 
junior officers should stress the ever-increas- 
ing opportunity for higher level need satis- 
faction as they progress through their careers. 
On the other hand, need-fulfillment opportuni- 
ties at the lower managerial levels must be 
sufficient to prevent widespread dissatisfaction 
at these levels. 

Some interesting implications emerge when 
the obtained need-fulfillment deficiencies are 
considered in conjunction with the relative 
importance which Ss attached to the needs. 
Based on Maslow’s hierarchy it would be pre- 
dicted that needs from the ego and self-actu- 
alization categories would be the most im- 
portant and perceived as the least fulfilled. 
This prediction generally holds up for the 
three needs perceived as most important but 
is inconsistent with the results pertaining to 
the needs for prestige and for the opportunity 
to exercise authority. Not only are these needs 
rated in the last and next to last position by 
all ranks, but also the officers perceive rela- 
tively large need-fulfillment deficiencies for 
these needs. This raises the question of 
whether these needs are truly unimportant to 
the officers studied, relative to the other needs, 
or whether the extent of perceived need-ful- 
fillment deficiencies is sufficiently large to 
have caused the officers to downgrade the 
importance of these needs because they see no 
hope of satisfying them. 

Regardless of the reason for the need for 
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authority receiving such a low ranking, the 
findings relative to this need appear to present 
the Army with a problem of considerable 
magnitude. The Army, of necessity, must 
place great importance on the development 
of commanders. A Department of the Army 
(1964) circular states that “command and su- 
pervisory positions enjoy the highest prestige 
of all positions in the Army, and _there- 
fore are the most sought after duty assign- 
ments [p. 8].”’ The low ranking and the dis- 
satisfaction with the opportunity to exercise 
authority do not bear out this statement. The 
majors and colonels expressed their greatest 
dissatisfaction with this need. Not only is it 
a reversal of trend for the colonels to express 
more dissatisfaction with this need than the 
two lower ranks, but their score is the highest 
of all dissatisfaction scores in the study. The 
questions on authority elicited a number of 
open-ended responses which may help to ex- 
plain these results. The following quote from 
one lieutenant colonel expresses the general 
tenor of these responses: 


One factor, more than any other, promises to 
affect adversely motivation. That is, the trend 
toward highly centralized control and management 
of all military endeavor. This trend not only will 
systematically reduce the aggressive initiative of 
those already in the Army but also will reduce the 
interest of younger men of high caliber who might 
otherwise have been attracted to the challenges of 
military life. At its worst, the trend could produce a 
mediocre Army of conformists, input data statisti- 
cians, and escapists looking for immunity from the 
responsibilities of decision. 


Though the Army relies heavily on the 
proper exercise of authority, it would appear 
that the conditions under which such author- 
ity is being exercised are less than favorable. 

A related problem stems from the findings 
relative to the need for recognition or credit 
for accomplishment in command and staff 
positions. The data show considerably more 
dissatisfaction with the amount of recogni- 
tion received in command positions than in 
staff positions which suggests that the amount 
of recognition being given is not commensur- 
ate with the responsibility associated with 
each type position. This state of affairs can 
only serve to detract from the Army’s empha- 
sis on the attractiveness of command posi- 
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tions. Considering these results in conjunction 
with the results concerning the need to ex- 
ercise authority, it would not be unlikely that 
officers would lean toward staff positions 
where the risk of failure is much less than 
that for command positions. 

The respondents’ reactions to the questions 
related to the feeling of worthwhile accom- 
plishment deserve some comment. The officers 
approached unanimity in the selection of this 
need as the most important, and the perceived 
deficiency in fulfillment of this need was not 
as great as might have been predicted. The 
desire to explain their reasons for selecting 
this need as most important prompted more 
open-ended responses than any other item. 
In general, the officers consider that the Army 
has a mission of great significance, and the 
opportunity to serve one’s country in such a 
capacity is the most important reward of an 
Army career. They also suggest that the mag- 
nitude of this reward is sufficient to overcome 
some of the disadvantages inherent in a 
service career. 
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The causal basis for the relationship between expectancy attitudes and job 
performance was tested in this study by the use of 2 correlational approaches to 
determining causality. Data were gathered from 55 managers on the probability 
that certain rewards could be obtained as a result of their job performance. In 
addition, self, superior, and peer ratings of job performance were obtained for 
each manager. Similar attitude and performance data were collected at 2 points 
in time separated by about 1 yr. A cross-lagged correlational analysis of the 
data generally supported the view that the expectancy attitudes could be best 
thought of as causing the job performance. A dynamic correlational analysis of 
the data failed to show strong positive results. The implications of these findings 
for the time lags that are involved in expectancy attitudes causing job perform- 


ance were considered. 


A number of recent studies have demon- 
strated that a significant relationship exists 
between employees’ beliefs about the prob- 
ability that rewards depend upon their job 
behavior and their job performance. Lawler 
and Porter (1967a), for example, have found 
that the more managers believe rewards like 
pay, promotion, and respect from others stem 
from good performance the more likely they 
are to be rated as good performers by their 
peers, superiors, and selves. Generally, similar 
findings have been reported in four other 
studies of expectancy, reward-probability, or 
path-goal attitudes as this kind of belief has 
been referred to by different researchers 
(Galbraith & Cummings, 1967; Georgopoulos, 
Mahoney, & Jones, 1957; Porter & Lawler, 
1968; Spitzer, 1964). These studies have 
also tended to show that when expectancy 
attitudes are weighted by the value or im- 
portance of the rewards considered, the cor- 
relations between the attitudes and job per- 
formance are increased. Lawler and Porter 
(1967a), for example, report a correlation of 
.25 between unweighted expectancy-attitude 
items and a measure of performance, but a 
correlation of .38 between performance and 
the same expectancy-attitude items when they 
had been weighted by the value of the rewards 
to the individual managers. 

1The work on this project was done while the 


author was a Ford Foundation Faculty Research 
Fellow. 
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The finding that expectancy attitudes are 
significantly related to performance is, of 
course, predictable from the kind of expect- 
ancy theories of motivation that have recently 
been stated by Vroom (1964), Atkinson 
(1964), and others. Earlier statements of 
expectancy theory by Peak (1955) and Rotter 
(1955) also are congruent with these findings. 
All these statements of expectancy theory 
predict that a positive relationship between 
expectancy attitudes and performance should 
exist because expectancy attitudes are re- 
flections of the individual’s motivation to 
perform. Thus, the expectancy attitudes are 
essentially seen to be causing or leading to 
the performance. Hence, the relationship is 
expected not because the performance causes 
the expectancy attitudes or because a third 
variable causes both but because the expect- 
ancy attitudes cause the performance. 

All of the previous studies of the relation- 
ship between expectancy attitudes and per- 
formance have used a static correlational 
approach and thus it is impossible to deter- 
mine the causal basis for the relationships 
found. It typically has been assumed by the 
researchers that the relationship found came 
about because the attitudes caused the per- 
formance but this assumption has not been 
tested. As Porter and Lawler (1968) have 
pointed out, an explanation to the effect that 
the attitudes caused the performance allows 
for the most parsimonious explanation of all 


ATTITUDES AND JOB PERFORMANCE 


the data collected and it fits with motivation 
theory. However, before it is safe to assume 
that the causal direction runs from the at- 
titudes to the performance, direct tests of the 
causal direction are needed. The present study 
represents an attempt to fill this gap in the 
literature by specifically determining the 
causal basis for the commonly found relation- 
ship between expectancy attitudes and job 
performance. 

The usual way of determining whether two 
variables are causally related is, of course, 
to do an experiment in which one variable can 
be manipulated in order to observe its effect 
on another. There are, however, certain re- 
search problems that do not lend themselves 
to the experimental model regardless of 
whether a field or laboratory experiment is 
done. Experimental laboratory studies often 
do not seem to offer an adequate enough 
simulation of the real world for results to be 
used with confidence when more applied prob- 
lems are being dealt with, while field experi- 
ments present a number of practical and 
ethical problems. These limitations of the 
experimental approach are certainly present 
when consideration is given to testing for the 
causal relationship between expectancy at- 
titudes and job performance. There are, how- 
ever, two alternatives to the traditional experi- 
mental approach that would seem to offer 
ways of looking at the causal basis for the 
relationship considered in this paper. Both of 
these approaches can be classified as quasi- 
experimental or correlational-causal in nature 
since they allow causal inferences to be made 
with some degree of confidence without the 
usual experimenter (#) manipulation of vari- 
ables or E control over the situation being 
present, 

The first of these approaches is the cross- 
lagged panel correlational method. This ap- 
proach was initially discussed by Simon 
(1954) and has been more recently considered 
by Blalock (1962), Pelz and Andrews (1964), 
and Campbell and Stanley (1963). It in- 
volves gathering similar data on the two 
variables that are to be tested for causality at 
two different points in time. In the case of 
the present problem, this would mean gather- 
ing identical expectancy-attitude data and 
identical job-performance data at two sepa- 
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rate points in time. Once these data have been 
collected, the kind of cross-lagged correla- 
tional analysis shown in Figure 1 can be set 
up to analyze the relationship between ex- 
pectancy attitudes and job performance. 

This analysis yields six correlation coeffi- 
cients, two of which (3 and 4) tell something 
about the stability of the measures and are 
not directly involved in tests for causality, 
while the others can give some indication of 
the causal direction of the relationship found 
between the attitude and performance mea- 
sures. If the remaining correlations shown in 
Figure 1 stand in the following relationship to 
each other: 5 >(1 = 2)>6, then there is good 
reason to believe that the expectancy attitudes 
caused the performance. However, if the cor- 
relation coefficients stand in this relationship: 
6 >(1 = 2)>5, then there is reason to believe 
that the performance caused the expectancy 
attitudes. The one weakness in this approach 
is that it cannot rule out the possibility that a 
third variable causes the two other variables 
to covary. 

The logic underlying this kind of analysis 
rests upon the time lag that typically exists 
when one variable causes another. The argu- 
ment is that if A causes B then the present 
state of A should be more strongly related 
to B’s future state than to B’s past or present 
state. Thus, where A—> B, then rap where B 
is measured after A should be greater than 
Yan where B is measured either before or at 
the same time as A. Thus, by comparing the 
relative sizes of 7ap,, “ano, aNd fan, Where B 
is measured before, after, and at the same 
time as A, it is possible to determine whether 
the hypothesis A—B or B-—A is more 
tenable. Translating this to the present prob- 
lem it means that, if the expectancy attitudes 


t, te 


Expectancy<———_(3) > Expectancy 





Job Per dfmance<———(4)>Job Performance 


Fic. 1. Cross-lagged correlational model. 
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cause the performance, the strongest relation- 
ship should be between the attitude measure 
at Time 1 (t,) and the performance measure 
at Time 2 (te). 

Vroom (1966) has recently suggested a 
second method that can be used in order to 
determine the causal basis for the relationship 
between two variables. This method, like the 
cross-lagged approach, requires identical mea- 
surements at two points in time. It has the 
advantage of being strong where the cross- 
lagged approach is weak, that is, it can help 
rule out the possibility that a third variable 
is responsible for the relationship found. How- 
ever, unlike the cross-lagged approach it can- 
not tell whether A—B or B— A. All it can 
tell is something about whether a third vari- 
able has caused both. Thus these two ap- 
proaches are complementary in nature and 
should be used together. 

Dynamic correlations are computed by cor- 
relating the difference between A at t; and te 
with the difference between B at t, and to. 
Thus, in effect, changes in one variable are 
being correlated with changes in another 
variable. To produce a spurious dynamic cor- 
relation between two different scores AA and 
AB, a third variable must change in different 
amounts or directions in the members of the 
sample, and these changes must be highly cor- 
related with both AA and AB. Obviously it 
is extremely unlikely that any variable will 
meet these conditions and, because of this, 
most potential third causal variables can be 
ruled out. Thus when a significant dynamic 
correlation is found between two variables, 
one can put more faith in the fact that one 
caused the other than one can when a sig- 
nificant static correlation appears between two 
variables. This is particularly true in situa- 
tions where the researcher can determine that 
many other variables are not changing at all 
or are not changing in a way that would lead 
to their accounting for the significant dynamic 
correlations. 

There is one additional issue that must be 
considered when causal-correlational methods 
such as those described here are used to test 
for causality in situations where the causal 
variable is likely to be continuously changing 
over time. This issue has to do with how much 
time actually elapses between changes in one 
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variable and changes in the other. This issue 
also serves to highlight a difference between 
the two methods. Dynamic correlations would 
seem to be most likely to show positive results 
when the change in one variable leads almost 
immediately to a change in the other variable, 
that is, where the two variables can, for 
practical purposes, be thought of as moving 
almost together, although causality still exists. 
Since this method compares changes in A 
with changes in B between two points in time, 
the changes in A must be quickly reflected in 
B if the changes in the two from t, to te 
are going to be highly correlated. If a long 
time lag exists between changes in A and 
their effects on B then the changes in A be- 
tween t; and te will not be reflected in the 
changes in B between t; and te regardless of 
how far t; and ts are separated. They would 
be more likely to appear in the difference be- 
tween B at te and tz. 

The cross-lagged approach would seem to 
be most likely to show positive results when 
there is some delay between changes in the 
first variable and changes in the second vari- 
able, and when the separation between data- 
collection points approximates the delay be- 
tween changes in one and changes in another. 
If the impact of the change is very rapid, 
then the cross-lagged approach will not show 
positive results since it depends on Variable 
A measured at t; influencing Variable B mea- 
sured at te more strongly than it influences 
Variable B measured at t;. If the time lag 
for the effect of A on B is very short the 
effect probably will already have taken place 
at t;. And, thus, A will be as highly cor- 
related with B measured at t, as with B 
measured at te. Considering the specific prob- 
lem in the present study, it would seem that 
there may be considerable delay between 
changes in expectancy attitudes and changes 
in performance. Although an individual might 
start working hard immediately after an at- 
titude change it undoubtedly would take a 
period of time for this actually to show up in 
the relatively crude performance measures 
that are presently available. On the other 
hand, if t; and te are not separated by a 
sufficient period for A at t; to affect B at to 
the cross-lagged analysis will not show posi- 
tive results. In order to allow the cross-lagged 
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approach sufficient time to show positive re- 
sults, it was decided to separate the two data- 
collection sessions by 1 yr. 


METHOD 
Sample 


Data were collected from 55 managers in social 
service agencies. Included were the directors and all 
the managers of a YMCA, a visiting nurse service, a 
Girl Scout council, and a city department of 
health. Of the 55 managers studied, 41 could be con- 
sidered first-level supervisors while the remaining 
managers were at higher levels of supervision. A 
comparison between the demographic characteristics 
of the sample and the characteristics of other sam- 
ples of middle- and lower-level managers (e.g., 
Porter, 1961, 1962) revealed good comparability with 
one exception. Thirty-one of the managers in the 
present sample were females while none of the pre- 
vious samples contained any female managers. Other 
studies (Lawler, 1967; Lawler & Porter, 1967a, 
1967b) have included data gathered from part of 
this sample. 


Attitude Measures 


The attitude data were gathered by means of a 
short questionnaire. One part of the questionnaire 
was designed to measure the managers’ reward-ex- 
pectancy attitudes. It listed six rewards each of 
which was assumed to be related to the satisfaction 
of one or more needs: pay, promotion, prestige, se- 
curity, autonomy, and opportunities to use skills 
and abilities. For each of these six rewards the 
managers were asked to indicate on a 7-point scale 
how helpful six factors were for obtaining the 
rewards in their organizations (see Lawler & Porter, 
1967a, 1967b, for a description of this questionnaire). 
Two of the factors that were listed were particularly 
concerned with the job performance of the managers 
(working hard and quality of job performance). 
Previous studies (Lawler & Porter, 1967a) have 
shown that these two items tend to correlate highly 
for managers when the managers are asked to indi- 
cate how helpful the items are in obtaining im- 
portant rewards. 

A second part of the questionnaire contained a 
section that asked each manager to indicate how im- 
portant the six reward items were to him. For these 
ratings a 7-point scale (1 unimportant to 7 very 
important) was used. In order to develop ex- 
pectancy-attitude measures for each manager, the 
data from the importance measures were combined 
with the data concerned with the likelihood that 
good performance and working hard would lead to 
the rewards. As suggested by the expectancy theories 
of motivation, this was done by multiplying the im- 
portance of the reward times the degree to which 
it was seen to be obtainable from both good per- 
formance and working hard. The results of these two 
multiplications were then summed for each of the six 
rewards so that for each manager a composite ex- 
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pectancy-attitude measure was developed of the de- 
gree to which he felt receiving each of the six 
rewards was based upon his job performance. 


Job-Performance Measures 


Each manager was ranked relative to other man- 
agers holding similar jobs by his superior and by 
his peers. Managers were ranked on four factors, 
one of which, the quality of his job performance, is 
relevant for the present study. The rankings were 
converted to standard scores with a mean of 50 and 
a standard deviation of 10. Where a manager was 
ranked by more than one peer, an average peers’ 
ranking was computed. In addition to being ranked 
by his superior and peers, each manager was asked 
to rate himself on a number of factors, one of which 
was the quality of his job performance. These rat- 
ings were made on a 7-point scale ranging from 1 
(low in comparison to similar others) to 7 (high in 
comparison to similar others). 

Analysis of the rankings showed fairly good agree- 
ment between the superior and peer rankings (eg., 
r= .56 between peer and superior rankings on job 
performance at t1 and r=.54 for te). It was en- 
couraging to note that superiors’ and peers’ rankings 
correlated most highly when they were ranking the 
managers on identical factors. The correlations be- 
tween superior and peer rankings on the same 
factor also generally tended to be higher than the 
relationships among the superiors’ and peers’ rank- 
ings of the same people on different factors. Thus, 
the superiors’ and the peers’ rankings substantially 
met the requirements of convergent and discriminant 
validity (Campbell & Fiske, 1959). The situation 
with respect to the self ratings was quite different, 
however. The self ratings of job performance showed 
only a low relationship to the peers’ rankings (r= 
.26 at ti) and to the superiors’ rankings (r= .09 at 
ti). Thus, the self ratings did not show convergent 
and discriminant validity. 


Procedure 


Two meetings, 1 yr. apart, were held with all 
the participants in the study. At both meetings they 
were asked to fill out similar attitude questionnaires 
and ranking forms. It was carefully explained to all 
participants that they were participating in a univer- 
sity-sponsored research project and that all the data 
obtained from them would be held in the strictest 
confidence. The researcher remained in contact with 
the organizations during the year in which the 
study took place in order to be aware of any 
major changes in the organizations. None of the 
organizations, however, changed in size, purpose, 
structure, or in administrative procedures. There 
were some changes in personnel during the period. 
A total of five managers left the organizations. This, 
of course, meant some changes in who ranked certain 
people when the te data were collected. Some man- 
agers were ranked by different superiors and peers 
at ti and at te. 
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RESULTS 


In presenting the data collected, first the 
results concerned with the superior rankings 
will be considered, next those concerned with 
the peer rankings, and finally those concerned 
with the self ratings. Figure 2 therefore pre- 
sents the data for the cross-lagged correla- 
tional analysis for the relationships among 
the expectancy-attitude measures and the 
superiors’ rankings of job performance. Be- 
cause expectancy theory calls for a linear 
combination of each manager’s six reward- 
expectancy measures, in order to predict his 
job performance, multiple correlation coeffi- 
cients were used for Relationships 1, 2, 5, 
and 6. These four relationships represent all 
the instances where attitude measures were 
correlated with performance measures. Thus, 
all tests of causality were made by comparing 
the relative sizes of multiple correlation coeffi- 
cients. The correlation between the two per- 
formance measures is a Pearson product- 
moment while the correlation between the 
expectancy attitude measures is the average 
Pearson r for the six rewards considered. 

The correlations presented in Figure 2 be- 
tween the attitudes and the superiors’ per- 
formance measures bear the kind of relation- 
ship to each other that would be expected 
if the attitudes caused the performance. The 
lowest correlation is between the performance 
at t; and the attitudes at te, while the highest 
correlation is between the attitudes at t; and 
the performance at ty. In addition the correla- 
tions between the attitudes and performance 
data when the coefficients are computed from 
data collected at the same point in time fall 
between these two correlations. In order to 
compute the dynamic correlations between 
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Fic. 2. Cross-lagged correlational analysis for mea- 
sures of expectancy attitudes and superiors’ rankings 
of job performance (* p< .05; ** p< .01). 
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the changes in the attitude variables and the 
changes in the performance measure, a multi- 
ple correlation coefficient was used since 
changes in six attitude variables were being 
related to the changes in the performance 
measure. For the superiors’ rankings, this 
coefficient did show positive Beta weights for 
the predictors as would be expected if in- 
creases in the attitude measure were associated 
with increases in the performance measure 
but the coefficient did not reach statistical 
significance (R = .32). 

Figure 3 presents the cross-lagged correla- 
tional data for the analysis of the peers’ rank- 
ings of job performance. These data do not 
provide a close fit for the model that indicates 
the attitudes caused the performance. The 
relationship of Correlation 5 to Correlation 6 
suggests that the attitudes caused perform- 
ance, but Correlations 1 and 2 do not fall in 
between these two as they would be expected 
to if this analysis were to provide strong 
support for the view that the attitudes caused 
the performance. The multiple correlation 
computed between the changes in the attitude 
measures and the changes in the performance 
measures was not significant although the Beta 
weights were positive (R = .41). 

Figure 4 presents the cross-lagged correla- 
tional data for the analysis of the self ratings 
of job performance. As was true for the peer 
ratings, this analysis is suggestive of the fact 
that the attitudes caused the performance but 
the fact that Correlations 1 and 2 do not fall 
between 5 and 6 prevents the data from per- 
fectly fitting the model of the attitudes caus- 
ing the performance. Nevertheless, the fact 
that the attitudes were better predictors of 
the performance in the future than the per- 
formance was of the attitudes in the future is 
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Fic. 3. Cross-lagged correlational analysis for mea- 
sures of expectancy attitudes and peers’ rankings of 
job performance (* p< 05; ** p< 01). 
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suggestive of the fact that the performance 
caused the attitudes. The dynamic correlation 
between the changes in the expectancy at- 
titudes and the changes in the performance 
measure was significant (R= .54, p< .01), 
and as was expected the Beta weights were 
positive. 


DISCUSSION 


The results of the study are generally sup- 
portive of the view that expectancy attitudes 
can best be thought of as causing performance. 
The cross-lagged evidence would seem to rule 
out the possibility that the frequently found 
relationship between expectancy attitudes and 
performance comes about because the perform- 
ance causes the expectancy attitudes. The fact 
that in all three cross-lagged analyses the 
attitudes at t; predicted the performance at 
te better than the performance at t; predicted 
the attitudes at t. would seem to provide 
good support for this point. Also supportive 
of this view is the fact that the attitude to 
performance correlations were generally higher 
for the predictions of the attitudes into the 
future than they were for the static correla- 
tions between the attitudes and the perform- 
ance at the same point in time. These latter 
results also suggest that one way previous 
studies might have been able to obtain a 
stronger expectancy attitude to performance 
relationship would have been to delay the 
collection of the performance data until some 
time after the attitude data had been collected. 

The data from the dynamic correlations is 
only suggestive of the fact that a third vari- 
able is not responsible for the relationship be- 
tween the attitudes and the performance. Only 
one of the three dynamic correlations was 
significant, yet all three generally took the 
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form that would be expected if changes in the 
attitudes led to changes in the performance 
without a third variable being involved. The 
data from the cross-lagged analysis suggest 
that a third variable does not simultaneously 
cause both the attitudes and the performance, 
but it does not rule out the possibility that a 
third variable first causes the attitudes and 
then the performance, or that the attitudes 
cause a third variable which in turn causes 
the performance. Overall, therefore, the data 
would seem clearly to rule out the possibility 
that the positive relationship found between 
performance and expectancy attitudes comes 
about because the performance causes the 
attitudes and is suggestive of the fact that it 
comes about because the attitudes caused the 
performance. 

Part of the reason that the dynamic cor- 
relations were not higher may well be found 
in the time relationships that may occur when 
expectancy attitudes cause performance. It is 
reasonable to expect that there might be a 
fairly long time lag between changes in ex- 
pectancy attitudes and changes in perform- 
ance measures. This would seem to be par- 
ticularly true where superior and peer rank- 
ings of performance are used. Even where at- 
titude changes almost directly lead to perform- 
ance changes, it might be some time before 
the performance changes could actually be 
expected to show up in the rankings of the 
performance. This would be expected because 
of the time needed to convince the raters 
that a real change has actually taken place 
and because of the time needed for the raters 
actually to become aware of the changes 
Some support for this thinking is provided 
by the fact that the dynamic correlation was 
significant for the self ratings but not for the 
peer and superior rankings, since presumably 
the self ratings would change more quickly. 
If a significant amount of time does in fact 
actually elapse between changes in expectancy 
attitudes and consequent changes in per- 
formance, then dynamic correlations may 
never show strong results where these changes 
are considered. In summary, it may be that 
the failure to find high dynamic correlations 
is not so much due to the fact that a third 
causal variable is involved, but the fact that 
there is a significant time lag built into the 
causal relationship. 
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It is impossible at this time to say exactly 
what the time sequence actually should be 
between the two measurements of the at- 
titudes and performance variables if the causal 
relationship is to be tested most effectively. 
The l-yr. separation chosen in the present 
study was an arbitrary time, hence future re- 
search should consider collecting data with 
differing intervals between the two collec- 
tion points. A logical next step would appear 
to be a study where some data are collected 
with a 6-mo. separation, some are collected 
with a 1-yr. separation, and some are collected 
with a 2-yr. separation. By comparing the 
results obtained for the varying periods of 
separation it should be possible better to 
understand the time sequence that is involved 
in any causal relationship between expectancy 
attitudes and performance and to test for the 
causal relationship better. The correlations 
should be highest at the point in time which 
most closely approximates the actual time 
it takes for A to affect B. 

Overall, the results obtained from the two 
correlational-causal approaches that were used 
are sufficiently interesting to suggest that 
this approach can be useful in the study of 
many other topics where causal priorities are 
in dispute. For example, the relationship be- 
tween satisfaction and performance could be 
studied with this approach as could the rela- 
tionship between participant leadership styles 
and employee involvement. This, of course, 
does not mean that this type of analysis can 
ever replace traditional experiments as the 
best way to establish causalities since experi- 
ments do allow causal questions to be an- 
swered with a higher degree of confidence. 
However, the causal-correlational approaches 
do have some advantages that argue they 
deserve more widespread use than they have 
received. They have some of the advantages 
of the field experiment without many of the 
disadvantages. That is, data from which causal 
inferences can be made are obtained for real 
situations, as in the field experiment, but 
many of the ethical, cost, and feasibility 
problems that go with field experiments are 
not likely to occur. 
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This study investigated the relationship between motivator and hygiene satis- 
faction variables to job involvement. Ss were 96 civil service supervisors who 
completed the Wernimont job-satisfaction scale and a job-involvement measure 
developed by Lodahl and Kejner, The results of the study were that motivator, 
but not hygiene, satisfaction variables correlated with job involvement. In 
addition, total motivator satisfaction scores accounted for considerably more 
variance in overall job satisfaction than did hygiene variables. 


Although job involvement has been related 
to turnover (Bass, 1965), the determinants of 
job involvement have not been investigated. 
It is hypothesized that job satisfaction is one 
such determinant. The emphasis placed on 
job-satisfaction variables has resulted in a 
relative neglect of the job-involvement vari- 
able. This is perhaps due to a lack of con- 
ceptual differentiation between job satisfac- 
tion and job involvement, and to an apparent 
failure to realize that it is possible for some 
persons to be highly satisfied, but not in- 
volved, and for others to be highly involved, 
but not satisfied. Moreover, some sources of 
job satisfaction are probably more likely to 
be related to job involvement than are others. 
It was the purpose of this study to investigate 
the differential relationships between various 
sources of job satisfaction and job involve- 
ment. 


Job Involvement 


The definition of job involvement is com- 
plex. Lodahl and Kejner (1965) consider it 
to be indicative of an individual’s work com- 
mitment. Bass (1965) views it as representa- 
tive of the employee’s ego involvement in his 
job and thus relates it to performance. Bass 
(1965) finds that the following conditions 
lead to the strengthening of job involvement: 
opportunity to make more of the job de- 
cisions, the feeling that one is making an 
important contribution to company success, 

1 This research was supported in part by a United 
States Public Health Service Fellowship No. 1-F1- 


MH-33, 376-01 (BEH-A) from the National Insti- 
tute of Mental Health awarded to the senior author. 
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recognition, achievement, self-determination, 
and freedom to set one’s own work pace. 
March and Simon (1958) suggest that job in- 
volvement may also be related to the decision 
to participate and the decision to produce. 
Among other things, both of these decisions 
are affected by the individual’s perception of 
a variety of alternatives. The person who is 
more job involved may perceive fewer avail- 
able alternatives; he may, therefore, be in- 
clined to participate more actively in the re- 
quired activities of the organization. Katz 
and Kahn (1966) state: First, that job in- 
volvement is a necessary condition if the in- 
dividual is to accept fully the organizational 
demands placed upon him by his member- 
ship in an organization; second, that the de- 
gree of job involvement is related to level of 
aspiration and to the degree of internalization 
of organizational goals; and third, that job in- 
volvement is a moderator variable in the rela- 
tionship between satisfaction and performance, 
for only among involved employees does a 
positive relationship between satisfaction and 
performance become evident. Thus job in- 
volvement can be considered as an important 
measure of organizational effectiveness that 
may be, at least in part, influenced by job 
satisfaction. 


Job Satisfaction and the Two-Factor Theory 


Herzberg, Mausner, and Snyderman (1959) 
formulated the “two-factor theory of job 
satisfaction.” Their research concluded that 
there is one set of conditions the presence 
of which induces increased job satisfaction, 
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but the absence of which does not induce job 
dissatisfaction; and that there is another set 
of conditions, the absence of which leads to 
job dissatisfaction but the presence of which 
does not lead to increased job satisfaction. 
They termed the first set of conditions ‘‘mo- 
tivators” since satisfaction with these con- 
ditions was “conceptualized as actuating ap- 
proach rather than avoidance behavior [Herz- 
berg et al., 1959, p. 114, italics added].” The 
second set were termed “hygienes” since these 
conditions appeared to contribute only to the 
prevention of an “unhealthy psychological 
work environment [Herzberg et al., 1959, 
p. 113].” The motivators are: recognition, 
achievement, advancement, responsibility, and 
the work itself. These variables have also 
been termed intrinsic or work-content vari- 
ables, because they are all presumably derived 
from performance of the job itself. The hy- 
giene conditions are: interpersonal relations 
with peers and with superiors, company policy 
and administration, superior’s technical com- 
petence, working conditions, and job security. 
Hygiene conditions have also been referred to 
as extrinsic or work-context conditions be- 
cause they are all derived from the environ- 
ment surrounding the job. Salary, depending 
upon whether or not it is contingent upon 
performance, may be either a motivator or a 
hygiene factor. 


Hypothesis 


In this study job involvement was con- 
sidered to be a quasi-indicator of motivation. 
Therefore, it appeared reasonable to test the 
Herzberg et al. (1959) generalization using 
job involvement as the dependent variable. 
The specific hypothesis which was formulated 
for this research is as follows: Satisfaction 
with motivator sources will be related to in- 
creased job involvement, but satisfaction with 
hygiene sources will be unrelated to job in- 
volvement. 


METHOD 
Organization and Subjects 


The Ss were supervisors of a state (civil service) 
department normally employing about 5,000 persons. 
They ranged in rank from first-line supervisors 
(G 17) to the highest nonappointive position in the 
department, the executive director (G 38). 

One hundred seven persons initially took part in 
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the study. However, all persons did not complete 
every one of the instruments, and some, although 
completing each instrument, left some items blank. 
Female employees were removed from the sample. 
The final sample consisted of 96 male supervisors 
who were between 40 and 62 yr. of age. Their 
salaries ranged from $7,700 to $17,300. They super- 
vised from 0 to 18 persons (some were supervisors 
in name only). The Ss had a range of 8-35 yr. of 
service. The educational level of the supervisors was 
fairly high, and ranged from 12 to 18 yr. of formal 
education. Most of the supervisors had a background 
in accounting or finance. 


Administration Procedures and Instruments 


A questionnaire was distributed and each partici- 
pant was allowed about 1 wk. to complete it. The 
questionnaires were collected at group sessions dur- 
ing which research instruments for another study 
were being administered. All precautions to preserve 
anonymity were carefully observed. 

Wernimont Job Satisfaction Scale.2 Wernimont’s 
(1966) scale, which was devised to measure present 
satisfaction according to a model constructed by 
Herzberg et al. (1959), was used in this study. For 
example, satisfaction with security is measured by 
the following set of statements: 


— I feel happy with the amount of security I 
have on my job. (5) 

—— I feel somewhat happy with the amount of 
security IJ have on my job. (4) 

—— I feel that the amount of security on my job 
is okay. (3) 

— I am not too happy with the amount cf 
security I have on my job. (2) 

— I am not happy with the amount of security 
I have on my job. (1) 


(The numbers in the parentheses following each 
item indicate the scale value of that item.) 

The respondent is instructed to check two state- 
ments from each variable set which best describe his 
present satisfaction. The satisfaction score for each 
source is the sum of the two items selected. The 
higher the score, the greater the degree of satis- 
faction. 

Similar sets of statements are used to measure 
motivator satisfaction (recognition, achievement, 
work itself, advancement, and responsibility) and 
hygiene satisfaction (department policy, technical 
competence-superior, interpersonal relations—superior, 
working conditions, interpersonal relations—peers, 
and, here, satisfaction with pay). 

One additional item asks the respondent to indi- 
cate his present overall job satisfaction on a 9-point 
scale (9 being most satisfied). 

Job Involvement (JI) Scale. The scale used to 
measure job involvement was initially developed by 
Lodahl and Kejner (1965). Originally, a 40-item 
scale was developed; this was then reduced to 20 


2This scale is copyrighted by P. F. Wernimont 
and was used with his kind permission. 
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items “by considering the item-total correlations, 
the communality of an item, and the factorial clarity 
of the item [Lodahl & Kejner, 1965, p. 28].” Lodahl 
and Kejner established the split-half reliability and 
the concurrent validity of this 20-item scale by ad- 
ministering it to three occupational groups: nurses, 
engineers, and students. Construct validity was estab- 
lished by relating job-involvement score to Ghiselli 
Self-Description Index, Smith’s Job Description In- 
dex, and the Ohio State Leader Behavior Description 
Questionnaire scores within the various groups of 
Ss employed (Lodahl & Kejner, 1965, p. 31). 

In order to develop a shorter JI scale for use when 
time and space are limited, Lodahl and Kejner se- 
lected six items from the 20-item scale on the basis 
of a principal components analysis of the data from 
the engineering and nursing samples. The split-half 
correlation of this scale was reported to be .57 and 
its reliability was estimated to be .73. The correla- 
tion between this scale and the 20-item scale was re- 
ported to be .87. The shorter version of the JI 
scale was used in this study. 

Of the six items in the short version of the JI, the 
first five are scored in a positive direction, from 1 
(strongly agree) to 4 (strongly disagree). The last 
item is scored in reverse. A higher total score indi- 
cates lower involvement. Although this scoring ap- 
pears to indicate job alienation rather than job in- 
volvement, it resulted from the method used in the 
construction of the original JI scale (Lodahl & 
Kejner, 1965, p. 27). At that time, items were put 
into a Likert-type format and assigned scores of 1 
to indicate “strong agreement” and 4 to indicate 
“strong disagreement” with scores of 2 and 3 indi- 
cating intermediate positions. As the item selection 
process which led to the construction of the shorter 
6-item scale was continued, the original scoring direc- 
tion was maintained. The items comprising the JI 
scale are listed below: 


1. The major satisfactions in my life come from 
my work. 

. The most important things that happen to me 
involve my work. 


nD 


3. I’m really a perfectionist about my work. 
4. I live, eat, and breathe my job. 
5. I am very much involved personally in my 
work. 
6. Most things in life are more important than 
work, 
Method of Analysis 


Responses to the items on the two instruments 
were scored, and correlations were computed between 
scores for each of the sources of job satisfaction and 
job involvement. Totals were also computed for the 
motivator and hygiene sources, respectively, by add- 
ing scores for the sources in each category. Correla- 
tions between these two totals and job-involvement 
score were also calculated. Finally, the correlations 
between the satisfaction totals and overall satisfac- 
tion, and between overall satisfaction and job in- 
volvement were also computed. 
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Table 1 shows the correlations of the five 
motivator variables and the total motivator 
score with the JI scale score. Table 2 shows 
the correlations of the seven hygiene variables 
and the hygiene totals with the JI scale score. 

Table 1 shows that job involvement is re- 
lated to satisfaction with recognition, achieve- 
ment, and responsibility, although in each 
instance the percentage of total variance in 
job involvement accounted for is relatively 
small. The relationships between job  in- 
volvement and satisfaction with work itself, 
and between job involvement and advance- 
ment, are not quite significant at the .05 level. 
These results confirm the hypothesis: Job in- 
volvement increases with increasing levels of 
satisfaction with motivator variables. 

Table 2 shows that, as predicted, job in- 
volvement is unrelated to all but one of the 
hygiene variables: satisfaction with inter- 
personal relations with the superior. This 
finding lends further support to our hy- 
pothesis since it indicates that, in general, 
satisfaction with hygiene variables seems to 
be unrelated to job involvement. 

Tables 1 and 2 also show that, as predicted, 
the motivator total satisfaction score cor- 
relates significantly (—.30) with job involve- 
ment. However, as expected, the correlation 
between the hygiene total satisfaction score 
and job involvement is not significant. Al- 
though the difference between these two cor- 
relations, and between the percentage of vari- 
ance explained in each case, is not great it 


TABLE 1 


CORRELATIONS OF Motivator VARIABLES AND 
ToraL with JoB INVOLVEMENT 





Correlation with job 


SHE AaIae 
Satisfaction variable Pee eane 





Recognition (a) —.25* 
Achievement (b) —.21* 
Work Itself (c) —.19 
Advancement (d) —.19 
Responsibility (e) —.27** 
Motivator total 
(Z scores a+b+...-+e) — .30** 


Note.—N = 92; 4 Ss did not respond to the JI scale. 
*p <.05, df = 90. 
** D> < 01, df = 90. 











appear to be related to satisfaction with 
motivator variables. It therefore appears that 
the distinction between motivator and hy- 
giene variables introduced by Herzberg et al. 
(1959) can be useful in predicting the job 
involvement of civil service supervisors. At 
the same time, two motivator variables, satis- 
faction with work itself and satisfaction with 
advancement, were not significantly related to 
job involvement. Advancement in the civil 
service, however, is based largely upon sen- 
iority and performance on competitive ex- 
aminations. It therefore seems unlikely that 
this variable would necessarily function as a 


In addition to the findings which supported 
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TABLE 2 
CORRELATIONS OF HYGIENE VARIABLES AND 
ToTaL witn JoB INVOLVEMENT 
Satisfaction variable Correlation with job 
involvement 
Security (f) 03 
Salary (g) —.13 
Policy (h) —.10 
Technical Competence 
Superior (i) —.15 
Interpersonal Relations— 
Superior (j) = eur 
Working Conditions (k) —.16 
Interpersonal Relations— 
Peers (1 04 : 
sg motivator. 
Hygiene total 
(Z scores f+g+...-+) —.18 








Note.—N = 92; 4 Ss did not respond to the JI scale. 
*p <.05, df = 90. 


does lend further support to the hypothesis 
of this study. 

Table 3 presents the correlation of the 
motivator and hygiene totals with overall 
satisfaction and also indicates the percentage 
of total satisfaction variance accounted for. 

Table 3 shows that the total motivator 
satisfaction and the total hygiene satisfac- 
tion scores correlate significantly with overall 
satisfaction scores. There is a relatively large 
difference in the percentage of total variance 
in satisfaction accounted for by each of these 
variables. 


DISCUSSION AND CONCLUSION 
Although the magnitude of the relationships 
was not large, increased job involvement did 


TABLE 3 


CORRELATIONS OF MotTiIvATOR AND HYGIENE 
SATISFACTION TOTALS WITH OVERALL 
SATISFACTION 








Correlation with 


Total overall % of variance 
satisfaction 

Motivator 
(2 scores on motiva- 

tors) 10" 49 
Hygiene 
(2 scores on 

hygienes) .60* 36 





Note.—N = 96. 
*% <.01, df = 90. 


the initial hypothesis, the results (see Table 
3) concerning the relationship between overall 
job satisfaction and the total motivator and 
the total hygiene satisfactions seem to shed 
further light on the controversy surrounding 
the Herzberg et al. (1959) model of job satis- 
faction. The extent of this controversy is il- 
lustrated by two recent reviews of research 
relevant to the two-factor theory. In one such 
review, House and Wigdor (1967) reached 
the conclusion that the two-factor theory 
basically has no merit. In another review, 
Whitsett and Winslow (1967) came to the 
opposite conclusion and indicated that this 
approach to understanding job satisfaction 
has a good deal of merit. The majority of 
studies critical of the two-factor theory 
reached the consensus that both hygiene and 
motivator variables contribute linearly to 
overall job satisfaction. In the present study, 
this conclusion was also borne out, since total 
hygiene satisfaction and total motivator satis- 
faction both correlated significantly with over- 
all satisfaction. However, total motivator 
satisfaction accounted for 49% of the variance 
in overall satisfaction, while total hygiene 
satisfaction accounted for only 36% of this 
variance. Although this study was not de- 
signed to test that hypothesis fully, the 
present findings seem to lend some support to 
the two-factor model of Herzberg et al. 
(1959). The major focus of this study was 
on the consequences of motivator and hygiene 
satisfactions for job involvement, and here, 
also, the results were supportive of the mo- 
tivator-hygiene model. 
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TRAINING FOR VIGILANCE: 


COMBINED CUEING AND KNOWLEDGE OF RESULTS! 


EARL L. WIENER anp DENNIS A. ATTWOOD 2 
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To test transfer of training in a visual monitoring task, 44 Ss were given a 
48-min. training session in 1 of 4 conditions forming a2 X 2 factorial design. The 
4 training conditions were knowledge of results (KR), cueing (also known as 
prompting), KR and cueing, and a control group receiving neither training aid. 
7 days later all Ss performed the same task with no training aids. Results 
showed that the KR-trained groups detected significantly more signals during 
transfer, but the cueing groups showed no significant differences. The group 
trained with the combination of KR and cueing did no better than the KR-only 
group. With respect to commissive errors (false alarms), there was no sig- 
nificant difference due to KR treatments, but the groups receiving cueing during 
training made significantly fewer during transfer. All 4 groups showed a 


decline in detection performance over time periods in the transfer session. 


In an effort to elevate detection perform- 
ance in vigilance tasks, several experimenters 
have turned to transfer-of-training techniques 
employing immediate feedback, or knowledge 
of results (KR), as a training aid. For a 
variety of monitoring tasks, it has been shown 
that operators trained with KR continue to 
outperform those trained without it (NKR) 
in later sessions when the KR has been re- 
moved (Adams & Humes, 1963; Hardesty, 
Trumbo, & Bevan, 1963; Mackworth, 1964; 
Wiener, 1963, 1967). Transfer of training in 
monitoring has been reviewed by Wiener 
(1967). 

Recently another training aid has made a 
bid for attention: This method is known 
variously as guidance, prompting, or cueing. 
While KR is response contingent and occurs 
after a signal, missed signal, or false alarm, 
cueing takes place before the onset of the 
signal, alerting the operator that it is about 
to appear on his display. The leading pro- 
ponent of cueing in vigilance tasks is Annett, 
who states that cueing is superior to KR as 
a training device. Annett and Clarkson 


1This investigation was supported in whole by 
United States Public Health Service Research Grant 
No. UI00014 from the National Center for Urban 
and Industrial Wealth. The authors gratefully 
acknowledge the assistance of Mary Dutcher and the 
staff of the Department of Laboratories of the 
Engineering School, University of Miami. 

2 Now at Defense Research Establishment, Toronto. 
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(1964) found superior transfer in the cueing 
group compared to the KR group, but Annett 
and Paterson (1966) found no differences. 
Annett (1966, Experiment III) also reports 
cueing superior to KR in the transfer session; 
however, in that study the KR was not im- 
mediate feedback, but summary information 
provided every 5 min. In a recent review of 
five of their experiments, Annett and Pater- 
son (1967) conclude that both KR and cue- 
ing increase detections, but that KR also 
tends to increase false reports. Finally, Col- 
quhoun (1966) attempted to compare the 
transfer effects of various cueing and KR 
training schemes, but failed to find any differ- 
ences between the experimental and control 
groups. Training with cueing and KR has 
been reviewed by Aiken and Lau (1967) and 
Lau (1966), who conclude that the benefits of 
cueing as a training device may have been un- 
derestimated, relative to KR, due to the 
prominence of the law of effect in learning 
theory. 

This experiment sought to investigate the 
relative merits of cueing and KR in achieving 
successful transfer of training in visual moni- 
toring, and particularly to examine whether 
any advantage could be obtained by combin- 
ing the two. The experiment is essentially a 
2 X 2 factorial design, consisting of the four- 
fold combinations of KR and NKR, cueing 
and no cueing. 
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MeEtTHOD 
Subjects 


The Ss were 44 undergraduate students in ele- 
mentary psychology courses, 28 male and 16 female. 
They were recruited for the experiment with the 
offer of an hour’s credit toward their experimental 
requirement or $1.50 for each of the two sessions. 
The Ss were randomly assigned to groups with the 
exception that the male:female ratio was kept 
balanced. The groups are designated by their training 
conditions: KR (KR, no cueing), Q (cueing, no 
KR), KR+Q (both training aids), and N (neither 
training aid). 


Apparatus 


The task was an adaptation of one first used by 
Jenkins (1958), in which a voltmeter needle made 50 
rightward deflections of 20°/min, the needle deflect- 
ing rapidly and returning to its resting position 
before the next deflection. The signal was a larger 
(30°) deflection. The meter faceplate was painted 
flat black and the needle white, and Ss viewed the 
meter on small closed-circuit television monitors in a 
room separated from the apparatus. They responded 
to signals by pressing a hand-held cord switch. The 
deflections, both background and signal, and cueing 
were scheduled on punched paper tape, stepped 
through a reader at 50 characters per minute. The 
signal schedule consisted of 32 signals occurring ran- 
domly during a 48-min. vigil, with the restriction 
that eight signals occurred during each 12-min. 
period, and the minimum intersignal interval was .3 
min. KR was provided automatically by a logic 
circuit which determined whether any response was 
a detection during the allowable time (2.5 sec.) or a 
commissive error. This information was displayed on 
a small lightbox located over the television sets. De- 
tections resulted in a green light, commissive errors 
in red, and nondetections were indicated by an 
amber light which remained on for 5 sec. after the 
allowed response period. The box is similar to that 
shown in a previous study (Wiener, 1963, Figure 1). 
The KR box was removed from the booth during 
training for Group Q and Group N and during all 
transfer sessions. All Ss wore earphones which played 
white noise from a Grason-Stadler Model 455C gen- 
erator, to achieve auditory isolation. 

Cueing was provided in the following manner: 3.6 
sec. prior to a signal, the white noise was interrupted 
and a loud 1000-cps tone was presented in its place 
for 2.7 sec. The tone ended just prior to the signal 
deflection. The group receiving both cueing and KR 
(KR +Q) was simply provided with both the audi- 
tory cueing and the visual KR, and since they 
were separated by the signal and a 2.5-sec. response 
period, there was obviously no conflict or confusion 
between the two. 


Procedure 


The Ss received initial instruction on the task 
and special instruction on KR and cueing, as ap- 
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propriate to their group. The Ss who did not receive 
cueing, KR, or both were not aware of its existence 
in other groups. Following the instruction and dem- 
onstration period, a standard 48-min. training session 
was run. The Ss were reminded to return at the 
same time a week later, and on their return all Ss 
performed another vigil with neither cueing nor 
KR. The Ss in the groups that had received one (or 
both) of the training aids were simply told that it 
would not be present for this session. The signal 
schedule for the transfer session was the same as 
the previous session, but run in reverse, provid- 
ing the same intersignal intervals with a different 
order. The data collected were detections and com- 
missive errors (false alarms) by 12-min. periods. 


RESULTS 


The data collected consisted of the number 
of signals detected by each S during each of 
the four 12-min. periods, and commissive 
errors by periods. A commissive error was 
any response not preceded (within 2.5 sec.) 
by the presentation of a signal. 


Detections 


The detection data by groups and periods 
for the two sessions are shown in Figure 1 in 
terms of percentage of signals detected. In 
the transfer session all Ss were run under con- 
ditions of no feedback and no cueing (identi- 
cal to Group N in the training sessions), but 
are labeled according to the conditions under 
which they were trained a week earlier. The 
detection data for the two sessions were 
analyzed separately as two partially hier- 
archal analyses of variance, with Ss nested 
in one of the four KR-Cueing treatments, but 
common to the four time periods during a 
session (Winer, 1962). The data were first 
subjected to Cochran’s test for homogeneity 
of variance, and in neither analysis was the 
null hypothesis of equal variances rejected. 
The analysis of variance for the training ses- 
sion is shown in Table 1, and for the trans- 
fer session in Table 2. 

Table 1 indicates that both training vari- 
ables affected detection rate. Of course, one 
should place little importance on the signifi- 
cance of the cueing variable during the train- 
ing session, as there is no reason why the cue- 
ing groups should have anything less than 
nearly perfect performance. The difference 
between Group Q and Group KR + Q was not 
statistically significant, according to a Duncan 
multiple-range test applied to the pair-wise 
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Fic, 1. Percentage of signals detected during training and transfer sessions as a function 
of training conditions and time periods. 


differences between the four training condi- 
tions, but all other pair-wise differences were 
statistically significant. The fact that the KR 
main effect was statistically significant, as 
was the pair-wise difference between Group 
KR and Group N (p< .05), substantiates 
the familiar finding that KR produces su- 
perior monitoring performance. The signifi- 
cance of the KR x Periods interaction fur- 
ther supports previous contentions (Wiener, 


TABLE 1 


ANALYSIS OF VARIANCE OF SIGNALS DETECTED 
DURING TRAINING SESSION 


1963) that KR groups often show no decline, 
and often even an increment in performance 
at the same time that the control group shows 
the familiar vigilance decrement. 

The analysis of variance of the transfer ses- 
sion clearly indicates the transferability of 
differences produced by training with knowl- 
edge of results, while the differences produced 
by providing cueing in the training session, 
extreme as they were, failed to transfer after 


TABLE 2 


ANALYSIS OF VARIANCE OF SIGNALS DETECTED 
DURING TRANSFER SESSION 





Source df MS F 

Between Ss 43 
KR» (A) 1 41.05 5.16* 
Cueing (B) 1 330.01 41. 45*** 
AXB 1 4.78 — 
Errorp 40 7.96 

Within Ss 132 
Periods (C) 3 2.07 1°55 
AXC 3 6.57 A oe 
BXC 3 2.85 1.86 
AXBXC 3 S57 2.33 
Errory 120 1253 

Total 175 


a Knowledge of results. 
* > < .05. 


Source 
Between Ss 
KR» (A) 
Cueing (B) 
AXB 
Errorp 
Within Ss 
Periods (C) 
AGC 
OE 3 
AO XG Beale 3 
Errory 120 
Total 175 





8 Knowledge of results. 
Rpe OZ. 
FD < 001. 
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a 7-day intervening period. Figure 1 indicates 
that cueing did not affect the performance of 
the group during the transfer session, and 
pair-wise range tests show no difference be- 
tween Groups KR and KR +Q, or Groups 
Q and N. The significance of the period’s main 
effect, coupled with the nonsignificance of 
interactions of the training variables with 
periods, points out that training techniques 
affect the overall performance level of the 
groups after transfer, but fail to prevent vigi- 
lance decrement. 


Commissive Errors 


Since commissive errors characteristically 
show an extreme positive skewness, the au- 
thors have adopted the practice of analyzing 
between-group differences of the number of 
Ss above or below the overall median number 
of commissive errors (see Wiener, 1967). 
Thus, the total number of commissive errors 
for each S was computed, and the median of 
these for the four groups combined was found 
to be 0.0 for the training session and 1.0 for 
the transfer session. The between-groups con- 
trasts were then analyzed by forming a con- 
tingency table with groups as one dimension 
and the frequency of Ss above and below the 
median as the other dimension. The Ss mak- 
ing exactly the median number of commissive 
errors were evenly divided into the frequency 
categories. The data were then analyzed us- 
ing the chi-square test for rom and column 
independence. 

Since commissive error rates are meaning- 
less while cueing is being provided to Ss, 


TABLE 3 


ToTAL CoMMISSIVE ERRORS IN TRANSFER SESSION 


Group® 
Period Total 
KR 
KR N 
+Q 2 
1 4 6 1 10 21 
2 3 10 7 7 27 
3 2 5 4 6 27 
4 1 10 6 11 28 
Total 10 41 18 34 103 


* Abbreviations: KR = knowledge of results, no cueing; 
Q =cueing, no KR; KR +Q = both training aids; N = 
neither training aid. 
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TABLE 4 


NuMBER OF SuBJECTs MAKING LEss THAN, EQuat 70, 
AND GREATER THAN THE OVERALL MEDIAN 
ComMIssIvVE Errors 














Group* 
Errors 
KR+Q KR Q N 
0 7 2, 5 2 
1 2 4 3 4 
>1 2 5 3 5 
Total 11 11 11 11 





* Abbreviations: KR = knowledge of results, no cueing; 
Q =cueing, no KR; KR +Q = both training aids; N = 
neither training aid. 


in the training session only the KR and N 
groups were tested. Of the 22 Ss receiving 
cueing, 19 made no commissive errors, and the 
remaining 3 made a total of 10. Con- 
sidering only the KR and N groups, the 
overall median was three commissive errors, 
and the analysis of the contingency table re- 
vealed a nonsignificant difference (y? = 2.2, 
df =1), indicating that KR did not affect 
the false-alarm rate. 

The raw commissive error data for the 
transfer session are shown in terms of total 
commissive errors by groups and time periods 
in Table 3, and by numbers of Ss making 
less than, equal to, or greater than the median 
in Table 4. Median tests revealed that over- 
all the four groups were not significantly dif- 
ferent (x? = 4.25, df = 3), nor was the con- 
trast of the two KR groups versus the two 
non-KR groups (x?=0.21, df=1). The 
analysis of the two cueing versus the two non- 
cueing groups indicated a significant differ- 
ence (y? = 3.84, df = 1), with the combined 
cueing groups containing more Ss below the 
median number of false alarms. 


DIscussION 


The results of this study are further evi- 
dence that the most effective way to train a 
monitor for a monotonous watchkeeping job 
is to train him with immediate knowledge of 
results, and lend no support to the conten- 
tions of Annett and his collaborators that 
cueing, or prompting, is more effective. In 
fact, for this task, the authors were unable 
to find that cueing was effective at all, and 
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combining cueing with KR offered no im- 
provement over the straight KR _ training 
method. To be sure, there are great differences 
between the auditory discrimination tasks 
used by Annett and his collaborators and the 
simple visual task employed here, and on a 
purely speculative basis the authors would 
agree with Lau (1966) that cueing may be- 
come more effective as the perceptual com- 
plexity of the task increases. In a very sim- 
ple one-dimensional monitoring task, there is 
so little to be learned about the nature of the 
signal itself that the benefits of cueing, per- 
haps what Annett and Paterson (1967) view 
as providing the S with “an image or template 
against which further instances can be as- 
sessed,” are minimized. They also regard KR 
not as a reinforcer, but as an inefficient form 
of retroactive cueing. 

However, for a simple and monotonous 
monitoring task, the authors prefer to inter- 
pret the benefits due to KR in traditional law 
of effect terms, stressing more the motiva- 
tional than the information component of 
KR. In very simple terms, S learns in the 
training session to maintain a higher level of 
alertness due to the reinforcing properties of 
KR, and he carries this tendency to the 
transfer session. The S may indeed obtain 
some retroactive cueing from the KR and 
learn something about the nature of the 
signal, but this should be viewed as a 
bonus dividend of KR and not its primary 
payoff. There is a strong possibility that S 
actually learns little or nothing as long as 
cueing is being provided, either because he 
may become “cue dependent” and pay little 
attention to the character of the signal itself, 
or because he may actually regard cueing as 
annoying. Some Ss in this study volunteered 
unfavorable comments about cueing, describ- 
ing it as irritating and “taking the challenge 
out” of the training session. 

In summary, with the exception of the 
finding that the cueing groups produce sig- 
nificantly fewer commissive errors than the 
noncueing groups (as also reported by Annett 
& Paterson, 1967), the results of the present 
study, and of Colquhoun (1966), offer no sup- 
port for cueing as a training aid in watch- 
keeping. Annett’s strongest case is based not 
around detections, but the ability of cueing 
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to reduce commissive errors (which the 
authors confirm), and the tendency of KR to 
encourage commissive errors (which the au- 
thors do not confirm, in this or in a number 
of other experiments of this type). One should 
recognize that the method in which Ss were 
prompted in this experiment was quite arbi- 
trary, and an entire array of questions on 
the most effective way of presenting the 
prompt remains to be answered. These ques- 
tions involve such things as the length of the 
prompt, the sense modality, the duration be- 
tween the end of the prompt and the onset of 
the signal, and a host of other hardware and 
software considerations. But based on the 
evidence to date, the authors are far more 
impressed by the potential of KR than any 
other training aids, as a practical solution to 
the many problems of man at watch. The 
authors will be eager to see if Aiken and Lau’s 
prophecy, that in the future there will be as 
much research on cueing as there has already 
been on KR, will come to pass. 

Finally, Swets and his collaborators (Swets, 
Harris, McElroy, & Rudloe, 1964; Swets, 
Millman, Fletcher, & Green, 1962) have ex- 
amined the efficacy of computer-aided instruc- 
tion in learning auditory identification, and 
Weisz and McElroy (1964) have found su- 
perior performance in visual identifications in 
a group combining cueing and KR. Noted 
recently is Mackie and Christensen’s (1967) 
lament over the lack of practical application 
(to training) of the findings of experimental 
psychologists, and their call for “learning 
engineers” to tackle the problem. It appears 
that the monitoring task is ripe for the imple- 
mentation of new technologies, and that the 
tools, ranging from hardware such as com- 
puter terminals and teaching machines, to 
software such as cueing, KR, signal schedules, 
and various tangible rewards, are readily at 
hand. 
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FACTORS IN THE DETERMINATION OF THE 
SUCCESS OF MANAGERS 
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Measures of the traits of intelligence, supervisory ability, and self-assurance, 
together with measures of the desires for job security, high financial reward, 
self-actualization, and power were obtained on 271 managers. In order to ascer- 
tain whether the motivational factors operated as moderators, on each of the 
motivational factors the managers were divided into those for whom the moti- 
vation was strong and those for whom it was weak, and separately for each 
group the relationship between scores on the 3 trait measures and job success 
was determined. It was found that in % of the instances the relationships 
between the trait measures and managerial success were significantly moderated 
by the measures of motivation thus indicating that there are interactive effects 
between traits and motivational factors in the determination of the success of 


managers. 


Measures of various sorts of traits, both 
traits of ability and traits of personality, have 
been found to be related to the degree of suc- 
cess attained in management positions. Vari- 
ous sorts of motivational factors, too, have 
been shown to play a part in determining 
managers’ performance. While it seems an ob- 
vious point, the fact of interaction between 
traits on the one hand and motivational fac- 
tors on the other has not been given much 
attention. Even though it is possible to ac- 
count reasonably well for variation in the job 
proficiency of managers in terms of the joint 
effects of traits and motivations in simple 
additive terms, because of the dynamic char- 
acter of motivation it would appear likely that 
some model other than a linear combina- 
tion of variables would be much more effec- 
tive. 

Unfortunately the term interaction is not 
easily defined, though obviously it implies 
that the joint effects of two or more variables 
are not of some simple additive sort. Cer- 
tainly different sorts of methodologies reveal 
different kinds of interactive effects. The in- 
teractive effects disclosed by analysis of vari- 
ance give one sort of result, and the multipli- 
cative rather than the additive combination of 
scores on two or more variables provides still a 
different indication. Recently there has been 
interest in the interactive effects as shown by 
moderator variables. Each of the three ap- 


proaches provides pertinent information, but 
at the same time it cannot be said that any 
one is more effective than any other in indi- 
cating interactions. Likely they all show dif- 
ferent sorts of departures from simple linear 
combinations of variables. 

A proposition that generally is taken as 
basic in psychological measurement holds that 
errors of measurement and errors of predic- 
tion are completely random. It follows from 
this proposition that there are no reliable 
differences among individuals in the extent of 
these errors, and hence any apparent individ- 
ual differences in such errors necessarily must 
be unrelated to all other variables. However, 
it has become increasingly apparent that this 
state of affairs is by no means invariably the 
case, for in numerous instances it has been 
found that the degree of relationship between 
two variables, an indication of the magnitude 
of the errors of measurement and of predic- 
tion, is in fact a function of a third variable 
(Ghiselli, 1963a). That is, when individuals 
are sorted out into subgroups on the basis of 
the scores they earn on one variable, the de- 
gree of relationship between the scores they 
earn on two other variables varies system- 
atically over the subgroups. A variable which 
operates in this fashion, interacting with two 
other variables in determining the degree of 
relationship between them for different indi- 
viduals, is termed a moderator. 
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Success oF MANAGERS 


There has been relatively little attention 
devoted to the nature of the variables which 
moderate the relationship between pairs of 
other variables. Intuitively, at least, some psy- 
chological properties seem to be rather 
“static,” whereas others give the impression 
of being “dynamic.” Thus traits such as the 
capacity to solve problems in syllogistic rea- 
soning or in arithmetic tend to be thought of 
as static traits, whereas motivational factors 
such as the desire to express one’s own indi- 
viduality or to dominate others are likely to 
be considered dynamic. Again intuitively, it 
seems probable that variables of the first sort 
would combine in simple additive ways, but 
that when variables of the latter sort are 
combined interactive effects might well be 
found. If this be the case, then it would be 
expected that measures of motivational fac- 
tors would moderate the predictive power of 
measures of traits. 

The traits of intelligence, supervisory abil- 
ity, and self-assurance seem to stand up as 
having significance to performance in sundry 
managerial positions (Ghiselli, 1963b). The 
relationships between scores on measures 
of these traits and success in mana- 
gerial positions are modest, but none- 
theless seem to be consistently positive. It 
has also been found that three sorts of moti- 
vational factors, the desire for job security, 
for high financial reward, and for self-actuali- 
zation, also play a part in determining the 
success of managers (Ghiselli, in press). A 


481 


fourth motivational factor, the desire for 
power over others, apparently is unimportant 
in this respect. The purpose of the present in- 
vestigation is to ascertain whether or not these 
four motivational factors moderate the rela- 
tionships between the three trait measures on 
the one hand and the job success of managers 
on the other. That is, for those individuals who 
are strong in a given sort of motivation, is 
the relationship between scores on a trait 
measure and job success different from that 
for those individuals who are weak in that 
motivation? If differences of this sort do oc- 
cur, then it can be said that there are inter- 
action effects between traits and motivational 
factors in the determination of success in 
managerial positions. 


METHOD 


A forced-choice inventory was used which not only 
provides indexes of the traits of intelligence, super- 
visory ability, and self-assurance, but also of the 
motivational factors the desire for job security, for 
high financial reward, for power over others, and for 
self-actualization (Ghiselli, 1963b, in press). This in- 
ventory was administered to 271 persons holding 
middle management positions in a variety of different 
types of business and industrial organizations. For 
each manager a rating of his success in his own 
organizational setting was obtained. These ratings 
were dichotomous, each person was judged as being 
either a more or a less successful manager. Of the 
271 managers, 55% were classified in the upper cate- 
gory and 45% in the lower one. 

On each of the four motivational factors the 
managers were divided into the upper and the lower 
halves as nearly as could be achieved on the basis of 


TABLE 1 


COEFFICIENTS OF CORRELATION BETWEEN THE SCORES EARNED BY MANAGERS ON FIVE 
TrEsts AND THEIR JoB SuccEss FOR SUBGROUPS FOR WHOM EACH OF THE 
Four DIFFERENT MOTIVATIONAL FACTORS ARE STRONG OR WEAK 








Motivational factors 





High financial reward 


Power over others Self-actualization 





Test Job security 
Strong Weak Strong Weak Strong Weak Strong Weak 
Intelligence lid 01 .26 03 .05 E20 17 —.01 
Supervisory ability 24 ESS AL 42 45 39 23 47* 
Self-assurance 48 LOO ss —.02 36** ob) al2® .06 hte 
No. cases 134 137 101 169 154 147 121 150 


* Difference significant at p > .05. 
** Difference significant at p > .01. 
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their scores on the scales measuring those factors. 
Then separately for each of these two subgroups, 
those in whom each given motivation was strong and 
those in whom it was weak, the coefficient of corre- 
lation between scores on each of the three trait mea- 
sures and job success was determined. 


RESULTS 


The findings are shown in Table 1. Of 
the 12 differences between pairs of coefficients 
of correlation, 4 reach statistical significance 
at better than the .01 level of confidence and 
2 at better than the .05 level. The desire for 
job security clearly has moderating effects 
on the relationships between success and 
the traits of supervisory ability and self-assur- 
ance. For those managers who are strongly 
motivated toward job security the relation- 
ship between supervisory ability and success 
is lower than for those in whom that motiva- 
tion is weak. The opposite is the case for the 
trait of self-assurance. The desire for financial 
reward moderates the relationship between 
self-assurance and success, there being no re- 
lationship at all between self-assurance and 
success for those who are strongly motivated 
for such reward, and a fairly substantial re- 
lationship for those in whom such a motiva- 
tion is weak. The desire for power over others 
moderates the relationship between success 
and _ self-assurance, the relationship being 
higher for those managers for whom the de- 
sire for power over others is strong. Finally, 
the desire for self-actualization moderates the 
relationships between managerial success on 
the one hand, and both supervisory ability 
and self-assurance on the other, the relation- 
ships being higher for those for whom the 
desires are strong. 


DISCUSSION 


All of the four types of motivational fac- 
tors function as moderators. The differences 
between pairs of coefficients of correlation are 
somewhat larger for job security than for the 
other three factors, so possibly job security is 
the motivational factor most likely to produce 
interaction between traits and success. The 
desire for power seems to have least im- 
portance as a moderator since for it the differ- 
ences between the pairs of coefficients appear 
to be smallest. Therefore different types of 
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motivational factors differ in their effective- 
ness as moderators. This means that in the 
determination of the success of managers, 
some sorts of motivations interact with traits 
pertinent to performance in managerial posi- 
tions to a greater extent than do others. Fur- 
thermore, some traits seem to be more sus- 
ceptible to interactive effects than do others. 
None of the differences between the pairs of 
coefficients of correlations reaches acceptable 
levels of statistical significance in the case 
of the trait of intelligence, whereas all four 
motivational factors operate as moderators in 
the case of self-assurance. 

In a sense, the desire for security and the 
desire for self-actualization are opposite sorts 
of motivations. Those individuals who seek 
safety in their employment situations would 
be expected to avoid acting in creative ways, 
thereby inviting attention to themselves, 
whereas it is likely that those who seek to 
express their individuality and to use their 
talents to the fullest would not be concerned 
with security. The results of the present in- 
vestigation suggest that with managers who 
eschew either goal, job security or self-actuali- 
zation, supervisory ability plays a much more 
significant part in determining success. Only 
when either of these two sorts of motivational 
factors is relatively weak does the individ- 
ual’s ability to direct the activities of others 
assume a prominent role in determining his 
success in administrative and executive posi- 
tions. 

The four motivational factors dealt with 
in the present investigation can be classified 
into two general sorts. The desires for se- 
curity and for power might be thought of as 
selfish motivations which protect and gratify 
the ego, and the desire for high financial re- 
ward and for self-actualization perhaps could 
be conceived of as arising from individual 
strivings, the one being the reward expected 
for individual action and the other the ex- 
pression of individuality. Looked at this way 
it then appears that when either the selfish 
motivations are strong or the individual ones 
are weak, then self-assurance assumes a para- 
mount role as a determiner of managerial suc- 
cess. That is, the belief in one’s self has a 
positive effect upon manager’s performance 
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only when his goals are selfish and are not an. Gutsetr1, E. E. The validity of management traits 
expression of his individuality. in relation to occupational level. Personnel Psy- 
chology, 1963, 16, 109-113. (b) 


REFERENCES Guiseti1, E. E. Some motivational factors in the 
" ; . success of managers. Personnel Psychology, in 
Guiseti1, E. E. Moderating effects and differential press. 


relativity and validity. Journal of Applied Psy- 
chology, 1963, 47, 81-86. (a) (Received January 19, 1968) 
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TASK SUCCESS, TASK POPULARITY, AND SELF-ESTEEM 
AS INFLUENCES ON TASK LIKING * 


ABRAHAM K. KORMAN 


New York University 


Report of 3 experiments which support the general hypothesis that there is a 
positive relationship between task success and task satisfaction for persons who 
hold themselves in high self-esteem but not for those whose self-esteem is 
low. Implications for a general theory of job satisfaction as stemming from the 
moderator influences of personal and task variables are discussed. 


Considerable interest in recent years has 
focused on the extent to which liking for a 
task or job is dependent on the extent to 
which one is successful in the given situation. 
Thus, while some argue that task success is 
important to job satisfaction but not neces- 
sary to job dissatisfaction (cf. Herzberg, 
Mausner, & Snyderman, 1959), others have 
claimed that such a distinction is too limited 
and that it is vital for both job satisfaction 
and dissatisfaction (cf. Dunnette, Campbell, 
& Hakel, 1967). Whichever theoretical ap- 
proach one adopts, and the evidence tends to 
support the latter somewhat more strongly 
than the former (cf. Dunnette et al., 1967), 
it would seem apparent that increasing in- 
terest is being taken in achievement as a pos- 
sible key determinant of the satisfactions one 
may derive from one’s job, particularly since 
experimental evidence continues to be re- 
ported which supports the hypothesis quite 
well (Locke, 1965, 1966). 

Despite such findings, however, some rea- 
son to question the proposed relationship in 
terms of its generality has been indicated. 
For example, in one of the studies (Locke, 
1965) supporting the basic hypothesis it is 
pointed out: 


The author would not predict that there is any 
automatic relationship between task success and task 
satisfaction; rather the relationship should be high- 
est in those cases where Ss are trying hardest to 
succeed and lowest (or nonexistent) in situations 
where they are trying least hard to succeed. If situ- 
ations could be devised where Ss were not trying to 
succeed, the author would predict that the correla- 
tion between success and satisfaction would be 


1Portions of this paper were presented at the 
meeting of the Eastern Psychological Association, 
Washington, D. C., April 1968. 


lower than the correlation for the same task where 
the Ss were trying to succeed [p. 385]. 


Since a number of writers have emphasized 
that the desire not to succeed and not to do 
well is particularly likely to occur both as a 
function of the personality characteristic of 
the individual and of role demands and ex- 
pectations in lower levels of bureaucratic or- 
ganizations (cf. Argyris; 1964; Korman, 
1967), it would seem desirable that this pos- 
sible limitation of the task-success—task-liking 
relationship be tested. Should such a limita- 
tion be established, this would have important 
implications for those wishing to develop an 
adequate theory of those conditions which 
facilitate satisfaction with one’s employment 
situation. 

The major purpose of the experiments to 
be reported in this paper was to test this 
possible limitation to the task-success—task- 
liking relationship within a self-esteem bal- 
ance theoretical hypothesis to be described 
below. In addition, as the experiments pro- 
ceeded, other hypotheses were developed 
which were deemed to be of relevance to the 
understanding of task satisfaction. These will 
be discussed at a later point in the paper. 


‘THEORETICAL FRAMEWORK 


The theoretical framework used in this re- 
search stems from, but is not identical to, 
conceptions that have been offered by Vroom 
(1964) and Lecky (1945), in that it is based 
on a “self-consistency” hypothesis as to satis- 
faction, consistency leading to satisfaction, 
and inconsistency to dissatisfaction. The 
difference between the framework used here 
and their previous proposals lies mainly along 
three dimensions. First, it lays explicit heavy 
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emphasis on self-evaluation and _ self-percep- 
tion as part of the cognitive structure, and 
suggests different sources as to consequent 
variations in self-perceptions. Second, it in- 
corporates within a common framework con- 
ceptions of the self-deriving from past rein- 
forcement history as well as contemporary 
determinants of self-esteem (i.e., experimental 
manipulations and other interpersonal evalu- 
ations). Third, the basic theoretical assump- 
tion from which the derivation tested in this 
paper was taken attempts to predict varia- 
tions on the three major dependent variables 
of industrial psychology: work performance, 
work satisfaction, and work choice, within a 
single framework.” 

Thus, the major assumption of the frame- 
work is as follows: 
All other things being equal, individuals will (a) 
choose, (6) adjust their performance to, and (c) 
find satisfying those behavioral roles which will 


maximize their sense of cognitive balance or con- 
sistency. 


This being the case, then the following deriva- 
tion can be made: 


Individuals will tend to find most satisfying those 
job and task roles which are consistent with their 
self-cognitions. Thus, to the extent that an indi- 
vidual has a self-cognition of himself as a compe- 
tent, need-satisfying individual, then, to that extent, 
he will find most satisfying those situations which 
are in balance with these self-perceptions. 


It is assumed further that one’s self-esteem 
or self-evaluation is the extent to which he 
sees himself as a competent, personal, need- 
fulfilling individual. Differences in such self- 
evaluation may arise, for our purposes, in 
three ways. First, one may conceive of a 
relatively chronic level of self-esteem, that is, 
a relatively persistent personality trait that 
occurs relatively consistently across various 
situations. Second, one may conceive of an 
individual’s self-perceived competence con- 
cerning a particular task or job at hand. This 
may arise as a result of differential learning 
experiences or the specific characteristics of 
the moment. Finally, one’s self-esteem is also 
a function of the expectations which others 
have of us. As Tannenbaum (1962) has 
argued, to the extent that others think that 


2A. K. Korman, A Hypothesis Relating to Work 
Behavior. Unpublished manuscript, 1967. 
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(a) we are competent and need-satisfying and 
able and (/) they exhibit such thoughts by the 
kinds of behavior which they exhibit toward 
us, to that extent our self-perceived compe- 
tence concerning the task at hand is increased 
and so is, we would predict, task performance. 
The rationale for this is that such interper- 
sonal evaluation provides a base of “reality” 
along which we may guide our behavior, as 
well as providing the “reality” which we 
learn from our previous experiences (Deutsch 
& Gerard, 1955). 

From these considerations it was predicted 
in the experiments to be reported below that 
task success and task liking should be posi- 
tively related for high self-esteem and unre- 
lated for low self-esteem. Such a prediction 
was based on what seemed to be the reason- 
able assumption that success in a given situ- 
ation is equivalent to adequacy in that situ- 
ation and hence is in “‘balance” for high-self- 
esteem people, but not for low. In addition, 
as we have indicated, certain additional pre- 
dictions were made in the last two experi- 
ments, with these to be described at that time. 


EXPERIMENT I 
Method 


The tasks involved in this experiment were Sub- 
tests 1 and 2 of the Watson-Glaser Critical Thinking 
Appraisal, Form AM (Watson & Glaser, 1952), and 
12 words from the medium anagrams lists of Tres- 
selt and Mayzner (1966). (The subtests used are 
titled (1) Inference and (2) Recognition of Assump- 
tions.) The Ss were tested in groups of 15-20 and 
were presented the tasks as part of a normative study 
of college students’ opinions about various ways of 
measuring creativity and intellectual ability. “Suc- 
cess” and “Failure” were determined by £ informing 
Ss of “pass-fail” points (based on scores from pre- 
tests) prior to asking them to make their ratings on 
a 5-point scale, but after they had completed the 
task. 

As in previous research by the author (Korman, 
1966, 1967), self-esteem was determined by scores 
on the Self-Assurance scale of the Ghiselli Self- 
Description Inventory,? with the cutoff for “high” 
and “low” being the median on the nationwide 
norms. Since evidence for the construct validity of 
this scale is contained mostly in an unpublished 
manual, Tables 1 and 2, consisting of a series of 
studies by the author of this paper and others, are 
presented as partial evidence for such validity. An 
examination of the relationships which are signifi- 


3Edwin E. Ghiselli, Self-Description Inventory. 
Undated. 
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TABLE 1 


RELATIONSHIPS BETWEEN GHISELLI SELF-ASSURANCE SCALE AND OTHER PSYCHOLOGICAL MEASURES 




















Highs Lows 
Sample Measure SSS eee ee 
N MiNnN| M 
Engineering students | Gough Adjective Check List, Self-Confidence Scale 14 | 52.6 | 20 | 46.6 | .05 
Industrial foremen Miner Sentence Completion Scale (a projective test of 
organizational power orientation) 10 | 4.70 | 12 .60 | .05 
Business students Crites Need for Social Service (Likert-type scale) 355| S17 36") 17, O7minOs 
Business students Crites Need for Job Freedom (Likert-type scale) 35 | 9.60 | 36 | 8.22 | .10 
Industrial foremen Frequency with which parents supervised Ss’ jobs 
and tasks* 15 | 2.93 |. 23 |) 2.30006 
Industrial foremen Frequency with which Ss argued with their parents 
during teens? 15°} 2,73: (23.4! S24 OS 
Liberal arts students | Marlowe-Crowne Social Desirability Scale® 89 
Business students Bass Self-Orientation (forced-choice tetrads) 20 | 27.10 | 15 | 24.06 | .10 














Note.—Division between highs and lows was based on the median of national norms. All tests of significance are two-tailed. 


4 Low score indicates greater frequency. 
b High score indicates greater frequency. 


¢ The author is indebted to Virginia Dunda and Charles Miller for these data; r = .16, ms. 


cant and those which are not, as presented in the 
two tables, seems to offer support for the interpreta- 
tions which are placed on this scale. 

Subjects. The Ss for Task 1 consisted of 35 high- 
self-esteem (HSE) Ss and 36 low-self-esteem (LSE) 
Ss, defined as stated above, while the n’s for Task 2 
were 30 and 55, respectively. (Fifty-one Ss were ad- 
ministered both tasks.) Finally, for Task 3, there 
were 25 HSEs and 42 LSEs. All Ss were introductory 
psychology students fulfilling course requirements. 


Results 


Table 3 summarizes the findings of this 
experiment, providing complete support for 
the hypothesis. There is a significant relation- 
ship between task success and task liking for 
HSE individuals, but no such relationship ex- 
ists for LSE individuals, with this holding 
true for all three tasks. 


Discussion 


The complete support for the hypothesis 
found in Table 3 appears to suggest in a 
fairly conclusive manner, at least for experi- 
mental laboratory tasks, that task achieve- 
ment is important for task satisfaction, but 
only if one is used to task achievement and 
sees it as “proper” and “appropriate” for one- 
self. 

However, the question now arises for the 
LSE individual as to what satisfies him, if 
task achievement does not? The task-liking 
and/or job-liking response occurs for him as 
well as for the HSE individual, but ap- 


parently the determinants are different. Ex- 
periment II was designed to provide a pre- 
liminary answer to this question with its 
major hypothesis being that task liking for 
the LSE individual is primarily a function 
of social influence processes. That is, it 
was predicted that, for LSE individuals, if 
others like the task they would, and if others 
did not they would not; the logic for this 
hypothesis being that we tend to use others 


TABLE 2 


RELATIONSHIPS BETWEEN GHISELLI SELF-ASSURANCE 
SCALE AND RANKING AND RATING VERSIONS OF 
THE STEIN SELF-DESCRIPTION INVENTORY 














Ranking M Rating M 
Need scale |————————_| p ,|———————| 9 
Highs | Lows Highs | Lows 

Abasement 16.5 16.2 ns 3.6 a5 ns 
Achievement 6.2 9.1 .05 2.2 2.6 -05 
Affiliation au 5.9 ns D2 2.1 ns 
Aggression 13.5 15.0 ns Sui 3.8 ns 
Autonomy 12.2 8.3 -O1 oa 2.9 -O1 
Blamavoid 11.9 10.0 ns oak 2.9 ns 
Counteraction 6.9 Lis -O1 2.5 2.9 .05 
Defendence 13.2 PStS ns 3.4 aes ns 
Deference 11.8 12.0 ns B22 3.2 ns 
Dominance 10.8 12.6 ns 2.9 3.0 ns 
Exhibition 11.4 11.1 ns Seo: 3.2 ns 
Harmavoid L320 13.0 ns 3.2 3.0 ns 
Infavoid 1352 17 ns 4.0 3.6 -05 
Nurturence 8.2 50 .O1 2.8 2.4 05 
Order 7.4 9.8 .05 203 2.4 ns 
Play 10.9 8.3 ns 2.9 2.4 -05 
Rejection 9.9 12.0. ns 333) 3S ns 
Sentinence 10.1 9.4 ns Dil 2.6 ns 
ex 6.9 6.3 ns 2.0 2.0 ns 
Succorance 10.1 8.9 ns 2.8 2.6 ns 


Note.—Low mean indicates strong need; N = 69 college 
students. All significance tests are two-tailed. The author is 
indebted to Charles Miller for these data. 
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TABLE 3 


LIKING FOR TASKS AS A FUNCTION OF TASK 
SUCCESS AND SELF-ESTEEM 





High self-esteem Low self-esteem 








Task  |—————_—_|_¢ _|——-—_—_——_| 3 
M|SD\|N M | SD |N 
Inferences 1.95* 49 
Succeed 3,00) 1.2 1 22 S20) 4.2 || 20 
Failure 3.08), 2.0: 43 3.00} .65 | 11 
Recognition 
of assump- 
tions 2.38** 254 
Succeed 4.07 .62 | 14 3.38) | 1.2 29 
Failure aot ed. 16 3.32 .83 | 26 
Anagrams 5.07** 1,20 
Succeed 4.21 .68 | 14 Sao 1S 26 
Failure DO tle Lcas 11 3.00 .82 | 16 
*p <.05. 
p< 01), 


as a guide as to how we should evaluate vari- 
ous phenomena in the environment (including 
ourselves) (cf. Deutsch & Gerard, 1955). It 
was also predicted that within each task 
there would be no relationship between task 
success and task liking. On the other hand, 
for the HSE individual, it was predicted that 
there would be such task-success—task-liking 
relationships for each task (i.e., high “others 
like” and high “others do not like”). The 
prediction for these latter individuals as to 
“social influence effects’ between the two 
tasks was less clear, however. On the one 
hand, there seemed to be no reason to expect 
a lack of a “social influence” effect for HSE. 
Yet, it also appeared logical that the task- 
success—task-liking relationship might mitigate 
this effect somehow. For these reasons, no 
specific prediction was made as to how much 
“social influence” might affect the HSE indi- 
vidual liking for a task, except that it was 
thought that the effect would be somewhat 
less than for the LSE Ss. 


EXPERIMENT II 
Method 


The task involved in this experiment was the same 
anagrams task used in the first experiment, that is, 
12 words from the anagrams list of Tresselt and 
Mayzner (1966). In addition, all other procedures 
were exactly the same with the sole exception that, 
in this case, after the task was completed and the 
individual was informed of his “success” or “failure,” 
he was also told by E that most other students “did” 
like the task or “did not” like the task. After such 
information was presented, Ss were then asked to 


rate their own liking for the task on a 10-point scale. 
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As before, the measure of self-esteem used was 
the Ghiselli Self-Assurance scale. 

Subjects. All Ss were introductory psychology stu- 
dents fulfilling a course requirement. The breakdown 
by self-esteem scores resulted in 57 HSEs and 72 
LSEs, with the division point being the median on 
the national norms. 


Results 


The analysis consisted of an “unweighted 
means” ANOVA (Winer, 1962), analyzing 
each self-esteem group separately, the predic- 
tions being that there would be a pass-fail 
effect for the HSE individual and possibly a 
“social popularity” effect, whereas only the 
latter would be true for the LSEs. The results 
are summarized in Table 4 and indicate com- 
plete support for the moderator influence of 
self-esteem on the task-success—task-liking re- 
lationship. However, support for the differen- 
tial importance of social influence processes 
on liking for a task, when success on the task 
is controlled for, as a function of self-esteem, 
does not occur. In fact, the effect occurs more 
strongly for the HSE group than it does for 
the LSE, a result opposite to the prediction, 
but the effect is not significant for either. 


EXPERIMENT III 
Method 


This third experiment constituted a replication of 
the second experiment with the following exceptions: 

(a) The task used was the Inferences task from 
the Watson-Glaser Critical Thinking Appraisal; 

(6) In addition to £ stating that other students 
liked or disliked the task, an accomplice * was used 
to say that he liked or disliked it and to request of 
the £ that he state his opinion. All three of these 
opinions were confirming of one another and thus, 
hopefully, provided a stronger “popularity” manipu- 
lation effect than in Experiment II (although a 
request for opinions after Experiment II similar to 
the one we will describe below for Experiment III 
indicated that the “popularity” manipulation was 
believed by all Ss). 

(c) Most importantly, self-perceived competence 
for the task (a component of self-esteem as we have 
defined it) was manipulated by £ in the following 
manner. In Treatment A, Ss were presented with the 
task and told that this was the first time this norma- 
tive opinion study was being done at New York 
University. Prior to this time the test had been ad- 
ministered at Ivy League institutions which were 
highly selective in their admissions policies and it 
was from these individuals that we had our knowl- 


*The author is indebted to James Hoon for his 
assistance in this experiment. 
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TABLE 4 
LIKING FOR TASKS AS A FUNCTION OF TASK SucCEsS, SELF-ESTEEM, AND OTHERS’ 
LIKING FOR THE TASK FOR EXPERIMENT IT 
High self-esteem Low self-esteem 

Statis- Succeed Fail Succeed Fail 

tic a | 

t Others do ._ | Others do .. | Others do | Othersda 
Others like Aakilike Others like nat ike Others like aut Vike Others like nothta 

M 7.13 5.83 4.93 3.58 5.95 5.38 6.27 5.00 

SD 2.2 Sel Dik 2.7 Da 2.4 2S 2.8 

N 12 18 15 12 19 21 22 | 10 











Note.—High self-esteem succeed-fail F = 9.53, p < .01; low self-esteem succeed-fail F < 1; high self-esteem others like-do 
not like F = 3.65; low self-esteem others like—do not like F = 2.59. 


edge as to what average performance and liking 
levels were. Now we (the researchers) were moving 
into more “mass” institutions where students were 
of a much wider range in quality and where the 
average student was considerably below the Ivy 
League student in ability. Hence, the researchers did 
not expect that they would get the same levels of 
performance here. 

In Treatment B, the same basic procedure was 
used except that Ss were told that previously the 
researchers had been working on junior and com- 
munity colleges and were now moving up to more 
“quality” institutions where they expected the level 
of performance to be higher. In both this treatment 
and in Treatment A, well-known examples were 
given of “high” and “low” quality institutions with 
which the Ss at the institution were being classified 
in order to solidify further the Es’ evaluation of 
their abilities. 

The effectiveness of the general experimental pro- 
cedure was checked by asking each S to write a 
paragraph indicating his reaction to the whole 
research situation in which he was participating. 
Of 179 Ss, only 1 S indicated that he did not be- 


lieve the normative purposes of the study, while 
only 4 did not believe the fact that the Es worked 
in the universities and/or colleges indicated previ- 
ously. These Ss were, of course, eliminated. Of the 
175 remaining, every paragraph written indicated 
that they took seriously the stated purposes of the 
study as indicated by the E. In addition, their com- 
ments indicated that the “popularity” manipulation 
was accepted as well as in Experiment II. 

Subjects. All Ss were introductory psychology stu- 
dents fulfilling course requirements, with 94 being 
run in the “high” condition and 80 in the “low.” 
These were further broken down, as in the second 
experiment, into high others-like-it and low others- 
like-it, as well as pass-fail, with these breakdowns 
being indicated in Table 5. 


Results 


The analysis performed here used the same 
procedures as in Experiment II, but with 
different results. There is an interaction effect 
for the high group indicating that achieve- 


TABLE 5 


LrikInG For TASKS AS A FUNCTION OF TASK SUCCESS, SELF-ESTEEM, AND OTHERS’ 
LIKING FOR THE TASK FOR EXPERIMENT III 

















High self-esteem Low self-esteem 
Statis- Succeed Fail Succeed Fail 
tic 
.. | Others do : Others do 3 Others do -;_, ||, Othersida 
Others like awenike Others like nok bike Others like nee fies Others like notte 
M 1.53 4.16 5.82 4.58 6.96 3.80 6.19 4.35 
SD 2.0 1.9 Dei LF 1.8 2.0 1.4 2.6 
N 28 37 i 12 30 20 16 14 








Note.—High self-esteem succeed-fail F = 1.81; low self-esteem succeed-fail F = 2.27; high self-esteem others like-do not 
like F = 23.7, p < .01; low self-esteem others like—do not like F = 34.4, p < .01. 
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ment has an effect on satisfaction only for 
the condition where others “like” the task, 
but that there is no effect when others do 
“not like” it. There is an others like/dislike 
the task effect for both groups, but there is no 
pass-fail effect in either case for the low group. 


DIscuSsSION 


Taken in context, the results of these ex- 
periments are highly consistent in supporting 
the general hypotheses advanced in the intro- 
duction to this paper. It seems apparent that 
the meaning of satisfaction and liking varies 
as a function of the self-esteem variable and 
what induces liking for a phenomenon in one 
group will not induce liking in the other. We 
may consider then that the balance hypothe- 
sis advanced here has received support similar 
to that which it has received in other papers 
(cf. Korman, 1966), and that claims as to 
the great significance of task achievement for 
satisfaction are oversimplified at best. For 
some people, it is apparent, task achievement 
is not a source of satisfaction nor is task 
failure a particular source of dissatisfaction. 
To be more specific, task achievement to the 
LSE person is apparently irrelevant to his 
satisfactions. In addition, it also appears as 
though a task must meet at least some stan- 
dards of “social desirability” before achieve- 
ment on it will serve as any kind of satisfac- 
tion at all. People for whom task achievement 
is a source of satisfaction will not feel such 
satisfaction if the task is not worth achieving 
on. 

What, then, does determine the satisfied 
response of the LSE individual, assuming 
his satisfied response is a lawful response? 
Our hypothesis that social influence processes 
may be involved in that perhaps the LSE 
person uses other individuals’ response as a 
guide to his own response in a given situation 
was supported under the “strong” manipula- 
tion, but not the ‘“‘weak.” Since authority (£) 
influence was also involved as well as peer 
evaluation, we have little way of knowing 
whether it was the additive aspects of the 
influence in Experiment III which were im- 
portant or its qualitatively different aspects. 
(The fact that the “popularity” manipulation, 
as measured by postexperimental opinions, 
was equally accepted in both cases only adds 
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to the inconsistency of the findings.) Further 
testing of this hypothesis seems to be neces- 
sary, particularly when it might be moderated 
in its effects by the self-esteem variable, 
something which did not occur here. One pos- 
sibility which might be investigated is that 
the effects of self-esteem might be curvilinear, 
rather than linear, as recently suggested by 
Cox and Bauer (1964). 

What does seem to be clear, however, from 
these results, are the implications they sug- 
gest as to how to increase the feeling of satis- 
faction one gets from task achievement. Ap- 
parently what appears to be necessary is that 
one must first change the individual in order 
that he sees achievement as befitting him and 
being appropriate for him. It is only after 
such personal change has taken place that he 
will be satisfied with achievement on a de- 
sirable task and gain pleasant effect from it. 
How does one introduce this in the organiza- 
tional environment? There are various ways 
of varying one’s feelings of self-esteem and 
self-competence through interpersonal evalu- 
ation, varying control procedures and leader- 
ship behavior which seem to be quite com- 
mon in the organizational environment, as 
Argyris (1964) and Korman (see Footnote 
2), among others, have pointed out, and which 
would seem to be quite germane to this hy- 
pothesis. Hence, it would seem that a pro- 
gram of experimental research using these 
organizationally relevant variables would be 
of considerable value in further testing this 
hypothesis, particularly when task character- 
istics are also varied, a type of change al- 
ready of some interest and concern in the 
modern organization. 
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MULTIVARIATE RELATIONSHIPS BETWEEN A MEASURE OF 
VOCATIONAL INTERESTS AND A MEASURE OF 
VOCATIONAL NEEDS * 


ROBERT M. THORNDIKE, DAVID J. WEISS, anp RENE V. DAWIS 


University of Minnesota 


The Minnesota Vocational Interest Inventory (a measure of the interests of 
nonprofessional men) and the Minnesota Importance Questionnaire (a measure 
of vocational needs) were administered to 505 male vocational rehabilitation 
clients. The group was split into 2 random samples of 246 and 259, and a 
canonical correlation analysis was developed on each group independently. 
Significant canonical correlations of .65, .56, and .52 were obtained for Group 
1, and .62 and .54 for Group 2. The first 2 sets of weights for each group 
were cross-validated on the other group. The Ist set of canonical weights 
from each group yielded identical cross-validation coefficients of .38. Possible 
interpretations of the canonical variates and some theoretical and methodologi- 


cal considerations are discussed. 


The purpose of the present study was to 
test the hypothesis that measured vocational 
interests and vocational needs reflect a com- 
mon domain of psychometric behavior. Various 
attempts to relate interest measures to person- 
ality variables by zero-order or multiple- 
correlation methods have generally yielded 
low to moderate relationships. For example, 
Kohlan (1966) correlated the Strong Voca- 
tional Interest Blank (SVIB) and the Minne- 
sota Importance Questionnaire (MIQ), a 
measure of vocational needs (Weiss, Dawis, 
Lofquist, & England, 1966). In predicting 
each individual SVIB scale from the 20 scales 
of the MIQ, Kohlan obtained multiple Rs 
ranging from .28 to .62 with a median of 
42. The low magnitude of relationship found 
is typical of that appearing generally in the 
literature, leading Thorndike, Weiss, and 
Dawis (1968) to argue for the use of canoni- 
cal correlation for such multivariate prob- 
lems. In reanalyzing Kohlan’s data, Thorn- 
dike et al. (1968) obtained a maximum 
canonical correlation coefficient of .78 be- 
tween the 45 SVIB occupational scales and 
the 20 MIQ scales. This coefficient accounted 
for 62% of the variance of the canonical 


1 This study was supported, in part, by Research 
Grant RD-1613-G from the Vocational Rehabilita- 
tion Administration, United States Department of 
Health, Education and Welfare. Based on a paper 
presented at the meeting of the Midwestern Psy- 
chological Association, Chicago, May 1967. 
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variates. A replication study with these in- 
struments yielded similar results. 


The present study presents results on the 
canonical correlation of a different measure 
of vocational interests, the Minnesota Voca- 
tional Interest Inventory (MVII), with the 
same measure of vocational needs, the MIQ. A 
secondary objective of the present study was 
to determine the reliability of the obtained 
canonical relationships by subjecting the 
canonical regression weights to cross-valida- 
tion. 


METHOD 


The Ss for this study were 505 male applicants for 
services at the Minnesota Division of Vocational Re- 
habilitation. All Ss completed both instruments in one 
testing session as part of a larger battery of tests 
given to assist in vocational counseling. The Ss were 
administered the instruments in small groups 
throughout 1966 as part of the continuing research 
program of the Work Adjustment Project at the 
University of Minnesota. 

After the data had been collected, the answer 
sheets were ordered according to a six-digit identifi- 
cation number and a table of random numbers was 
used to generate two groups of 246 and 259, respec- 
tively. Tests for differences of scale means and vari- 
ances showed that the two groups did not differ in 
these respects. Possible effects of age, education, and 
marital status were also effectively minimized by the 
randomization procedure. 

The data from the two samples were then sub- 
jected to independent analysis using the technique 
of canonical correlation, a correlational method 
which determines the maximum relationship between 
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two weighted linear composites (Cooley & Lohnes, 
1962; Hotelling, 1936). The 21 MVII empirical scales 
(occupational keys) were treated as one set of varia- 
bles and the 20 MIQ scales as the other. Due to 
the multiple and simultaneous determination of the 
regression weights in either set by the variables in 
the other set, the predictor-criterion distinction of 
the multiple correlation technique does not apply, 
and scores on either set of variables can be pre- 
dicted from those of the other set. The computer 
program used (Cooley & Lohnes, 1962) calculates 
the canonical regression weights for the larger set of 
variables, then the weights for the smaller set of 
variables for each successive canonical correlation in 
turn. The canonical variates may be thought of as 
“factors” extracted from each set of variables such 
that the correlation between the “factor scores” (the 
canonical correlation) will be a maximum for each 
successive pair of variates. The magnitude of the 
correlation coefficient indicates the degree to which 
these “factor scores” (weighted linear composites of 
the original “predictor” and “criterion” variables) 
covary with each other or can be predicted from 
each other. 

After the sets of canonical regression weights had 
been determined for each group, they were applied 
to the data for the other group in a double cross- 
validation design. Predicted total scores on each 
instrument were computed, based on the beta 
weights previously obtained from the other sample, 
and the product-moment correlation between these 
predicted total scores was obtained for each group. 

Cross-validation of a canonical correlation is 
analogous to that of multiple correlation, except that 
the “actual criterion score” is the predicted total 
score of the “criterion” set. 


RESULTS 


The canonical correlations resulting from 
the development equations are shown in Table 
1. For Group 1, three sets of weights yielded 
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statistically significant canonical correlation 
coefficients of .65, .56, and .52, accounting for 
43%, 31%, and 27% of the variance of the 
canonical variates, respectively. The data for 
Group 2 yielded two significant canonical cor- 
relation coefficients of .62 and .54 which ac- 
counted for 38% and 30% of the variance of 
these canonical variates. 

Tables 2 and 3 list the scales scored on each 
instrument and the beta weights for the sig- 
nificant canonical correlations, rounded off to 
two decimal places. In using the weights for 
cross-validation, five decimal places were re- 
tained. It should be noted that the weights 
for the first canonical correlations of the two 
groups are more similar than the weights for 
any two successive canonical correlations for 
the same sample. This is to be expected due 
to the fact that succeeding pairs of canonical 
variates for a given sample must be orthogonal 
to all preceding pairs. 

Table 2 shows that, for the first canonical 
correlation for Group 1, the variate defined 
by the MVII scales Milk Wagon Driver, 
Printer, Truck Driver, and Plumber was re- 
lated to the variate defined by the MIQ Re- 
sponsibility scale (positive weight) and by 
the MIQ Activity and Independence scales 
(negative weights). Table 3 shows that, for 
the first canonical correlation for Group 2, 
the variate defined by the Milk Wagon Driver, 
Stock Clerk, Printer, and Truck Mechanic 
scales of the MVII was related to the variate 
defined by the Responsibility and Supervision— 


TABLE 1 


CANONICAL CORRELATION, WitKs’ LAMBDA, VALUE OF THE CHI-SQUARE STATISTIC, AND SIGNIFICANCE 
TESTs OF THE First SEVERAL Roots OF THE INTERCORRELATION MATRICES BY GROUP 











Root RS 

1 65 

“ 2. 56 

Group 1 3 52 
4 49 

1 .62 

Group 24 2 54 
3 47 











® The associ ated latent root, As = Ro?. 


b Probability of error in rejecting the null hypothesis of no significant difference from zero canonical correlation. 


eN = 246, 
aN = 259. 
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TABLE 2 


CANONICAL REGRESSION WEIGHTS FOR THE THREE SIGNIFICANT 
CANONICAL CORRELATIONS FROM Group 1 


MVII weights 
Scale Ra = 05 Ree =O) Ro = B52 

Baker —.21 — 45 1.07 
Food Service Manager 19 44 —.605 
Milk Wagon Driver 1.11 —.55 1.32 
Retail Sales Clerk .90 — .06 64 
Stock Clerk 23 — .38 —.21 
Printer 1d — .32 — .32 
Tab Machine 

Operator vail —.57 —.54 
Warehouseman — .66 —.37 1.34 
Hospital Attendant po, —.41 —.71 
Pressman —.01 —.45 eos 
Carpenter mT — 34 —1.10 
Painter 14 —.40 —.27 
Plasterer —.04 18 78 
Truck Driver 1.07 —.69 30 
Truck Mechanic 20 — 1.46 —.29 
Industrial Education 

Teacher 83 54 90 
Sheet Metal Worker —.40 als: 1.60 
Plumber .96 —.31 — .24 
Machinist 132 —.69 04 
Electrician 28 —.40 — .39 
Radio-TV Repairman oul 46 .60 
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MIQ weights 
Scale eae e On eitaat=—te 501) os nea 2 

Ability Utilization 05 39 —.10 
Achievement — .16 —.22 —.01 
Activity — .35 —.05 —.08 
Advancement .28 —.10 17 
Authority .19 —.11 34 
Company Policies ‘ 

and Practices aif — .04 15 
Compensation .07 —.05 —.08 
Co-workers 08 16 34 
Creativity 29 39 24 
Independence — .46 B23 —.27 
Moral Values .06 24 —.28 
Recognition 14 .06 —.16 
Responsibility eo =09 —.52 
Security —.31 —.26 29 
Social Service ai 07 00 
Social Status .06 mi — .02 
Supervision—Hu- 

man Relations 08 .06 —.09 
Supervision— 

Technical — .02 —.23 —.08 
Variety .20 18 .29 
Working Conditions —.16 .10 04 








Human-Relations scales (positive weights) 
and the Independence scale (negative weight) 
of the MIQ. Scales receiving large weights on 
the first pair of variates were found in com- 
mon for each instrument and for both groups: 
Milk Wagon Driver and Printer (both re- 
ceived positive weights) for the MVII and 
Responsibility (positive) and Independence 
(negative) for the MIQ. Comparison of the 
weights for the second canonical correlation 
does not reveal the same similarity between 
groups. 

By emphasizing those scales which show 
fairly reliable and large weights across the 
two samples an attempt can be made to in- 
terpret the canonical vectors. On the MVII 
the Printer and Milk Wagon Driver scales re- 
ceived consistently high weights. A vector 
defined in large part by these two scales is 
most highly correlated with a bipolar MIQ 
vector defined primarily by the Responsibility 
scale on the positive end and by the Inde- 
pendence scale on the negative end. Persons 


whose interest response patterns are similar 
to those of Printers and Milk Wagon Drivers 
also express preferences for job situations in- 
volving responsibility but lacking in inde- 
pendence (working alone). An independent 
validation of this relationship with appropri- 
ate occupational groups would be necessary 
before such an interpretation could be ac- 
cepted as having been empirically demon- 
strated. 

The weights which define the vectors of the 
second canonical correlation are so different 
for the two randomly developed groups that 
an interpretation of the above type seems un- 
warranted. 

Table 4 contains the correlations obtained 
when the weights developed on Group 2 are 
applied to the data from Group 1 and vice 
versa. The first set of weights developed on 
Group 2 yielded a statistically significant 
correlation of .38 when applied to Group 1, 
while the second set of weights from Group 2 
did not yield significant results. When the 
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TABLE 3 


CANONICAL WEIGHTS FOR THE Two SIGNIFICANT CANONICAL CORRELATIONS FROM GROUP 2 





MVII weights 








MIQ weights 


























Scale Re: = .62 | Reo = .54 Scale Ror = .62 | Rez = .54 

Baker .07 —.11 | Ability Utilization .20 O01 
Food Service Manager —.72 .32 | Achievement ao —.23 
Milk Wagon Driver 1.07 14 | Activity —.15 44 
Retail Sales Clerk .68 .76 | Advancement .09 .10 
Stock Clerk 1.01 —.69 | Authority 19 —.09 
Printer 1.68 .22._| Company Policies and Practices 06 —.03 
Tab Machine Operator —.10 .29 | Compensation —.05 —.12 
Warehouseman —.22 .56 | Co-workers a —.15 
Hospital Attendant 42 .63 | Creativity — 03 —.21 
Pressman —.03 .84 | Independence —.46 — .23 
Carpenter mii .52 | Moral Values — .03 02 
Painter 43 .07 | Recognition 06 33 
Plasterer 09 —.29 | Responsibility 56 — .04 
Truck Driver .68 .30 | Security —.14 .06 
Truck Mechanic 1.29 —.67 | Social Service 02 34 
Industrial Education Teacher .80 —.99 | Social Status — .04 ae 
Sheet Metal Worker oO —.34 | Supervision—Human Relations so 45 
Plumber .06 .79 | Supervision—Technical —.13 — 34 
Machinist .46 .68 | Variety 04 al, 
Electrician 56 1.75 | Working Conditions — 32 —.05 
Radio-TV Repairman esa) Seto 

first two sets of weights developed on Group canonical equations, which involved 41 


1 were applied to Group 2, the first set yielded 
a correlation of .38 and the second a correla- 
tion of .28. Both of these correlations are 
statistically significant at p < .01. 


DISCUSSION 


The fact that significant correlations of .38 
were found in both of the cross-validation 


TABLE 4 


Cross-VALIDATION CORRELATION COEFFICIENTS. 
F Tests, AND SIGNIFICANCE LEVEL FOR 
THE First Two Sets oF WEIGHTS 
FROM Eacu Group 











Set r F p 











Group 2 weights applied to 
Group 1 


First canonical weights 38 | 41.7 <.001 
Second canonical weights —.05 62) \>205 
Group 1 weights applied to 
Group 2 
Virst canonical weights 38 | 42.8 <.001 
Second canonical weights 28 | 21.5 <.001 








8 Probability of error in rejecting null hypothesis of no dif- 
ference from zero correlation, 


weights each, is noteworthy. These results 
suggest a stable multivariate relationship be- 
tween measured vocational interests and mea- 
sured vocational needs. 

The authors attempted an interpretation of 
the first set of canonical weights, but only in 
the presence of cross-validation evidence 
which very strongly indicated that such an 
interpretation was justified. Other authors 
(e.g., Dunteman & Bailey, 1967) have in- 
terpreted canonical variates in cases where 
the evidence was less strong. Although those 
authors obtained two slightly larger correla- 
tions (.69 and .61) for the first two pairs of 
vectors, their sample was smaller and they 
reported neither a replication nor a cross- 
validation of their results. The present study 
shows that cross-validation or replication is 
required before any attempt at interpretation 
of canonical variates can be made. 

The fact that a correlation of .54 became 
—.05 when the development weights were 
applied to a second random sample from the 
same population is suggestive of the amount 
of sample-specific variance that can be in- 
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volved in the calculation of canonical corre- 
_ lations. An interpretation of the second set of 
weights from Group 2 would have been mean- 
ingless for Group 1. It would therefore seem 
advisable to put little confidence in the in- 
terpretation of a pair of canonical variates 
developed from a single sample unsupported 
by additional evidence in the form of either 
a replication which produces very similar 
results or a cross-validation. Unfortunately, 
such interpretation is commonly included in 
the meager but growing number of research 
reports using canonical correlation techniques. 

The results of this study lend support to 
an important trend in research methodology. 
Various authors, particularly Dunnette 
(1963), have argued that a single criterion 
measure is not adequate in the applied pre- 
diction situation because what is being pre- 
dicted is very seldom a unidimensional at- 
tribute which can be adequately expressed as 
a single criterion variable. The use of a priori 
weights in a composite criterion is not very 
much better because the investigator seldom 
has a very accurate knowledge of what it is 
in the composite that he is in fact predicting. 
However, this less than desirable methodology 
has continued in common use because of the 
restrictions in the correlational techniques 
which have been used. 

One of the weaknesses which is often men- 
tioned in using the canonical correlation tech- 
nique in an applied context is that the ‘“cri- 
terion” weights developed to satisfy a mathe- 
matical least-squares solution may not reflect 
the values of the variables expected by the 
researcher. In this regard, it should be under- 
stood that canonical correlation analysis is 
primarily a research technique, the purpose 
of which is to show the investigator what his 
data imply. Canonical correlation analysis 
identifies the “criterion” variables which the 
“predictor” battery predicts and their rela- 
tive importance from a linear-weighted com- 
posite point of view. If the “criterion” varia- 
bles were designed to be used as part of a 
composite criterion, the canonical weights give 
the best regression weights for maximum pre- 
diction of that linear-weighted composite. 
Divergence of these canonical regression 
weights from the researcher’s a priori weights 
should result in a reexamination and modifi- 
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cation of the predictor battery so that the em- 
pirical and a priori weights become less dis- 
crepant, or in a modification of the research- 
er’s a priori weighting system. 

Although the method of canonical correla- 
tion has been known for over 30 years, the 
difficulties of computation have caused it to 
be generally disregarded until quite recently. 
This method, although still a tool for research 
rather than practical prediction, allows the 
applied psychologist to explore the variables 
of interest to him in the complex multivariate 
manner in which they occur. 

In the context of the present substantative 
question, that of the relationship between 
measured vocational interests and measured 
vocational needs, canonical correlation has 
served to uncover a relationship hidden by 
other techniques. The present study, in con- 
junction with the previous study using the 
SVIB (Thorndike et al., 1968), suggests a 
moderate (cross-validated R. ™ .38) relation- 
ship between vocational needs and the mea- 
sured interests of nonprofessional men, and 
probably a relationship of similar magnitude 
(un-cross-validated R.™.75) between mea- 
sured vocational needs and the measured in- 
terests of professional men. Further, it has 
been shown that these relationships do not 
exclusively reflect the operation of chance 
factors in that cross-sample stability exists 
for the magnitude of the relationship and, 
in the present study, that the pattern of 
weights defining the relationship yields sta- 
tistically significant results when applied to 
another sample from the same population. 
These data, therefore, suggest that vocational 
interests and vocational needs reflect the op- 
eration of similar underlying variables. 
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Some hypotheses in Holland’s theory were tested in a longitudinal study using 


large, diverse samples of college students. 


Using the appropriate operational 


definitions, students were classified as personality types and personality pat- 
terns and then examined to learn if they exhibited the characteristics suggested 
by the theory. This testing of concurrent and predictive relationships based 
on the theoretical formulations was generally positive. The testing of college 
influences upon vocational choice and satisfaction was accomplished by char- 
acterizing colleges by the Environmental Assessment Technique and students as 
personality types. These analyses of student-college interactions were partially 
successful, Taken together, the results support the usefulness of the theory. 


This report is one of several studies under- 
taken to explore the usefulness of a theory 
of vocational choice and personality by test- 
ing some of its hypotheses (Holland, 1959, 
1966b). The first and second monographs 
were longitudinal studies of bright students 
over 1-4 yr. periods (Holland, 1962, 1963). 
The third study was a longitudinal study of 
change in major field plans (Holland & 
Nichols, 1964). The fourth study examined 
some of the concurrent relationships between 
vocational preferences, occupational images, 
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for their counsel and skillful help in the planning and 
execution of this study: Thomas T. Frantz, Sandra 
W. Lutz, James M. Richards, Jr., William P. Alston, 
Theodore Cooper, Leonard P. Rand, and Lorraine 
M. Rand. He is also indebted to Alexander W. 
Astin, Leonard L. Baird, and Donald P. Hoyt for 
their constructive reading of an earlier draft of the 
manuscript. 
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and self-ratings (Holland, 1963-64). The fifth 
report was another longitudinal study (Hol- 
land, 1964) that replicated some older hy- 
potheses and tested some new and more re- 
mote hypotheses. It also fostered several re- 
visions of the theory (Holland, 1959) and 
emphasized the need for a more explicit and 
comprehensive statement. Such a statement 
has since been published (Holland, 1966b). 
Although these earlier studies support the 
theory, they were all based on samples of 
high-aptitude high school seniors or college 
students. The purpose of the present report 
was to test some of the hypotheses in the re- 
statement of the theory and to do so in a 
large student sample approximating the typi- 
cal college student. Consequently, this ex- 
amination of the theory’s hypotheses should 
be more explicit, and the findings should be 
applicable to a more general population. 
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TABLE 1 


SAMPLES OF STUDENTS AND COLLEGES 








College Male | Female 
Spring sample 

University of Alabama 186 201 
Arkansas Polytechnic College 72 S2 
California State College at Hayward 62 74 
Colorado State College 31 107 
Southern Connecticut State College 89 273 
Wesleyan University (Connecticut) 86 — 
Indiana State University 126 164 
Burlington Community College (Iowa) 79 31 
Kansas State University 322 216 
University of Kentucky 139 154 
Westbrook Junior College (Maine) — 103 
William Jewell College (Missouri) 74 66 
Plymouth State College (New Hampshire) 38 86 
Glassboro State College (New Jersey) 121 393 
University of North Dakota 89 123 
Southeastern State College (Oklahoma) 97 61 
Mount Mercy College (Pennsylvania) — 104 
Swarthmore College (Pennsylvania) 54 46 
Black Hills State College (South Dakota) 60 38 
University of Tennessee 205 184 
Baylor University (Texas) 64 101 
Fairmont State College (West Virginia) 93 85 

Total 2087 | 2662 

Fall sample 

California State College at Hayward 69 102 
Chico State College (California) 109 173 
Amherst College (Massachusetts) 238 _- 
University of Massachusetts 759 875 
Baldwin-Wallace College (Ohio) 221 265 
Cuyahoga Community College (Ohio) 180 156 

Total 1576 1571 





Briefly, the theory consists of several sim- 
ple ideas and their more complex elaborations. 
First, we assume that we can characterize 
people by their resemblance to one or more 
personality types. The closer a person’s re- 
semblance to a particular type, the more likely 
it is that he will exhibit the personal traits 
and behaviors associated with that type. Sec- 
ond, we assume that the environments in 
which people live can be characterized by 
their resemblance to one or more model en- 
vironments. Finally, we assume that the pair- 
ing of persons and environments leads to 
several outcomes that we can predict and 
understand from our knowledge of the per- 
sonality types and the environmental models. 
These outcomes include vocational choice, 
vocational stability and achievement, personal 
stability, creative performance, and suscepti- 
bility to influence. These ideas, along with 
some operational definitions, are elaborated in 
The Psychology of Vocational Choice (Hol- 
land, 1966b). The following sections sum- 
marize the methods of collecting and analyzing 
the data and describe the findings. 


MetHops or ASSESSMENT 


The data for the present study come from two 
American College Surveys described earlier by 


Richards, Holland, and Lutz (1966). Students were 
polled for their vocational aspirations and admin- 
istered several scales and inventories. About 8 or 12 
mo. later, students were polled again for their aspira- 
tions and satisfactions. 

Students came from two college samples. The 
freshmen in the fall sample of six colleges were 
polled in the fall of 1964 and in May of 1965—an 
8-mo. interval. The spring sample of college fresh- 
men was polled in May of 1964 and again in May 
of 1965 when they were sophomores—a 12-mo. in- 
terval. The spring sample included 22 colleges. Both 
samples contained students with a great range of 
scholastic potential, vocational interests, and socio- 
economic status. Table 1 shows the samples of 
colleges and students. 

Because the follow-up data were obtained for only 
39% of the spring sample and 44% of the fall sam- 
ple, it was necessary to examine the differences be- 
tween students with complete pre- and postdata and 
students with only initial data. Comparisons of 
students with complete and incomplete data in an 
earlier study using the same sample revealed that 18 
out of 44 comparisons were significantly different 
for a variety of academic and nonacademic measures, 
but such mean differences always were less than one- 
quarter of a standard deviation (Richards et al., 
1966). In the present study, comparisons of students 
with complete and incomplete data on the Voca- 
tional Preference Inventory scales, the most relevant 
variables in the present study, reveal that all mean 
differences are less than one-sixth of a standard 
deviation. The distributions of vocational choices 
are also similar. In short, there is no compelling 
evidence that the results would be substantially dif- 
ferent if we had obtained complete data on the 
entire sample, although this occurrence is always pos- 
sible. More important, attrition of the original sam- 
ple is of little relevance because the main concern of 
the study is the testing of theoretical relationships in 
the same sample rather than the estimating of 
parameters for the original population. 

A description of the assessment devices used in 
this study follows. The reliability estimates are based 
on the original spring samples of 6,289 male and 
6,143 female college freshmen. 

Vocational Preference Inventory (Sixth Revision). 
This personality and interest inventory is composed 
entirely of occupational titles (Holland, 1965). To 
take the inventory, a student indicates which oc- 
cupations he likes and which he dislikes. Scores on 
only the following scales were used for this study: 
Realistic, Intellectual, Social, Conventional, Enter- 
prising, and Artistic. Reliabilities (Kuder-Richardson 
20) for these 14-item scales ranged from .83 to .89 
for male college freshmen and from .76 to .89 for 
females. 

Preconscious Activity Scale. This scale is an a 
priori scale developed to measure Kubie’s (1958) 
notion of preconscious activity as a process in crea- 
tive performance (Holland & Baird, in press). The 
Preconscious Activity Scale is a 38-item, true-false 
scale with reliabilities (K-R 20) of .72 and .68 for 
male and female college freshmen. The predictive 
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validities of this scale and its concurrent relation- 
ships with originality and interest measures imply 
that the Preconscious Activity Scale should be in- 
terpreted as an originality measure, especially in the 
fields of art, literature, and music. 

Range of Competencies. Using a list of activities, 
students checked the activities which they claimed 
they could do well or competently. The assumption 
underlying these scales is that a large number of 
competencies is conducive to achievement generally 
and that competencies in a particular field are con- 
ducive to achievement in that field. Typical items 
from this list include: I have a working knowledge 
of Roberts’ Rules of Order, I can dance, I am a good 
cook, I can make jewelry, I can read blueprints, I 
can read Greek, I can operate a tractor, I can use 
logarithm tables, etc. Competencies were categorized 
by three judges into several areas of competence: 
scientific, technical, business and clerical, social and 
educational, arts, and leadership and sales. Using 
unit weights, students were then scored for each 
kind of competency. The reliability for the com- 
petency scales used in the present study ranged from 
48 to .87 and from .38 to .85 for men and women. 
The very low reliabilities for a few scales appear to 
result from the small number of items in such scales. 

Interpersonal Competency Scale. This 20-item, a 
priori scale was modeled after the work of Foote and 
Cottrell (1955), who defined interpersonal com- 
petence as “acquired ability for effective interaction,” 
and who outlined a program of research to study 
this concept. Scale items simply poll the subiect for 
those factors which Foote and Cottrell believe to be 
conducive to, or typical of, interpersonal com- 
petency—good health, social experience and com- 
petencies, positive self-regard. The reliability (K-R 
20) of the Interpersonal Competency Scale for male 
and female freshmen was .69 and .67, respectively. 

Dogmatism Scale. This scale, developed by Rokeach 
(1956) to measure dogmatic and rigid thinking, con- 
sists of 40 true-false items dealing with beliefs and 
attitudes. (The first version by Rokeach is in mul- 
tiple-choice form.) The reliability (K-R 20) was .77 
for male freshmen and .75 for female freshmen. 

Student Orientation Survey, Form C. Farber and 
Goodstein (1964) developed four a priori scales to 
assess the student orientations implied in Trow’s 
(1960) student typology. These scales are Academic, 
Collegiate, Non-Conforming, and Vocational. The a 
priori scales were revised by an internal consistency 
item analysis to develop homogeneous, 10-item 
scales. Reliabilities (K-R 20) ranged from .39 to .45 
for males and from .36 to .50 for females. 

Other Descriptive Information. Students were 
polled for their educational and economic aspira- 
tions, their life goals, and their self-ratings. They 
were also asked to indicate their choice of vocation 
and field of training and to provide background in- 
formation. 

Students indicated the degree to which different 
life goals and achievements were “essential, very im- 
portant, somewhat important, or of little impor- 
tance” (e.g., being a religious person, making a con- 


tribution to scientific knowledge, being happy and 
content). 

Using a list of traits and abilities such as original- 
ity, scholarship, and conservatism, students rated 
their personal traits and abilities on a 4-point scale— 
top 10%, above average, average, and below average. 


PERSONALITY TYPES AND PATTERNS 


The purpose of these analyses was to learn 
if a student characterized as a particular per- 
sonality type, or as having a particular per- 
sonality pattern, possesses the attributes hy- 
pothesized in the theory (Holland, 1966b). 
More concretely, if we determine a student’s 
resemblance to each of the personality types— 
Realistic, Intellectual, Social, Conventional, 
Enterprising, and Artistic—by administering 
the Vocational Preference Inventory (VPI) 
and coding his profile of six scales according 
to the appropriate theoretical rules, will he 
exhibit the characteristics suggested by the 
theory? The following analyses show how 
well such student characteristics as personal 
traits, attitudes, aspirations, satisfactions, 
grades, and vocational roles can be predicted 
by using the theory to interpret the meaning 
of a student’s VPI scale scores. 


Student Traits and Orientations 


In Tables 2—7, students have been clas- 
sified by their highest VPI scale score, their 
two highest scores, and by their three highest 
scores. Then, students with different high- 
point codes and profile patterns were com- 
pared for their competencies, life goals, self- 
ratings, attitudes, and personality traits. 

Table 2, for men, reveals what students 
are like who peak on the various VPI scales. 
Generally, the student characteristics associ- 
ated with individual VPI scales are expected 
ones. For example, Realistic peaks are as- 
sociated with technical competencies and me- 
chanical ability; Intellectual peaks are as- 
sociated with scientific competencies, develop- 
ing a scientific theory, mathematics ability, 
and an academic orientation. To facilitate 
study of these and succeeding tables, the “ex- 
pected” high mean scale score for a given 
student attribute is shown in italics. And, al- 


2 The author is indebted to Sandra W. Lutz, Lor- 
raine M. Rand, and Leonard P. Rand for reading the 
statement of the theory (Holland, 1966b) and estab- 
lishing the predictions shown in these tables. 
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TABLE 2 


RELATION oF HicHEST VPI ScorE TO STUDENT CHARACTERISTICS (MEN) 








Highest VPI score 




















Student characteristics Real TInt Soc Conv Ent Art F 
(N= (N= | (N= (N= | (N= (N= 
885) 1774) 845) 428) 798) 630) 
Competencies 
sch x 3.85 5.22 ao) 3.43 3.43 3.66 | 110.81 
Scientific SHulleas. Pre yey og Yeaet e 
Fate xX 14.63 | 12.99 10.82 11.94 12.26 | 10.78 94.34 
Technica SD 3.68 | 4441 441 | 414] 436 | 4.82 
Basi EG 1.61 1.95 1.82 2.39 2.31 1.92 33.48 
Memes? SD 1.26 1.38 1.31 1.36 1.33 1.39 
, , XG 4.60 So 6.08 5.48 6.15 6.30 55.87 
Social and educational SD 2.83 2.99 2.76 2.92 2.86 2.97 
fae XG 5.44 Row 7.39 6.02 7.21 | 12:32 5 \ esas 
rtistic SD 4,67 | “5.67'| *5.46 | 5.241 * 5.84 | Gee 
ence Xe 3.23 4.34 4.77 4.37 5.20 4.99 45.42 
cecerenlp SD 2.60 | 3.07 3.07 3.05 3.16 3.13 
Life goals 7 
P EG 1.60 ie 1.36 1.41 1.48 1.43 43.96 
Developing useful product SD 7 84 64 71 72 72 
L tea Ex 1.53 2.10 1.38 SS 1.38 1.45 | 174.03 
Developing scientific theory SD as 05 68 65 69 72 
A . XG 1.37 1.39 1.34 1725 1.36 1.95 67.46 
Producing artwork SD 74 | nati 69 spre 63" 1 e"-73 ie 
Becoming expert in finance and x 1.67 1.61 1.61 2.50 2.35 1.64 | 200.66 
commerce SD .80 84 83 1.00 1.05 87 
ek = : aX 2.64 Broil 2.70 DSP 2.55 Dese 14.13 
Active in religious affairs SD 98 (21034) 7.04." oy JP os, Wage 
; : Bx 2.29 2.28 2.28 2.77 Zi3i 2.19 56.83 
Executive responsible for work of others sp "92 87 98 87 86 ‘93 
Self-ratings 
a 2.21 2.47 2.38 2.26 2.48 200) 65.35 











Originality SD 66 74 74 69 3 6 

ReaD ena ne a a pe ao 105.65 

aca X 9:07 Weal 2.49 | 2.39 2.05 | 2.44 Gee 
pp uae, SD 64 69 aA 64 a7 78 

Understanding of others Be ee ie HH ae oe rs 37.88 

ante K 73) We Zod 2.07 2.50 2.10 | 2.09 Niate6iss 
ee SD .89 .93 90 92 90 96 

. oe Xx 2:26 | ae 220 23h 2.25 ot 4.26 
OnSery ane SD 65 74 74 70 aL 83 

Personality and attitudinal scales 

i: Baa nee ee tte X. | 15.14 |-17.86 | 16.944) 12.841 14:00 | 2767igenes 
SoS a ; SD 4.46 | 4.89 4.93 4.22 4.52 5.12 

7 X | 18.79 | 17:35 | 1748 010¥8.22>| 17.41 51697 
oematism SD 6.00 5.84 5.83 5.60 6.04 | 5.77 

hositeaa) Bt x 415 | 4.84} 4.63 | 4.23 4.21 4.83 | 24.90 
Cate SES SD 1.85 1.99 1.98 1.93 2.01 1.91 


Weeatiinaltteyne xe 5.41 4.79 4.81 5.63 5.33 4.19 62.33 
YI SD 1.61 1.74 1.79 1.64 1.62 1.88 
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TABLE 2—Continued 














Highest VPI score 
Student characteristics Real Tt Soc Gon Ent Art F 
(N= (N= (N= (N= (N= (N= 
885) | 1774) | 845) | 428) | 798) | 630) 
‘ ane Non ime O7ale 25°) 344 1205 | 338| 374. | 16.84 
EIS moms type Spe oo 4.74) 171 | 450-| £68 || 1.85 
aaeyt aes 47 456 P A590 | ©3505 | 4170) 20.68 
poe ate ape Sp |. 188 | 192 | 1.83 | 184] 180 | 201 
x | 10.24 | 1097 | 11.78 | 10.72 | 11.86 | 11.37 | 28.68 
es reoea Competent; Spies 21 es.43 0 336.1 93.29) 337) 3.64 











Note.—All Fs are significant beyond the .05 level and the majority are significant beyond the .01 level. In Table 2, an F 


of 3.04 equals the .01 level. 


The Ns for various VPI codes or scales in Tables 2—7 are larger than the same codes or scales in all other tables, because 
Tables 2-7 contain data for students with and without follow-up data. All other tables are based only on data for students with 


pre- and postdata. 


Italicized means indicate theoretical predictions. If prediction is correct, italicized mean should be the highest X. 
Abbreviations: Real or R = Realistic; Int or I = Intellectual; Soc or S = Social; Conv or C = Conventional; Ent or E 


= Enterprising; Art or A = Artistic. 


though the highest mean is not always sig- 
nificantly different from other means for the 
same characteristic, 19 out of 25 predictions 
(76%) accord with the theoretical models. 

Table 3 presents the student characteristics 
that go with a student’s highest and second 
highest VPI scale scores. This table clearly 
indicates that students with the same high- 
point scale can still be distinguished by their 
second highest scale. In Table 3, for instance, 
men with a Realistic-Intellectual pattern are 
distinguished by their scientific competencies 
and goals of developing a useful product and 
scientific theory, whereas men with a Real- 
istic-Social pattern are characterized by their 
social and educational competencies. In all, 
69 out of 92 predictions (75%) are con- 
sistent with the theory. 

In Table 4, students whose first two highest 
scales were the same were compared to see if 
they could still be distinguished by their third 
highest scale. To insure more reliable out- 
comes these analyses were performed only 
for the 20 three-variable patterns with the 
largest Ns. Despite this severe test of the 
validity of the personality patterns as assessed 
by the VPI, the F tests—although smaller in 
size and significant for fewer variables—reveal 
that the third scale still distinguishes one 
subgroup from another and usually in ap- 
propriate ways—25 out of 39 predictions are 
correct (64.1%). 


The results for women are given in Tables 
5, 6, and 7 for one-, two-, and three-variable 
VPI codes. In Table 5, 21 out of 25 predic- 
tions are correct (84%). In Table 6, 52 out 
of 69 predictions are correct (75.4%). And in 
Table 7, 24 out of 33 predictions are correct 
C7251 Te ye 

The results presented in Tables 2-7 imply 
that the formulations for the personality 
types have some construct validity. Similarly, 
the interpretations of student profiles—VPI 
two-and-three-digit codes—by the application 
of the typological formulations possess similar 
validity. 


Educational Aspiration, Satisfaction, and 
Achievement 


In Tables 8, 9, and 10, the relations be- 
tween personality types (high-point VPI 
codes) and satisfaction with college, educa- 
tional aspiration, and college grades were 
examined. In Table 8, the chi-square tests 
show that high-point codes are not inde- 
pendent of a student’s satisfaction with his 
college 8 mo. after he was assessed by the 
VPI. Students rated their satisfaction with 
their college by responding to the following 
item: 


How satisfied are you with your final choice of a 
college ? 


This is the best possible college for me that I know 
of. 
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TABLE 3 


RELATION OF Two-Dicir VPI CopEs to StupDENT CHARACTERISTICS (MEN) 



























































Two-digit VPI codes 
Student characteristics be ae le ad a is a | sa ; Fs 
(N=317)| (N=97) | (N=92) | (VN=117)| (N=46) 

Competencies J A 3 i nee 

Scientific sp | 340 | 210 | 224 | 222 | 2 | 

tn SA erties aoe 

Social and educational Ee ae a ae oe on ie 

Artisti sb} sags | oat | 43m]. asc) 2 
Life goals 

Developing useful product a a ae ce pe es 

Developing scientific theory a tee oS oe oe be 

Becoming expert in finance and commerce ee whe Hen aA ne ae ia 

Active in religious affairs ee oe ae: a ie on 62 
Self-ratings 

Mechanical ability alae. “3p | 78. 00 | aa 

Math ability e fe er os ie oa a 
Personality and attitudinal scales 

Preconscious activity (originality) os es a peu vee a a 

Collegiate type sp | is .|. is | 176.) das |) oa 

IR IS le TE IA 
(N=552) ||(V=307) | (N=68) | (N= 142) | (V=307) 

Competencies oa 

Technical oT ek tn ie cae am ee 

Business sp | i390 | ta) 130 | 243 | 12 | 

Social and educational . a a aes ce oe e 

Artistic sp | 329 | aor | sz | 63s | 58 | 

sevbotns sb» | -240-/|) 407 lower jlpuaanlnls olin 
Life goals 

Developing useful product eS aoe a a ues Ks aig 

Developing scientific theory ee eo os aa oo aa ve 

Producing artwork ne ey a on ree ie _ 
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TABLE 3—Continued 


Two-digit VPI codes 








Student characteristics = fra 
IR Is Le LE IA 
(N=552) | (N=307) | (V=68) | (V=142) | (V=307) | 
Becoming expert in finance and A iL SY 1.52 2.09 2.02 1.50 18.97 
commerce SD 14 ate 1.05 1.02 79 
Disees ae 2 x 2.55 2.67 Desi 2.47 Drill 7.08 
Active in religious affairs sp 1.00 101 1.04 1.02 1.06 
: ae ; _ XG 22 223 2.68 2.96) Deiidl 12.01 
Executive responsible for work of others sp ‘st 86 78 83 ‘92 
Self-ratings 
ee 7 Xe 2.33 242 2.41 2.49 279 D2KSS 
salty SD ‘67 ‘68 ‘65 68 30 
é ae EXG 2.66 2.04 PEM D5 2.28 27.44 
Mechanical ability sp '88 86 84 "97 88 
: BE 2.36 250) 2.47 2.60 2.29 aa 
Baletity SD 65 68 16 1 45 
a XG 2.58 2.92 2.68 2.79 Del 12.36 
Understanding of others sD 67 a4 67 ‘67 74 
ae EX, 2.60 2.49 2.88 2.54 2.71 4.04 
ae buly SD 91 04 ‘88 93 04 


Personality and attitudinal scales 


Preconscious activity (originality) xX 16.94 18.10 14.99 16.26 21.31 63.34 




















SD 4.22 4.72 4.60 4.15 4.68 ; 
Pun eater ee 
vc ee ee | 
Nemfamit Beate ee 
cote ee ee ce 
ges! competency sp | sis | 33s | 290 | 330 | 3a | 
SR SI SC ee ese 
(N=73) | (N=133)| (W=47) | (N=212) | (V=148) 
Competencies 
ets sp | 247 | 222 | 194 | 24 | 236 | > 
te ai ea cls) | ei) eae 
oo sp | as | aat | sia | 52s | sm | 
Le pass ce) iene 
Life goals 
Developing scientific theory ES aa ee vee - ae 2.81 
Producing artwork & i ae pe ee 3.97 
Becoming expert in finance and commerce a me es a ie ae all 
Executive responsible for work of others a oes Te aie me om 2.38 
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TABLE 3—Continued 





Two-digit VPI codes 




















Student characteristics ss [rs 
SR SI Sc SE SA 
(N=73) | (N=133) | (N=47) | (N=212) | (N=148) 
Self-ratings 
Gate X 2.16 2.37 2.02 2.41 2.58 7.24 
ee SD 64 wi 56 7 79 
Ce ce x 2.26 2.15 1.64 77 1.68 | 11.60 
eee ee SD 81 88 70 85 16 
Panne xX 283 2.44 O35 2.64 2.48 3.86 
I ; SD 70 3 76 69 64 
a xX 2.00 2.20 2.40 2.00 1.92 3.78 
a SD 04 95 89 82 00 
Personality and attitudinal scales 
Bee ee eae ee 7% 16.04 17.42 14.89 16.09 19.91 “Voie 
eS SD 4.69 4.10 3.84 4.54 4.73 
Dae xX 17.51 17.34 19.53 16.63 16.80 2.68 
8 SD 6.21 6.19 5.29 5.52 5.54 
voeee eee X 5.22 4.62 5.23 5.06 4.24 6.91 
; yP SD 1.85 1.78 1.74 1.74 1.77 
Balen eee ne 4.67 4.31 4.53 4.97 4.13 5.57 
Oe SD 1.84 1.71 1.61 1.83 1.82 
FO tole hook eee Di 11.04 11.44 10.79 i237 12.19 4.62 
P Oe SD 307 3.19 3.34 3.29 3.22 
CR CI Ge CE CA 
(N=48) | (N=34) | (N=40) | (V=197)| (N=9) 
Competencies 
ee iy 3.37 5.29 2.87 3.21 3.11 7.20 
SD 1.78 2.84 oAT 2.16 2.33 
ree xX 13.62 12.74 10.97 11.75 10.56 3.43 
SD 3.51 4.21 3.96 4.02 3.98 
Rees xX 22 2.56 2.12 2.55 1.67 2.41 
SD 1.25 1.22 1.35 1.32 1.63 
Life goals 
DevdlopittRccatinetheor) ia 1.44 1.76 1.30 1.24 1.33 6.13 
SD 61 73 56 56 67 
: : r 2.60 2.68 2.40 3.06 3.00 5.38 
Becoming expert in finance and commerce sp 1.06 1.10 1.09 "80 105 
Self-ratings 
Bae pe 2.10 2.50 2.05 2.30 2.67 3.85 
ed SD AT 65 63 70 67 
, Pe eK 2.44 2.18 1.90 2.04 959 2.68 
ee SD ‘1 '86 (30 85 79 
i xX 2.46 2.97 2.50 2.40 2.44 2.94 
Ce SD 79 02 1.00 00 83 
ER EI ES EC EA 
(N=77) | (N=67) | (N=153)| (V=196)| (N=96) 
Competencies 
ot xX 3.53 4.12 3.59 24 3.10 2.51 
oe SD 2.33 2.03 2.67 2.29 2.18 
; yi 13.96 12.27 11.63 11.87 12.02 4.06 
poe SD 3.96 4.18 4.46 4.29 4.74 
; Da 2.06 928 2.31 2.61 2.30 227 
HERES SD 1.35 164 1.38 {37 1.30 
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Two-digit VPI codes 














Student characteristics i 
ER EI ES EC EA 
(N=77) | (N=67) |(N=153) | (N=196)| (N=96) 
; x 5.45 5.91 6.82 5.80 6.31 4.14 
l 
so tel ane -eos vena SD 2.82 3.27 2.80 2.64 3.04 
er xX 6.05 6.30 7.93 5.81 11.00 | 15.74 
Artist 
ia SD 5.29 4.39 6.11 5.26 6.40 
xX 4.30 5.10 5.46 5.17 5.80 DG 
Leadersl 
ce SD 3.12 3.27 3.25 2.95 3.22 
Life goals 
‘ xX 27 1.33 1.32 1.29 1.59 3.55 
Prod twork 
OS es ea a SD iss 16 65 6) 80 
, x 2.03 2.43 2.02 2.89 2.26 | 20.75 
xpert in fi 
Becoming exper In Hnance and Dee See Sy 97 95 1.00 99 99 
Self-ratings 
Seals. Xx oe 2.61 2.48 2.32 PUP 6.43 
Onginality SD ‘81 83 166 66 77 
: re og 2.43 15 1.99 2.02 1.90 4.73 
Mechanical ability SD 86 ‘97 ‘00 "95 ‘gs 
x 2.47 2.63 2.78 2.61 2.73 3.05 
=p opalenty SD 78 "5 67 70 (3 
: xX Day 2.78 2.90 2.69 2.83 3.20 
Understanding of others sp 26 “a5 a4 a4 oo) 
bee x 213 2.36 1.92 2.20 1.86 5.25 
eet) SD 39 03 o1 00 A 
Personality and attitudinal scales 
Oe iM 38 x 13.47 15.48 1507 13.59 18.14 | 21.72 
Preconscious activity (originality) Sp 4.34 4.60 423 374 457 
x 10.91 ie 12.37 11.77 12.49 Bai 
Be 2 a) Competency SD 3.15 3.31 3.26 3.44 B16 
AR AI AS AC AE 
(N=41) | (V=145)| (V=161)| (N=6) | (N=146) 
Competencies 
Le! ee Bt 4.42 3.36 3.50 312 6.15 
Scientific SL 241 2.38 2.50 06 2.30 
a 12.24 11.14 9.70 6.17 10.93 4.63 
Pechnical SD 4.72 4.68 4.60 2.91 4.94 
: xX 1.46 1.85 1.81 2.33 29) 3.36 
2S SD 117 1.33 1.32 04 1.50 
: , ee 5.10 5.97 6.77 6.00 6.58 3.46 
peeeeeees cucational SD 2.80 2.87 2.91 3.51 3.05 
ar x 9.54 12.19 13.08 12.50 12.73 2.61 
Be HetIC SD 5.70 6.77 6.15 5.62 6.53 
Se 4.07 4.59 5.18 4.83 572 3.54 
peadership SD 3.06 2.98 3.21 241 3.17 
Life goals 
He 1.66 1.52 1032 ya 1.43 2.67 
Developing useful product Sp "84 26 65 37 70 
weet x ie? 1.66 133 1.50 1.38 4.91 
Developing scientific theory Sp 65 85 ‘61 6 61 
Becoming expert in finance and commerce x 1.66 1.46 1.45 ale oe ae 
SD 84 74 80 1.07 97 
A: : xX 2.00 2.00 2.03 3.00 2.50 8.65 
Executive responsible for work of others sp 99 ‘99 ‘92 58 89 
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TABLE 3—Continued 


HOLLAND 














Two-digit VPI codes 
Student characteristics a =| oor Rare, Up He Se Fe 
AR AI AS AC AE 
(N=41) | (V=145) | (V=161)| (V=6) | (V=146) 
Self-ratings 
tS we a 2.56 2.05 1.74 1.50 1.38 | aay 
Mechanical ability i) 94 7 ‘gs 50 38 | 
Pa aulenen xe Deh 2.41 2.41 2.00 2.54 2.48 
daa nk SD 79 71 78 58 78 
, : : xX 213 2.93 3.06 2.83 2.82 2.67 
Understanding of others SD 93 23 as 69 16 
ae xX Dan 2.30 1.81 2.67 1.99 6.32 
\ ability 
pe SD 99 1.05 87 ‘AT ‘88 
Personality and attitudinal scales 
" : ney M7 eee xX Dita 23.30 22.80 19.50 20.77 6.13 
Preconscious activity (originality) SD 511 5.20 5 36 3.69 4.30 
A : xX 4.51 3.86 4.04 5.00 4.53 3:13 
Re ear SD 1.84 1.82 2.00 1.53 1.85 
4 : xX 3.56 4.00 4.04 4.50 4.77 4.86 
Gomceate ype Syn 2a? fet | (2.07, os, VP 
xX 10.05 11.29 11.43 9.83 12.09 3.46 
] r d 
Interpersonal competency SD 3.62 3.62 3.07 177 3.43 




















Note.—See note of Table 2. 
a All F ratios are significant beyond .05 level. 


This is a good college for me, but there are a few 


othersytha tlechinkvanesbet tenure rier reir 2 
This is only a fair college, and there are many others 
which would probably suit me better. ........... 1 


The results for men in Table 8 conform 
closely with the theoretical statement (Hol- 
land, 1966b, p. 48): satisfaction should be 
highest for the Intellectual type, and in de- 
scending order, Social, Artistic, Conventional, 
Enterprising, and Realistic. The rank-order 
correlation between the actual mean satisfac- 
tion scores and the mean expected scores 
equals .83 (p < .05, one-tailed test). At the 
extremes, the male Intellectual type is most 
satisfied with his college and the Realistic 
type is least satisfied. The results for women 
follow an unpredictable pattern and do not 
conform to expectation. 

In Table 9, student educational aspirations 
are not independent of high-point codes for 
each sex (p < .001). The two rank-order cor- 
relations (each rho =.77) between the ob- 
served and predicted medians for the six 
types are positive, but they are not significant. 
The expected level of educational aspiration 
by personality type is identical with the 
theoretical ordering reported earlier for satis- 


faction with college and also constitutes the 
expected order for level of academic achieve- 
ment in Table 10. 

In Table 10, college grades, for men, are 
not independent of personality types (p < 
.05). For women, grades and types are in- 
dependent of one another (p > .05). Although 
grades, for men, are not independent of types, 
the rho of .71 between the observed and ex- 
pected median grade for each type is not sig- 
nificant. (A rho of .83 is needed for a one- 
tailed test at the 5% level.) 


Vocational Role Preferences 


In a follow-up survey, about 8 or 12 mo. 
after the administration of the VPI, students 
responded to the following item: 


What special role would you like to play within 
your first occupational choice? 


Being a practitioner of my occupation. 

Training or teaching others about my occupation. 
Leading or supervising people. 

Doing research in my field. 

Acting as a consultant or expert to others. 
Undecided. 

Other role. 
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TABLE 4 
RELATION OF THREE-Dicir VPI CopEs To STUDENT CHARACTERISTICS (MEN) 
Three-digit VPI codes 
Student characteristics Fa 
RIS RIC RIE RIA 
(N=59) (N=59) (N=59) (N=44) 
Competencies 
Bee area ce tienal XG Basil Sul 4.93 4.09 7.42 
SD 2.65 2.62 2.83 2.40 
Artistic Xa 5.14 3.88 6.24 7.82 evil 
: SD 3.88 3.95 4.44 5.02 
ete chi xe 3.54 2.39 3.86 3.41 3.65 
P SD 2.26 2.42 2.61 2.90 
Life goals 
2 4 de 1.22 LDS 1.47 eo 2.88 
Producing artwork Sp 52 sd ‘st mn 
4 ’ , , 1.61 EDS! 1.88 1.41 3.74 
Becoming expert in finance and commerce sp 26 a4 “2g 58 
Self-ratings 
Poselant xX 2.24 2.10 2.44 2.30 3.16 
P y SD 65 60 59 55 
Personality and attitudinal scales 
é Se: okie ree Ds 14.73 15222 17.05 17.80 5.38 
Preconscious activity (originality) SD 433 4.60 4.66 4.35 
S X 3.88 3.95 4.85 4.41 3.53 
eee retype SD 1.70 161 1.94 2.04 
KE 10.64 9.14 10.73 10.00 3.54 
Interpersonal competency sp 3.00 2.97 3.02 2.90 
IRS IRC IRE IRA 
(N=115) (N=76) (N=89) (N= 86) 
. Competencies 
j : X 5.56 4.04 5.29 4.58 5.30 
Social and educational sp 2 85 2.62 3.02 271 
Artisti XG 6.30 4.99 5.76 7.56 S155 
Pe SD 5.43 4.61 4.89 5.59 
Peasant dx 3.90 3.14 4.33 3.45 3.03 
P SD 2.80 2.49 2.87 2.58 
Life goals 
: iweb ak 1.98 2.30 Dal 2.30 2.97 
Developing scientific theory SD "85 ‘80 93 ‘07 
: é z 1.49 1.66 1.80 1.30 7.73 
Becoming expert in finance and commerce SD 65 a7 "84 lei 
_ P 5 oa XG ZelS 2.08 2.47 2.03 5.96 
Executive responsible for work of others SD ‘ay ‘a4 29 "30 
Self-ratings 
Originalit xs DOT 2.29 2.25 2.56 4.40 
ce SD 62 68 60 69 
pe XG 2.47 2.96 2.54 DES? 5.29 
pea ability SD 88 82 06 89 
Personality and attitudinal scales 
Preconscious activity (originality) x LO 16.32 15.88 19.09 11.06 
ee ee SD 4.28 4.14 3.92 3.87 
WReational tyoe Exe 4.91 paz) 5.45 4.78 2.97 
2 yP SD 177 1.62 1.69 1.57 
Galleviate type aXe 4.40 3.79 4.61 4.19 3.00 
eter’ SYP SD 1.91 1.74 1.78 1.81 
Pov eoininetenc xX 11.25 9.75 10.69 10.58 3.59 
EP ee SD 2.92 3.05 3.01 3.40 



































12 Joun L. HoLitanp 
TABLE 4—Continued 
Three-digit VPI codes 
Student characteristics 9 : Te 
IAR TAS IAC TAE 
(N=93) (N=94) (via) (N=49) 
Competencies 
Technical xX 12.87 10.89 11.57 12.90 3.91 
SD 4.35 4.34 3.33 4.48 
aces ex4 1.74 1.88 3.14 Dine 3.83 
r ‘ SD 1.29 1.14 83 Tesi 
ee ee potas X 4.55 6.37 4.71 6.37 yea 
Social and educational SD 3.04 3.01 167 2.76 
Weadercnts xX 3.76 5.01 4.43 6.10 6.45 
I SD BTS 3.24 2.92 3.39 
Life goals 
5 ae 2 x eon 1.47 1.71 1.92 Sallis. 
Developing useful products 5 '87 ‘65 10 92 
; ee ; x 1.38 1.30 Zot 1.90 12.16 
Becoming expert in finance and commerce ¢7, 65 a4 90 95 
3 7 : : Xa 1.87 Data 2.86 2.49 6.28 
Executive responsible for work of others Sp "89 ‘94 “64 07 
Self-ratings 
rate x 2.83 2.90 2.00 2.82 2.83 
Onginality SD ‘84 3 53 82 
: te X 2.54 2.07 Des 2.10 5.21 
Mechanical ability sp ‘92 ‘84 1.05 rv 
; x 2.14 Deon 3.00 Zoo) eel 
Popularity SD “14 66 1.69 70 
Personality and attitudinal scales 
, Xx, 4.20 3.70 4.71 4.73 3.98 
Vocational type SD 1.89 1.80 1.48 1.50 
XY 9.76 11.00 9.43 12.37 5.65 
Interpersonal competency Sp 3.68 405 159 3 24 
ISR Se ISE ISA 
(N= 84) (N=12) (N= 60) (N=60) 
Competencies 
d xX 1.70 1.67 245 1.88 3.74 
ee SD 1.21 1.43 1.50 1.42 
— xX 6.30 8.08 ales, 9.25 4.47 
ana SD 4.40 5.47 4.51 5.45 
Life goals 
om: X 1.18 1.42 tet eae: 5.34 
Producing artwork SD 47 86 45 "88 
Self-ratings 
eee xX 2.40 2.50 2.28 2.70 4.30 
Originality SD 64 65 61 nt 
ern dS 2.20 2.42 1285 1.98 2.95 
Mechanical ability SD ‘04 '86 23 26 
Personality and attitudinal scales 
; aes Maks x 17.61 17.92 16.32 21.28 12.49 
Preconscious activity (originality) SD 4.35 3.86 4.93 4.77 








Note.—See note of Table 2. 


4 All F ratios are significant beyond .05 level. 
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TABLE 5 


RELATION OF ONE-Dicit VPI CopEs to STUDENT CHARACTERISTICS (WOMEN) 








One-digit VPI codes 





Student characteristics Real Int Soc Conv Ent Art Fa 








Cae W N= | ive | t= | we 
31) | 554) | 3033) | 271) | 148) | 1284) 
Competencies 
eae x| 3.52 | 451 | 283 | 2.28 | 247 | 297 | 65.94 
pcientific sp| 249 | 2421 214] 181 | 214] 221 
ay! x| 945 | 563 | 478 | 490 | 526 | 4097 | 19.72 
Technical sp| 410 | 331 | 3.07 | 313 | 369 | 323 
. x| 200 | 194 | 203] 249 | 232 | 202 | 9.29 


SD ed 1.24 1.25 1.20 1.39 1.24 
xX 6.19 6.67 8.01 6.62 7.01 7.56 | 40.19 


Social and educational SD 272 2.90 7 38 250 273 263 


Arts xX 8.23 9.68 10.07 7.76 10.87 13.89 | 106.34 
SD Dror) 5.34 5.50 4.58 6.11 6.41 
oe sp| 291 | 297) 307 | 307] 332 | sn | 
Life goals 
Developing useful product es ae oe re a He re 9.05 
Developing scientific theory ma nice he es ae o ie 85.15 
Producing artwork on a a a ye ie 65.66 
Becoming expert in finance and commerce ie oe ae ais a i ee 31.58 
Active in religious affairs es ah es my ae a ae Hosts 
Executive responsible for work of others e ee ae oe a oe 15.84 








Self-ratings 








ae Reo ise i 23r 1224 |) 912 | 945.1 2509) 94.22 
oe SD 71 aa 66 67 68 74 
: a aso en 25. 1:57 | Mete67 tiliieetroseull Ut 6St"l 91 00 
Be gical ability Spy moone sn | 70| 77 | 78. | 7s 
: anon 20s 9236 |) 2:97 | 249 | 9:33 -) .4.55 
Bopwtarity SpiMeee eees cs | 6611.60 | «68 
: Melee aes 2d | 284 | 9.60 | 2:78 | 284 | 974 
Understanding of others SD a4 73 ‘67 63 67 70 
a Saleen e736 | 61.90 |) 92.32 1.95 | 1.86 | 38.58 
pe ebilty sp| 82 09 85 07 33 ‘88 
ee Rel Osnoo 218s) 237 ) 290 | 210 | 6.68 
SD 54 a 63 63 74 69 


Personality and attitudinal scales 
Da enl/co2me LO Oi Molle ete. 408 Ne iOve i aa.20) 215.60 
SD| 4.76 4.61 4.49 4.30 4.94 4.50 
X | 16.68 | 16.26 | 17.05 | 18.26 | 17.68 | 16.60 6.56 
SYD) |) — oxs{0) 02 SG) 5.30 5.09 5.48 
X| 4.29 5.00 4.57 4.27 4.39 4.87 | 10.78 
SD} 1.99 2.03 1.92 1.82 1.93 1.95 
X | 4.71 4.16 4.57 5.01 4.58 4.01 | 30.24 
SD eco 1.65 1.67 1.79 1.67 1.68 
XG oreo 2.82 2.74 2.61 2.94 Ta eel tet) 
SD 5 1.55 1.58 ESS 1.88 1.74 
X| 4.42 4.56 2 4.87 5.18 4.69 | 19.98 
SDs eel. 1.99 1.97 1.95 1.97 2.06 
XO LO OG OCSm eit Sa ie LOLS Un iedteo ti 11°62 8:93 
SD | 3.39 3.30 20 3.35 3.45 3.26 


Preconscious activity (originality) 








Dogmatism 
Academic type 


Vocational type 








Nonconformist type 


Collegiate type 











Interpersonal competency 





Note.—See note of Table 2. 
8 All F ratios are significant beyond .05 level, 
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TABLE 6 
RELATION oF Two-Dicir VPI CoprEs To STUDENT CHARACTERISTICS (WOMEN) 
Two-digit VPI codes 
Student characteristics ay 3 a <a Fa 
IR IS LG IE IA 
(N=32) | (N =234) (N=25) | (N=14) | (V=150) 
Competencies 
Technical xX 7.09 5.40 6.96 4.57 5.29 3.82 
ane SD 632 3.07 S997) PEST 3.2500 
icine x 1.78 2.02 BOG 2ST, 1.65 5.03 
SD 99 | 122 L205 1.24 1.19 
Seal acl educational xX 4.62 7.09 6.72 8.21 6.27 7.34 
: ‘ SD 2.81 2.67 2.88 254 3.02 
Artichic XG 7.44 9.58 7.24 10.43 10.57 3.84 
/ SD 4.41 SES 3.95 | 4.84 5.31 
pene x ne, 4.45 4.72 6.29 3.89 3.83 
P Spi) Bo 3.03 2.89 2.99 2.83 
Life goals 
‘ , BX 1.47 1.42 1.40 ri lane 5.07 
ROO ROE SD| 83 78 57 04 00 
Becoming expert in finance and commerce x 1.22 1.35 1.60 1.93 1.27 4.57 
“I SD 54 67 80 80 60 
, ; ; . dE 1.50 1.87 2.20 2.43 ois 4.90 
Executive responsible for work of others Sp 74 '80 "89 1.05 '83 
Self-ratings 
Onginatiey X 2.16 226 2.04 2.50 2.43 3.02 
= SD aS) 65 a 13 We 
: f ce xX 2.62 1.75 2.08 2.07 1.81 9.29 
Mechanical ability ST "89 "80 ‘80 "88 a7 
: XE 2.09 2.29 2.16 Zid 2.14 3.79 
pepulee Spi. V a52 66 54 70 63 
4° xX 2.34 2.80 2.48 3.29 Ooi 5.76 
Understanding of others Sp 69 67 ‘a5 70 ‘a7 
a OG DeTe PES 2.84 2.43 Ds 3.46 
Sy SI 84 04 92 1.05 1.01 
Personality and attitudinal scales 
: as 8S eee Ot 18.50 19.21 16.84 18.93 21.69 11.91 
Preconscious activity (originality) sD 4.15 456 444 4.33 3.95 
eure x 4.44 5.10 3.92 5.07 5s Dede 
pes SD| 1.85 1.98 1.49 1.22 2.14 
F XG 3.66 4.69 4.72 6.14 4.43 4.51 
Collegiate type sp| 2.07 1.93 1.95 1.88 1.92 
P x: 9.78 11.16 10.72 12.36 10.08 4.17 
eater eer onal eprapetency, Spi ¢ 367 3.14 3.58 2.84 3.43 
SR SI SE SE SA 
(N= (N= (N= (N= (N= 
21) 444) 360) 536) 1077) 
Competencies 
xan: De 3.10 4.00 2a 2.54 2.73 43.97 
Cone sp| 2.31 2.34 1.95 2.06 | 2.01 
; x 6.95 5:25 5.09 4.63 4.50 8.78 
tee SDil | b3.37 3.43 3.11 2.94 2.92 
Bases BXG 2.05 1.95 ZECE, Det 1.95 4.43 


SD 1.09 1.23 123 1.29 1.24 
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TABLE 6—Continued 
Two-digit VPI codes 
Student characteristics SR ST sc SE SA Fa 
Ne (N= v= (N= CN 
21) 444) 360) 536) 1077) 
: 3 EX? 7.29 7.91 7.74 8.34 8.10 4.55 
Social and educational sp 2.00 9.49 204 9 42 2 36 
mene xX 8.29 9.27 8.26 9.80 Weary 32.62 
peueus spD| 5.48 5.29 4.66 5.24 5.68 
F x 4.90 4.95 4.71 5.40 5.18 3.19 
po orerip SDI) 3.0 2.97 3.06 3.09 3.06 
Life goals 
’ Q Xe 1.19 1.24 ited 1.16 1.19 2.38 
Developing useful product SD 66 sd 43 44 52 
4 tg adh co i x 1.19 136 Asi 1.15 1.16 12.85 
Developing scientific theory SD 66 68 ‘50 46 "49 
7 / x 1.33 1.43 Loy} 1.41 1.58 9.01 
Producing artwork SD "BA a7 65 ‘a5 86 
2 : 5 XG 1.29 1eS2 1.56 1.44 1.28 14.06 
Becoming expert in finance and commerce SD 63 67 80 a4 Gl 
toes be ; Xe 2.76 3.03 3.14 2.95 3.06 2.82 
Active in religious affairs sD ‘07 ‘O4 "89 ‘OF ‘OA 
; : aw. XG 1.76 2.05 Dale ae; 1.94 5.78 
Executive responsible for work of others sp "8 92 "80 "88 "86 
Self-ratings 
Originalit X 2.00 De? 2.08 Os ZeOe 9.76 
ie ae SD 53 65 56 63 mA 
2 ee Xe 2.00 1.69 1.60 1.54 152, 6.72 
Mechanical ability SD 87 26 20 ay 68 
: Xs 2.29 2.34 PDS 2.43 Por 5.86 
Popsiarity SD eens 65 51 65 65 
; BXG OS 2.83 2.76 2.87 2.86 273 
Understanding of others SD 00 ‘69 ‘65 ‘64 68 
ie. X 1.86 2.03 1.97 1.82 1.84 5.82 
pee ility sp| _.89 87 ‘84 30 85 
5 XG 2.10 DOR 2.23 2.10 2.20 3.68 
ee STD Sess 65 61 58 66 
Personality and attitudinal scales 
ere aciccy, originality) BXg ie oaaal 18.52 15.62 16.91 19.75 86.67 
SD 4.59 4.19 3.73 4.28 4.24 
ee atiers Xa 16.14 16.95 18.21 eed 16.91 4.30 
6 SD| 5.44 5.37 5.30 5.62 5.45 
Readanic te X Sea 4.87 4.46 4.35 4.68 7.14 
; yP cp 2 Ot (282 1.83 1.93 1.95 
Rrecationall type XG 5.14 4.37 5.05 4.68 4.40 13.58 
yP SDV) 1832 1.69 1.61 1.66 1.60 
Ree Saformict type XC 2.76 2.74 DS) 2.87 BYS 2.56 
¥1 SD. neees0 1.54 1.51 1.67 1.59 
Pie sate tine XG 5.62 5.09 S25 5.66 5.03 10.14 
ae SD| 1.65 1.96 1.76 1.94 2.02 
Pte erconal competenc xX 11.24 11.66 11.41 ZEZe: 11.86 4.20 
ae a SA Sua S32 3.30 3.01 3.10 3.19 
CR (en CS CE CA 
N= SF NS) CN Te (N 5S) CN =21) 
Competencies 
Scientific Exe 4.00 Ses 2.04 2.34 2.19 2.46 
SD 2.16 1.96 1.76 1.83 1.71 
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TABLE 6—Continued 
Two-digit VPI codes 
Student characteristics Hike 
CR Gl cS CE CA 
(N= Se GV nt N= Tee Vie a3) ao —= 2) 
Techuieal ox 11.67 4.27 4.63 4.55 5.29 4.54 
SD 2.62 3.07 3.01 2.69 2.69 
Life goals 
Becoming expert in finance and commerce ae aS ue oe ve oe 2.85 
Eee ; xX 3.00 eT 3.00 3.00 2 2.44 
Active in religious affairs sp "82 37 92 ‘07 ‘98 
Executive responsible for work of others - oe Pe a ae a 4.44 
Self-ratings 
5 etre BX 2.67 1.80 1.50 1.74 1.95 3.83 
Mechanical ability Sp ‘47 65 65 93 ‘00 
: x 1.67 2.33 2.68 2.62 Deda 2.85 
; 
Understanding of others Sp 47 7 60 ‘68 20 
Personality and attitudinal scales 
. ; } x 5.67 2.40 2.60 2.81 2.67 3.10 
pin Contosea st type SD} 1.89 88 1.49 1.69 1.61 
ER EI ES EC EA 
(N=2) (N=5) | (N=44) | (V=13) | (V=25) 
Competencies 
Eee EXE 5250) 7.00 10.98 9.69 15.24 3.89 
ey Tieiie SD 450-4 (¥8-63all) 6.52 5.33 5.52 
, Exe 2.50 5.60 6.32 3.38 Tale, 3.59 
Leadership SD 50 1.62 3.54 2.87 2.86 
Life goals 
Developing scientific theory ae ae “a ae Nae ae 2.96 
Personality and attitudinal scales 
Preconscious activity (originality) ee ae Mie ee es oe 5.67 
AR AT AS AC AE 
(N=13) | (N=175) | (N=773) | (N=13) | (V=118) 
Competencies 
Ee DE 3.00 4.00 Dei, 3.69 2.12 17.30 
Sclentife SDV" 2.00 lea s5 i) eetoganh meee. 1.87 
, BX Talhi 5.45 4.79 3.62 5.09 4.71 
pepe SD) 3a 3.62 3.05 2.59 3.37 
7 2 Exe 6.00 25 7.56 6.15 7.18 5.63 
sociation eeucasiond| sp| 2.72 2.61 2.56 2.90 2.78 
ae EX: 9.85 14.79 14.14 Lia 13.82 2.40 
fnustic Sp| 5.92 7.21 6.26 7.36 6.28 
: X 3.46 4.69 5.06 SR) 5.61 Sb) 
hepdersnny SDN 2:98 3.24 3.06 2.61 3.08 
Life goals 
; Poa dh EXG 1.31 1.41 1.20 1.15 1.19 4.83 
Developing scientific theory sp él 73 36 36 36 
, 4 EG 1.85 2.24 1.92 2.00 2.02 3.33 
eS SDs 1.03 1.13 1.04 78 1.02 
: ; , re BXG 1.08 12 1.30 1.69 152 4.43 
Becoming expert in finance and commerce sp ‘7 63 64 Of 85 
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TABLE 6—Continued 


Two-digit VPI codes 























Student characteristics oe ain eo i? - are re 
AR Al AS AC AE 
(NV=13) | (V=175) | (WV=773) | (V=13) | (V=118) | 
ee ay: * ea X 2.23 2.49 2.82 2.69 2.76 4.69 
Active in religious affairs SD ‘OF 1.07 1.02 "99 1.05 
Be fe. Pale i) 1.70 1.84 1.92 2.09 | 3.79 
Executive responsible for work of others op) ‘a9 87 86 1.00 ‘OL 
Self-ratings 
; ae xX Bes 1.83 1.56 1.46 1.54 Meow, 
eee ey SD 70 81 72 50 71 
: ae xX 2.46 2.81 2.89 2.46 2.69 4.15 
Understanding of others SD 63 72 74 50) ice 
Personality and attitudinal scales 
‘ pee spr Boe xX 23.08 23.64 22.39 18.69 ZnS 8.31 
Preconscious activity (originality) SD 5 23 437 4.24 414 443 
: : xX 4.08 4.47 4.76 3.38 4.85 2.48 
pert type SD ges) o2.0t 2.09 | 2.413 1.96 
XE || OB AS 11.88 9.08 LESH 4.21 
ge sonel competency SD, 3.00 3.32 397 3.36 3.04 





Note.—See note of Table 2. 
a All Ff ratios are significant beyond .05 level. 


The analysis in Table 11 was performed to 
learn if role preferences for students with dif- 
ferent VPI codes conform to theoretical ex- 
pectations (Holland, 1966b, p. 65, Table 7). 
The distributions of student role preferences 
for each VPI code were rank ordered and 
correlated with the expected ranks. The re- 
sults in Table 11 are all positive, but only 
rhos of .90 or 1.00 are significant at the 5% 
level for a one-tailed test. In short, students 
with some high-point codes have very pre- 
dictable choices of vocational role, notably, 
Realistic, Social, and Conventional men, and 
Social and Conventional women. 

To summarize, the evidence in Tables 2-11 
suggests that the personality types are useful 
models for interpreting and organizing a broad 
range of information about students. Despite 
some undesirable overlapping of character- 
istics across types, the majority of expected 
typological characteristics conformed to the 
theoretical formulations. However, some re- 
visions of the theory are needed for the role 
expectations for each type, for the assignment 
of a few traits to types, and for the applica- 
tion of the theory to women. 


PREDICTION OF VOCATIONAL CHOICE 
AND OTHER CRITERIA 


The theory assumes that Realistic types 
select Realistic occupations and that Intel- 
lectual types select Intellectual occupations, 
etc. The following analyses test this main 
hypothesis with different assessment devices— 
the VPI and the student’s initial expressed 
vocational choice. In addition, several closely 
related hypotheses were examined: the im- 
portance of a student’s preference of voca- 
tional role in prediction, the prediction of 
degrees of change in a student’s vocational 
aspiration, and the improvement of predictive 
efficiency by using various combinations of 
predictors or special attributes of the pre- 
dictors. 


Classification Scheme 


For the following analyses and all other 
analyses in this report, student vocational 
choices were categorized according to the 
classification scheme developed earlier: six 
categories for men—Realistic, Intellectual, 
Social, Conventional, Enterprising, and Ar- 
tistic—and eight categories for women—TIn- 
tellectual, Social-Intellectual, Social-Conven- 
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TABLE 7 
RELATION OF THREE-Dicit VPI CopEs To STUDENT CHARACTERISTICS (WOMEN) 
Three-digit VPI codes 
Student characteristics ener, Ser amas fe Ser fra 
SIR She SIE SIA 
(N=22) (N=34) (N=78) (N=175) 
Competencies 
Technical x 7.45 6.12 5.41 5.05 351 
ee SD 5.29 3.54 3.60 3.16 
Artisti xX 8.91 7.97 9.62 10.58 2.63 
ae SD 4.07 5,99 56 He eae 
Personality and attitudinal scales | | 
Preconscious activity (originality) - Sis oe eee ele ie 
y ee z = ae SD 4.79 4.13 3.94 4.02 
Wacatiedalserne BX 4.36 5.06 4.72 4.11 4.11 
yP SD 1.55 1.76 1.66 1.75 
I eterserannaliconipetcecs XE) APS 10.41 12.32 ite Oa 
Be ° SD 3.49 2.86 2.84 3.27 
SER SEI SEC SEA 
(N=15) (N=43) (N=130) | (V=173) 
Competencies 
sey xe 1.93 3.35 2.09 2.48 4.95 
oe Spl. ht.5% 1.96 1.68 2.09 
Fi ; X 6.93 4.77 4.39 4.49 3.67 
oe sD| 3.26 2.55 2.88 2.87 
: : xX 7.93 8.81 8.06 8.89 3.50 
Social and educational sp 272 047 252 217 
eae BX 7.60 9.88 8.54 LIGHE. dais 
oe Spi") 3.67 4.04 5.05 5.36 
Life goals 
A xX 1.67 Heal 1.09 ee 7.36 
Developing useful product sD '87 55 31 ‘46 
‘ ; SRN: ’ X 1.47 1.23 WP 112 3.50 
Developing scientific theory SD ‘st 56 30 40 
Self-ratings 
ee EX 1.60 1.93 1.92 1.69 Onis 
SUS SD ‘80 79 ‘81 70 
Personality and attitudinal scales 
< Some fae ep xX 15.40 17.09 15.62 18.20 10.99 
Preconscious activity (originality) sp 326 45] 3.98 3.96 
: Xx 5.00 5.14 4.27 4.38 2.70 
eee SD\ 2.07 1.80 2.02 1.86 
xX 11.13 12756 11.85 12.50 3.29 
Interpersonal competency SD 275 776 3.07 3.10 
SAR SAI SAC SAE 
(N=16) | (N=206) (N=93) | (N=443) 
Competencies 
NS: Xa 1.81 3.46 2.76 2.62 9.71 
POLI SD| 1.24 2.14 2.29 1.89 
4 : XC ees 7.80 7.98 8.58 (OR: 
Social and educational SD 2.63 245 |\ 2.29 219 
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Student characteristics 


TABLE 7—Continued 


Three-digit VPI codes 











SAR SAI SAC . oe i 
(N=16) | (N=206) | (N=93) | (N=443) | 

sai Fe ee neg Tits [e se 

ee BLE Ss ae far | an 
Self-ratings 

Originality om es = eS tes bg 

Math ability SC merigemes ses} eagle nage oe 
Personality and attitudinal scales 

Preconscious activity (originality) es me oe Se se 8.51 

Collegiate type SDH L959 \6 tas. 2021) azo etre 

Interpersonal competency a ue: ae ee ee 2 

ASR AST ASC ASE 
(N=22) | (W=211i) | W=34) | (V=339) Fa 

Competencies ' i z 

Scientific sp| is | 207 | to | a | 

Buen XG 2.09 1.86 279 2.08 6.19 

SD 1.24 1.14 eel 122, 

Social and educational Se i ae Fae mo ag 

Leadership sp| 22 | 27 | 3a | 313 | 
Life goals 

Executive responsible for work of others ee oe oe fer Ae oa? 
Self-ratings 

Popularity cle ost veg abe a 

Math ability Se eee camel dos/$e/ Poy) |. 
Personality and attitudinal scales 

Preconscious activity (originality) ee Rial oe oe a ON 

Collegiate type Me eo |e | 




















Note.—See note of Table 2. 


a All F ratios are significant beyond .05 level. 
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TABLE 8 


RELATION OF SATISFACTION WITH COLLEGE AND PERSONALITY TYPE 
(VPI HicH-Pornt CopEs) FOR FALL SAMPLE 


Women (N = 1383)> 


VPI Men (N = 1353) 














high-point | ——__— —-—— |- wee - 

code Low Middle High Low Middle High 

Real 19.3 (22) | 46.5 (53) 33.3 (38) 0.0 (0) 0.0 (0) | 100.0 (3) 
Int 8.2 (45) 50.5 (277) 41.2 (226) 12.3 (28) 49.8 (113) 37.4 (85) 
Soc 9.4 (20) 50.9 (108) 38.7 (82) 12.4 (93) 44.9 (338) 42.6 (321) 
Conv 12.4 (13) 43.8 (46) 43.8 (46) 4.0 (2) 48.0 (24) 48.0 (24) 
Ent 15.0 (29) 53.9 (104) 31.1 (60) 13.9 (5) 61.1 (22) 25.0 (9) 
Art 10.2 (19) 49.5 (92) 39.2 (73) 17.4 (55) 47.6 (151) 34.7 (110) 








Note.—Chi-square calculated from frequencies shown in parentheses. Abbreviations: Real = Realistic; Int = Intellectual; 
Soc = Social; Conv = Conventional; Ent = Enterprising; Art = Artistic. 

ay? = 21.97; df = 10; p < .02. 

biti 25527 GF = 10s < 102. 


TABLE 9 tional, Social-Enterprising, Social-Artistic, 

RELATION OF EDUCATIONAL ASPIRATION TO Conventional, Enterprising, and Artistic (Hol- 

Personality Types (Hicu-PoInt land, 1966a). Tables 12 and 13 indicate the 
VPI CopEs) 


assignment of vocational choices to vocational 








Men (N=1328)" | Women (N=1363)» Classes for men and women. Students selected 
Velen es peaueet their career choices from a coded list of 99 
high-point} Ba or | Advanced| BA or [Advanced careers. All classifications, as well as the 
rae less | training | less | training scoring and establishing the highest VPI scale 













































































o7, OF, 7 O7, 
(%) | (%) %) /o) codes, were performed by a computer. We as- 
Real 43.8 56.3 100.0 0.0 sumed, then, that the classifications were per- 
Int 17.6 82.4 48.2 S18 formed with perfect reliability. 
Soc 29.1 70.9 58.6 41.4 
Cony 47.6 52.4 80.0 20.0 
Ent 31.6 | 68.4 68.6 31.4 
Art 16.9 | 83.1 45.9 Sal! TABLE 11 
i = 3 ras ' CORRELATIONS (RANK ORDER) BETWEEN PREDICTED 
eae We oe AND OBSERVED ROLE PREFERENCES FOR THE 
bx? = 38.74; df =5;p < .001. PERSONALITY TYPES 
TABLE. 10 Personality types (VPI high-point codes) 
RELATION OF COLLEGE GRADES TO PERSONALITY TYPES 
(VPI Hicu-Pornt Cops) Real | Int | Soc | Conv | Ent | Art 
Men Women Fall sample (1359 men; 1386 women) 
_VPL (N= 1359) (N= 1386)» 
high-point 
code Men .90* .60 ‘SOT O0** aie .20 
SC | 2C+ | SC || 2€6 Women | .75 | .80_) 90% || 90" =]) (S0mmmmem 
Realistic 50.9 49.1 S30) 60.7 
Intellectual 43.4 56.6 42.3 S77 Spring sample (1773 men; 2336 women) 
Social 50.0 50.0 45.3 54.7 
Conventional Soe 44.8 52.0 48.0 
Enterprising 53.9 46.1 55.6 44.4 Men 70 60 LOO 00" * aime od 18 
Artistic 44.1 55.9 44.5 Deo) Women | .50 .60 .90* | 1.00** | .68 10 
Ry? = 1A dy — ols pean OOK * p <.05, one-tailed test. 
by2 = 3.68; ns, ** ) <.01, one-tailed test. 





Agricultural Science 
Architecture 

Civil Engineering 
Farming 


Aeronautical Engineering 
Anthropology 
Astronomy, Astrophysics 
Biochemistry 

Biology 

Botany 

Chemical Engineering 
Chemistry 

Dentistry 

Electrical Engineering 


PSYCHOLOGICAL CLASSIFICATION SCHEME FOR VocaTIONS (MEN) 
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TABLE 12 


Realistic class 


l’orestry 
Geography 
Industrial Arts Education 


Industrial Engineering 
Mechanical Engineering 
Trade & Industrial Education 


Intellectual class 


Engineering; General, Other 
Engineering Science 
Geology, Geophysics 


Oceanography 
Other Biological Science Fields 
Other Health Fields 


Mathematics Education Pharmacy 
Mathematics, Statistics Physical Therapy 
Medical Technology Physics 

Medicine Physiology 


Metallurgical Engineering 
Military Service 
Natural Science Education 


Veterinary Science 
Zoology 


Social class 





Clinical Psychology 
Counseling & Guidance 
Education, General, & 

Other Specialties 
Education of Exceptional Children 
Educational Psychology 
Elementary Education 


Experimental & General Psychology 
Foreign Language Education 
Foreign Service 

General Social Sciences 

History 

History Education 


Industrial & Personnel Psychology 
Physical Education, 
Recreation, & Health 
Social Work 
Sociology 
Theology, Religion 


Conventional class 














Accounting Business Education Finance 

Enterprising class 
Economics Other Business & Commerce Public Relations 
Law Political Science Purchasing 
Management Public Administration Sales 
Marketing 

Artistic class 

Art General Humanities Music Education 
Art Education Journalism, Radio-TV, Other Fine & Applied Arts 
Drama Communication Philosophy 
English, Creative Writing Literature Speech 
English Education Music 





Vocational Choice and VPI * 


The prediction of a student’s final voca- 
tional choice from his first choice 8 mo. 


3These analyses were reported earlier as ACT 
Research Report No. 18 (Holland & Lutz, 1967). 


earlier is shown in Table 14 for men in the 
fall sample. The percentage of correct predic- 
tions varies from 63.2 to 71.2%. The total 
number of correct predictions always exceeds 
base-rate expectations and cannot be at- 
tributed to chance. 


bN 
i) 


Joun L. HoLitanp 


TABLE 13 


PSYCHOLOGICAL CLASSIFICATION SCHEME FOR VOCATIONS (WOMEN) 


Agricultural Science 
Architecture 
Biochemistry 
Biology 


Clinical Psychology 

Dentistry 

Experimental & General Psychology 
Mathematics Education 


Intellectual class 


Chemistry 

Mathematics, Statistics 
Medicine 

Natural Science Education 


Social-intellectual class 


Medical Technology 
Nursing 

Other Health Fields 
Pharmacy 


Other Biological Sciences 
Physics 

Veterinary Medicine 
Zoology 


Physical Therapy 

Political Science, Government, 
International Relations 

Theology, Religion 





Social-conventional class 





Business Education 


Educational Psychology 
Management, Business 
Administration 


Clerical, Office Work 
Social-enterprising class 


Purchasing 


Secretarial Science 


Sales 





Counseling & Guidance 
Education, General & 
Other Specialties 
Education of 
Exceptional Children 
Elementary Education 


Social-artistic class 


History Education 

Home Economics 

Home Economics Education 
Housewife 

Law 

Modern Foreign Language 


Physical Education, 
Recreation, & Health 

Public Relations, Advertising 

Social Science 

Social Work, Group Work 

Sociology 




















English Education Education Speech 
History 

Conventional class 
Accounting 

Enterprising class 
Marketing 

Artistic class 

Art Journalism, Radio-TV, Modern Foreign Language 
Art Education Communication Music 
Drama Library Science, Archival Science Music Education 


English, Creative Writing 
Foreign Service 


Literature 


Other Fine & Applied Arts 
Philosophy 
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TABLE 14 
PREDICTION OF FINAL VOCATIONAL CHOICE FROM I’irst VOCATIONAL CHOICE 
(Fal SampLe, MEN, N = 1359) 
| Final vocational choice 
First | cae ——— — = = - = 
vocational choice | | | | Homer Wat 
Real Int Soc | Conv Ent Artie i Usd) 3} ne NR N 
| | | | hits 
| 1 
+ : = ‘ : an eee é be |= 
Realistic (Real) | 106 13 4 1 2) 14 67.9 | 6 156 
Intellectual (Int) 31 355) 21 14 29 8 Sif 67.6 | 10 525 
Social (Soc) te aso 8 110 1 12 3 19 | 68.8 4 160 
Conventional (Conv) 1 2 42 7 7 led, 59 
Enterprising (Ent 4 5 15 10 155 4 32 67.1 6 231 
Artistic (Art) | 1 2 8 6 48 10 63.2 1 76 
Undecided (Und) | 4 14 13 4 18 4 53 1 111 
No response (NR) | 10 8 5 8 9 1 41 

















Note.—The percentages of hits in Table 14 and succeeding tables are based on the total N in each row and are aconservative 
estimate, because students who are ‘‘undecided”’ or give ‘‘no response”’ in the follow-up survey are included in the total NV. 


The prediction of a student’s final voca- 
tional choice from his highest score (high- 
point code) among six VPI scores, obtained 
8 mo. earlier, is given in Table 15. The per- 
centage of correct predictions ranges only 
from 21.5 to 51.4%. In this instance, it is 
clear that simply asking the student is su- 
perior to using the VPI, although both meth- 
ods of assessing personality types have pre- 
dictive validity. 

The discrepancy between the efficiency of 
a student’s expressed choice and his VPI 
scores becomes even greater when we select 


a subsample of students whose first two voca- 
tional choices, at the time of initial testing, 
fall in the same vocational class: for example, 
physics and chemistry, education and social 
work, art and literature, etc. The exact ques- 
tionnaire items for this analysis were as 
follows: 


My present career choice is: (Select the appropri- 
ate number from the list of coded careers and cur- 
ricula) 

Tf I could not have my first choice (above) I 
would select the following occupation: (Select the 
appropriate number from this list) 


TABLE 15 


PREDICTION OF FINAL VOCATIONAL CHOICE FROM VPI HiGH-PoInt CopE 
(Fatt SaMpte, MEN, V = 1359) 











Final vocational choice 

VPI high-point code | Fs 

Real Int Soc Conv Ent Art Unds ae NR» N 

| 

Realistic (Real) 43 28 9 if 8 17 Baill 2 114 
Intellectual (Int) 70 282 45 9 43 14 70 51.4 16 549 
Social (Soc) 8 32 78 9 34 11 37 36.8 3 212 
Conventional (Conv) 11 19 4 33 23 13 31.4 2 105 
Enterprising (Ent) 9 16 Ws 18 98 2 29 50.8 4 193 
Artistic (Art) 8 31 28 1 41 40 SS Dilias 2 186 








Note.—To make single predictions from the VPI, it was necessary to omit students whose two highest scores were tied. This 
occurrence then necessitated the omission of students with tied profiles from the tables of expressed choice so that the comparisons 
of the VPI and expressed choice are based on identical samples. If, however, ‘‘expressed"’ choice predictions are based on all students 
(with and without VPI ties), the total predictive efficiency shown in Tables 14 and 17 vary only by 1% or less. 

«Und = undecided. 

b NR = no response. 
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TABLE 16 


PREDICTION OF FINAL VOCATIONAL CHOICE FOR STUDENTS WHOSE FIRST 
Two CHoIces FALL IN THE SAME CLASS 


(FALL SAMPLE, MEN, NV = 586) 


linal vocational choice 





First and second 











vocational choices | GF 
Real Int Soc | Cony Ent Art Und ue | NR N 

| | hits 
See — & | = liao. =| ry . ¥ 
Realistic (Real) 36 2 1 6 76.6 i! 47 
Intellectual (Int) 18 239 5 4 10 4 21 78.1 5 306 
Social (Soc) 47 3 1 9 78.3 60 
Conventional (Conv) 12 2 85.7 14 
Enterprising (Ent) 1 z eo 69 2 9 78.4 5. 88 
Artistic (Art) 3 3 Oe, 1 Tore 1 30 
Undecided (Und) 4 1 2 1 5 13 
No response (NR) 9 3 3 6 7 28 


Table 16 reveals that subgrouping students 
whose first two choices belong to the same 
vocational class results in a substantial gain 
in predictive efficiency. Correct predictions 
range from 73.3 to 85.7% as contrasted with 
63.2 to 71.2 for the total sample of men. The 
subgroup of students whose first two choices 
fall in different vocational classes are, as ex- 
pected, less predictable from their first choice 
than students whose first two choices belong 
to the same class. Although the percentages 
of hits are lower—53.0 to 66.7—these per- 
centages still exceed the percentages obtained 
by the use of the VPI—21.5 to 51.4. 

The results for women in the fall sample 








are presented in Tables 17, 18, and 19. The 
main findings for women parallel those for 
men. The analyses for women are identical 
to those for men except for the use of a 
special classification scheme developed for 
women in which the social class is divided 
into four subclasses. This subdivision was 
made to distribute women more equally 
among the classes or types. Expressed voca- 
tional choices predict 34.8-83.8% of later 
choices (Table 17). 

The VPI in Table 18 predicts final choices, 
on the average, less efficiently than asking the 
student (0.0-81.9%). In Table 19, the for- 
mation of a subsample of women whose first 


TABLE 17 


PREDICTION OF FINAL VOCATIONAL CHOICE FROM FIRST VOCATIONAL CHOICE 
(FALL SAMPLE, WoMEN, VV = 1386) 





Final vocational choice 




















First vocational choice ; 
Int i ae en re Conv | Ent | Art | Und ae NR | Total 
Intellectual (Int) 54 19 2 2, 18 2 A | 15: | 42:5) See ons 
Social Intellectual (Soc Int) 10 183 3 6 42 1 1 10 7 | 66.3] 13 | 276 
Social Conventional (Soc Conv) 17 1 1 33 Onl 23 
Social Enterprising (Soc Ent) 2 8 10 1 1 34.8 1 23 
Social Artistic (Soc Art) 4 20 4 4 bos 1 25 | 37 | 83.8) 121"660 
Conventional (Conv) 2 1 1 5 5516 9 
Artistic (Art) 1 3 2 34 73 4) 19. 5251) Sale o) 
Undecided (Und) 1 al 1 1 28 11 | 24 12 85 
No response (NR) 5 4 16 2 7 9 43 
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TABLE 18 
PREDICTION OF FINAL VOCATIONAL CHOICE FROM VPI HicH-Point CopE 
(FALL Sampite, WomEN, NV = 1386) 
Final vocational choice 
VPI high-point code | ’ i; | | 
Soc | Soc | Soc | Soc i | all a F 
Int int | Cael Bat Art Cony | Ent Art | Und hits NR Total 
Realistic (Real) MD 1 0.0 3 
Intellectual (Int) 46 64 4 2 55 1 17 25 20.3 13 227 
Social (Soc) 14 TEL 18 ie 481 3 1 38 54 81.9 26 753 
Conventional (Conv) 5 9 5 2 12 5 3 5 10.0 4 50 
Enterprising (Ent) 7 3 16 1 2 5 0.0 2 36 
Artistic (Art) 12 43 4 a 139 1 1 65 23 20.5 22 Sil 
alnd = undecided. 
b NR = no response. 
TABLE 19 
PREDICTION OF FINAL VOCATIONAL CHOICE FOR STUDENTS WHOSE FIRST 
Two CHoIces FALL IN THE SAME CLASS 
(FALL SAMPLE, WoMEN, NV = 545) 
Final vocational choice 
First and second 
vocational choice Soe Soc ae cae oF 
7O he 
Int Cr iGsay W encuieAct Conv | Art | Und hits NR | Total 
Intellectual (Int) 25 a 1 1 3 65.8 1 38 
Social Intellectual (Soc Int) 2 85 2 2 5 1 1 | 85.0 2 100 
Social Conventional (Soc Conv) 8 1 1 (Pet 1 11 
Social Enterprising (Soc Ent) 1 1 50.0 2 
Social Artistic (Soc Art) 2 8 i 299 8 le: | CRS 7 338 
Conventional (Conv) 1 1 
Artistic (Art) 1 1 8 23 BD || OS 3 38 
Undecided (Und) 1 1 
No response (NR) 1 3 5 1 5 1 16 
two choices belong to the same class yields TABLE 20 
the most efficient predictions—50.0 to 88.5%. SUMMARY FOR SPRING SAMPLE 
Again Table 19 demonstrates that students 
whose first two choices belong to the same % correct predictions 
vocational class are more predictable than a, ae ——- 
= ; Kinds of prediction 
students whose choices belong to different Men Women 
classes (32.6-78.9%). (The data for the (NV = 1773) | (N = 2336) 
latter analysis are not presented. ) Ce ys 
Because of the controversial character of Ghoicee-rotal 68.7 789 
the results obtained for the fall sample, the VPI—highest scale 45.1 59.6 
same analyses were performed for the spring pressed vocational . 
1 ‘th simil lts. Th 1 choice—same 82.5 86.4 
Sap e wil Suaar results. : ese ana Nees Expressed vocational 
involve a longer interval of time (1 yr. in- choice—different 64.2 71.9 


stead of 8 mo.) and a different stage of col- 
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lege life (end of freshman year to end of Without exception, the results in Table 20 
sophomore year, as opposed to beginning of replicate what we found earlier—various cate- 
freshman year to end of freshman year). gorizations of expressed vocational choice are 
Table 20 is a summary of the main results. clearly and substantially superior to the VPI. 


TABLE 21 


COMPARISON OF THE PREDICTIVE VALUES OF HicH-Point VPI CopE AND PREFERRED ROLE, AND 
INITIAL VOCATIONAL CHOICE AND PREFERRED ROLE (FALL SAMPLE) 


Preferred vocational role 


























ze | | 
7 Sue Practitioner Teacher | Leader | Researcher Consultant 
| | } 
| |— bes | 
ji 5 Fhits|) f  %hits| f i hits | % hits | i %hits| f % hits 
Men (N = 1359) 
WAR 
Real 45 22.2 34 47 1 6 OKO) |) il 54.5 9 3545 9 55.6 
Int } 155 SOnuemelnae: 57.8 1° 35 a eee 30.4 | 160 06.3 Wee, 36.4 
Soc ieetsS 27.4 77 39.0 | 27 48.1 | 13 46.2 8 37.5 Weis 42.9 
Conv 42 21.4 32 34.4 3 OOn/ att Oi al 25/0 5a 69.2 
Ent 63 47.6 69 53.6 5 0.0 | 30 63.3 5 60.0 | 21 42.9 
Art al 16.9 65 29.2 9 SO LOm LO 15.0 13 Tal 8 25.0 
Total 30.5 46.9 43.5 | 38.9 58.8 44.8 
Vocational choice | 
Real | 39 61.5 60 WUT 7 Selena 75.0 21 57.1 Sil 84.6 
Int | 129 eae etal emer 64.9 | 22 40.9 | 149 165 Nlea 70.6 
Soc 36 52.8 Si eae Gad: LD) 57.9 13 462) \ ial 81.8 
Conv 20 50.0 17 82.4 1 100.0 2 0.0 2, 1000 ai 88.2 
Ent (6) SOA 87 [LAN 75.0 | 43 107 MN i ADO ee 58.3 
Art 22 50.0 34 ka i ail ea 4 100.0 3 66.7 2 50.0 
Total 56.7 File: 72.6 65.1 Files 73.8 
Women (N = 1386) 
VPI 
Real 1 0.0 2 0.0 
Int 65 15.4 76 OS feaeali7, OOa 12 8.3 31 33:3) 6 50.0 
Soc 205 (Amen oa 87.0 | 80 Move: \\ Sy 93.0 21 66.7 | 43 81.4 
Conv 17 2B. 18 5.6 4 0.0 5 0.0 3 0.0 3 0.0 
Ent 10 0.0 13 0.0 2 0.0 7 0.0 5 0.0 1 0.0 
Art 88 Dem ens 19.6 | 37 27.0 | 18 16.7 11 0.0 | 20 20.0 
Total 48.2 57.8 50.7 57.6 34.8 57D 
Vocational choice 
Int 34 20.6 38 63.2 5 20.0 1 100.0 44 40.9 5 60.0 
Soc Int 62 Geom e29 74.4 | 26 46.2 | 29 552 24 75.0 6 50.0 
Soc Conv 4 50.0 13 76.9 4 100.0 2 50.0 
Soc Ent 1 0.0 | 345 ite 3 0.0 4 75.0 4 25.0 
Soc Art 141 75.9 1S eo es 86.0 16 62-5 Wes 86.5 
Conv 7 ‘Sel 1 100.0 1 0.0 
Ent 
Art 39 35.9 59 O27 eZ 1 52.4 5 80.0 2 0.0 | 14 50.0 
Total 59.7 29.7 64.2 ees 53:5 69.1 























Note.—The Ns in this table represent all available data. The Ns for totals and subtotals differ from the total sample Nsand are 
not internally consistent because students who had tied VPI profiles, gave unclassifiable choices, or failed to respond have been 
omitted. The italicized percentages indicate the ‘‘expected"’ vocational role for each class or type, 

* Undecided, Other, No Response, 
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Vocational Choice, VPI, and Role 


The following analyses (see Footnote 3) 
were performed to see how well we could 
predict a student’s later vocational choice 
when his preferred vocational role was con- 
sidered along with his first vocational choice 
or his VPI scores. Because special vocational 
roles are associated with each environment 
(Holland, 1966b, pp. 64-65), and by implica- 
tion with each type, we assumed that more 
efficient predictions would be obtained for 
students whose personality type and prefer- 
ence for vocational role were congruent. For 
example, an Intellectual type who prefers the 
role of researcher should be more predictable 
than the same type who prefers the role of 
supervisor. 

Table 21, a summary of 24 tables, presents 
the predictive efficiencies of a student’s high- 
est VPI score and his role preference, and his 
expressed vocational choice and his role pref- 
erence. For men, students who prefer the roles 
of practitioner, teacher, leader, researcher, or 
consultant are more likely to give the same 
or closely related vocational choice 8 mo. 
later than are students who fail to respond, 
or who respond “undecided” or “other role.” 
Or about two-thirds of the comparisons re- 
veal that male students with explicit role 
preferences are more stable than students 
with ambiguous preferences. The results for 
women in Table 21 imply that vocational 
role adds little or nothing to a woman’s initial 
expression of vocational choice. 

The italicized percentages within the six 
classes indicate the “expected” vocational role 
for each class or type (Holland, 1966b, p. 
65). The efficiency of these theoretically de- 
rived predictions is unclear because many 
cells in Table 21 are too small to yield stable 
percentages. If the “total” percentages in 
Table 21 are, however, used as a base, the 
“expected” percentages of hits, for men, ex- 
ceed the corresponding base percentages 10 
out of 10 times! For instance, students with 
peaks on Social and preferences for the role of 
teacher have a hit rate of 48.1% whereas the 
hit rate for “teacher” generally is 43.5. The 
results for women are again ambiguous for 
several reasons. Both the VPI and the clas- 
sification scheme create many small cells, and 


TABLE 22 
RELATION OF PERSONALITY TypEs (HiGH-Point VPI 
CopDE) TO STABILITY OF VOCATIONAL CHOICE 
(FALL SAMPLE) 





Mires ancitiaal High-point VPI code 


vocational | — — es 























hoic | has 5 — ie 
ae Real| Int | Soc | Conv | Ent |} Art | Total 
4 | ee (etree dus grad ih ihe 
Men (N = 1359)# 
Ss =e 
No change 50.9 | 50.6 | 42.0 | 49.5 | 40.9 | 47.3 | 644 
Intraclass 14.0 | 16.9 16.0 Se) 21.8 | 14.5 226 
Interclass 35.1 | 32.4 | 42.0 | 37.1 | 37.3 2 | 489 
Women (N = 1386)» 
: 7 = =e 
No change 33.3 | 48.0 | 57.0 | 58.0 | 52.8 | 47. 738 
Intraclass FOnl GrOn| 1Se7 amcor) | 13,9))1300 185 
Interclass 67.7 } 45,41 27.3 1 36.0! 13 38. 463 























4.17; df = 10; p <.20. 
= 10; p <.001. 


it is unclear how to derive a single set of 
explicit predictions from the vocational-choice 
classification for women. 

The efficiency of the predictions obtained 
for expressed choice and vocational role ap- 
proximates those obtained by subgrouping 
students whose first two choices belong to the 
same vocational class. 

In short, we can predict vocational choices 
most accurately by one of two simple meth- 
ods: (1) ask the student about his first two 
vocational choices, or (2) ask him once about 
his vocational intentions and then ask him 
for his preferred vocational role. Either of 
these methods is almost twice as efficient as 
the VPI. 


Degrees of Change 


In Table 22, the relation of personality type 
(high-point VPI code) to degrees of change 
in vocational choice is examined. In this table, 
“no change” equals giving identical responses 
over an 8-mo. interval, “intraclass change” 
equals giving two occupational choices that 
belong to the same occupational class (see 
Tables 12 and 13), and “interclass change” 
equals giving choices that belong to different 
classes. 

Table 22 reveals that, for women, some per- 
sonality types are more stable than others— 
notably the Social and Conventional types. 
The results for men are not significant and 
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fail to support earlier findings for high-apti- 
tude students (Holland, 1962). 


Personality Patterns 


The theory assumes that the consistency 
and the homogeneity of a student’s VPI pat- 
tern, or personality pattern, are indicative of 
the stability of his initial vocational choice 
(Holland, 1966b, p. 43f, p. 76f); that is, 
consistent patterns should be more stable 
than inconsistent patterns, and homogeneous 
profiles should be more stable than heteroge- 
neous profiles. 

Tests of the consistency hypothesis are 
shown in Tables 23 and 24 for men in the 
fall and spring samples. In 11 out of 12 com- 
parisons, the percentages of hits for students 
with the same VPI code favor the students 
with consistent codes. Consistent Realistic 
types are more stable than inconsistent Re- 
alistic types, etc. However, only 4 out of the 
12 comparisons are significant at the 5% 
level (one-tailed test). A new test with a 
larger N is needed to clarify these results. 


The results for women are not only not sig- 
nificant but not worth reexamination in the 
context of this theory. 

In Table 25, students were categorized as 
having VPI profiles of high, middle, or low 
homogeneity (the difference between the high- 
est and lowest VPI raw score), then compared 
for their stability of vocational choice. For 
men, the results clearly support the hypothesis 
—homogeneity is positively related to sta- 
bility, or, spiked profiles are stable and flat 
profiles are unstable (p < .05). The results 
for women are not significant. 

Another test of the homogeneity hypothesis 
consisted of examining the stability of voca- 
tional choice for students whose VPI profiles 
had (1) “no ties’—a single scale was higher 
than any other scale, (2) the two highest 
scales were ‘“‘tied”—both had equal scores, or 
(3) the three or more highest scales were 
tied. The results of this analysis, shown in 
Table 26, were statistically significant and 
according to expectation for men. Again, the 
results for women are not significant. 


TABLE 23 


CONSISTENCY OF THE PERSONALITY PATTERN (Two-Dicrr VPI Cop) AND STABILITY OF 
VOCATIONAL CHOICE AFTER 1 YEAR (FALL SAMPLE, MEN, NV = 1359) 


Final vocational choice 












































VPI i = 
high-point code 
Pate Real "tat “l 'Secl) Cony | matali: APE Unas ie NR> | Total 
Consistent two-digit VPT codes 
Realistic (Real) 34 DD 4 7 4 10 40.9 2 83 
Intellectual (Int) 55 182 12 3 22. 9 38 54.3¢ 14 335 
Social (Soc) 3 13 49 9 22 9 23 Sila 2 130 
Conventional (Cony) 8 11 4 30 22 9 34.9 2 86 
Enterprising (Ent) 6 9 15 16 76 2 26 49.4 4 154 
Artistic (Art) 6 28 Dy 35 She) ay 22.6 2 168 
| 
Inconsistent two-digit VPI codes 
Realistic 9 6 5 4 7 29.0 31 
Intellectual 15 100 33 6 21 5 2 46.7¢ 2 214 
Social 5 19 29 12 2, 14 Sore 1 82 
Conventional a 8 3} 1 4 15.8 19 
Enterprising 3 7 2 2 22 3 56.4 39 
Artistic w 3 1 1 6 2 3 tet 18 





a Und = undecided. 
b NR = no response. 
° x? = 3,00; df = 1; p < .05, one-tailed test, 
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TABLE 24 
CONSISTENCY OF THE PERSONALITY PATTERN (Two-Dicir VPI Cope) AND STABILITY OF 
VOCATIONAL CHOICE AFTER 1 YEAR (SPRING SAMPLE, MEN, N = 1773) 
Final vocational choice 
VPI high-point code | - a 
Real Int Soc Conv Ent Art Und a NR Total 
1 
Consistent two-digit codes 
Realistic (Real) 71 57 15 11 21 1 15 36.8# 2 | 193 
Intellectual (Int) 62 245 33 25 19 23 58.2» Saee 21 
Social (Soc) 6 18 S4 2 24 14 13 515 Dy 163 
Conventional (Conv) 8 5 6 34 35) 1 2 36.6 z 93 
Enterprising (Ent) 11 11) 24 15 78 8 8 48.8¢ 1 160 
Artistic (Art) 5 28 36 1 28 59 18 33.0 4 179 
Inconsistent two-digit VPI codes 
Realistic 26 16 D5 2 12 1 18 25.52 2 102 
Intellectual 13 UD 38 7 29 7 14 Syed 2 225 
Social 15 16 67 1 22 5 10 47.5 5 141 
Conventional 4 6 4 4 1 2” 19.0 zl 
Enterprising 14 WD 8 3 115) 2 + Dis Oe 58 
Artistic 1 8 2 1 2 2 1 11.8 17 





ax? = 3.82; df = 1; p < .05, one-tailed test. 
by? = 2.99; df = 1; p < .05, one-tailed test. 
ex? = 9.05; df = 1; p < .01, one-tailed test. 


Although this second analysis did not use 
the standard definition of homogeneity—the 
difference between highest and lowest VPI 
scale scores—it does suggest that the relative 
sharpness of a profile, as assessed by the crude 
process of counting the number of tied high 
points in a profile, is indicative of a student’s 
stability of vocational choice. Taken together, 
the analyses in Tables 25 and 26 imply that 
the construct, homogeneity, has some validity 
for men, but not for women. 


TABLE 25 


HOMOGENEITY OF THE PERSONALITY PATTERN (VPI 
PROFILE) AND STABILITY OF VOCATIONAL 
CHOICE (FALL SAMPLE) 








Men (N = 1359) Women (N = 1386)» 




















Combinations of Consistency, Homogeneity, 
and Role 


It follows from the theory that students 
whose VPI profiles are consistent and homoge- 
neous, and who express a vocational role 
preference other than “undecided, no re- 
sponse, or other role” will be more predictable 
than students who have inconsistent and 
heterogeneous profiles, and who fail to ex- 
press a role preference. To learn if the effects 


TABLE 26 


HOMOGENEITY OF A STUDENT’S VPI PROFILE AND 
STABILITY OF His VOCATIONAL CHOICE 
(FALL SAMPLE) 

















Men (N = 1402) Women (N = 1427)» 
Homo- ; 
geneity Hits Misses Hits Misses 
if % if % uf % i % 
No ties 816 | 67.6 | 391 | 32.4 | 893} 71.0 | 365 | 29.0 
2 ties 77 | 64.7 42 | 35.3 80 | 66.7 40 | 33.3 
3 or more 40 | 52.6 36 | 47.4 34 | 69.4 15 | 30.6 
Total 933 469 1007 420 





Homo- : 
geneity Hits Misses Hits Misses 
f % ig % fe % ii % 
High 201 | 46.4 | 232 | 53.6 | 248 | 52.9 | 221 | 47.1 
Middle 205 | 42.6 | 276 | 57.4 | 276 | 55.7 | 220 | 44.3 
Low 168 | 37.8 | 277 | 62.2 | 209 | 49.6 | 212 | 50.4 
2x2 = 6.81; df =2:p <.05. 
by? = 3.31; ns. 


ay? = 7.43; df =2;p<.05. 
by? = 1.04; ns. 
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TABLE 27 
PREDICTION OF FINAL VOCATIONAL CHOICE FROM STUDENT’S VPI PROFILE AND ROLE PREFERENCE 
Samples (males only) 
VPI profile me ql 
referre S ers 
Rake Fall Spring Fall-Inte 
Consistency | Homogeneity % hits | if % hits iF % hits | f 
Ce + Hi + Explicit ce 51.8 | 305 53.4 509 68.2 107 
yb + Hi + Explicit = 522 oe tals 53.0 185 54.4 49 
Cc + Lo + Explicit == 44.2 330 45.0 420 S12 63 
I ob Lo + Explicit = 45.0 160 Biko 233 47.5 56 
Cc + Hi + Ambiguous = 33.3 126 39.5 124 40.4 13 
I + Hi + Ambiguous = 26.2 42 Seal 56 30.0 10 
Cc + Lo + Ambiguous = 31.8 195 39.1 156 Didi 13 
I a Lo + Ambiguous = 25.6 86 28.9 90 19.4 6 














a Consistency. 
b Inconsistency. 
¢ Fall-Intellectual. 


of consistency, homogeneity, and vocational 
role are cumulative so that their occurrence 
in combination results in more efficient pre- 
dictions, all possible combinations of these 
variables were tried. Table 27 presents the 
results of these analyses for men in the fall 
and spring samples. 

The results suggest that consistency, homo- 
geneity, and vocational role preference cumu- 
late to improve the prediction of vocational 
choice, but their relative contributions appear 
very unequal. Explicit role preference ap- 
pears to contribute most to the prediction, 
followed next by homogeneity, and last by 
consistency. 

The predictions in Table 27 can be im- 
proved by controlling for VPI high-point 
code, that is, by recalculating the analyses in 
Table 27 for a single high-point code at a 
time. These analyses were done, but they re- 
quire large Ns to insure reliable percentages. 
For the largest subsample—male students who 
peaked on the Intellectual scale of the VPI— 
the results are especially clear and are shown 
in the last column of Table 27. Similar 
analyses for the smaller remaining subsamples 
produced similar but less-consistent results. 
(It is instructive to note that the best predic- 
tive combinations for men and women in 
Table 27 are either equal to, or less than, 
the predictive validity of a student’s first 
vocational choice in Tables 14 and 17.) Aside 


from their theoretical value, the results in 
Table 27 reinforce some common interpreta- 
tions about vocational counseling clients; 
namely, the row at the top of the table 
characterizes the people vocational counselors 
rarely see, whereas the bottom row charac- 
terizes the kinds of people counseling agencies 
usually deal with; that is, clients who pro- 
duce confused interest profiles and who are 
unclear or uncertain about the vocational 
role they want to play. 

To summarize, the predictive data in 
Tables 14 through 27 are strong evidence that 
the types and their associated constructs— 
vocational role preferences, consistency, and 
homogeneity—generally yield efficient and 
theoretically relevant predictions for men but 
only occasionally for women. At this point, 
the efficiency of self-expression of vocational 
choice suggests that a more thorough explora- 
tion of the general usefulness of this tech- 
nique for defining types is desirable. Simi- 
larly, a redefinition of consistency could be 
achieved by the use of a person’s first and 
second vocational choice, and this new defini- 
tion appears now to have more potential than 
the definition of consistency by the VPI. 


INSTITUTIONAL [INFLUENCES 


The following analyses were performed to 
learn if a college’s influence upon a student’s 
choice of vocation could be predicted from 
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the theory by characterizing students and col- 
leges in terms of personality types and en- 
vironmental models, and then testing the ap- 
propriate hypotheses (Holland, 1966b, p. 
72f). More explicitly, the analyses are con- 
cerned with the stability of vocational choice, 
or whether or not a student switches from one 
vocational class to another. A second group 
of analyses was performed to learn if a stu- 
dent’s satisfaction with his college is pre- 
dictable within the context of the theory. 


Assessment of Types and Environments 


In these analyses, a student’s initial voca- 
tional choice was used as an index of his 
personality type (see Tables 12 and 13) and 
as a control for student input. The student’s 
initial vocational preference is assumed to be 
a powerful method for equating students in 
the same vocational class across colleges: 
first, an earlier study (Astin, 1965) revealed 
that a college freshman’s vocational choice 
and initial major field were the best predictors 
of his vocational choice at graduation and 
that other variables added only .01-.03 to 
the multiple correlation. Second, several anal- 
yses were performed to examine the adequacy 
of initial vocational choice as a control varia- 
ble. For this purpose, average high school 
grades and family income were related to 
initial and final vocational choices in the fall 
sample for men and women. The median tests 
(2 X 6 chi-squares) revealed that a student’s 
family income and average high school grades 
were significantly related to both his first and 
last vocational choice. And, for women, high 
school grades were related more to first than 
last vocational choice. For men, the converse 
was true, but the difference was trivial (x? 
= 16.5 versus 19.7). For women, family in- 
come was related more to first than last voca- 
tional choice; for men, income was related 
more to last than first choice, but when the 
chi-squares were converted to contingency 
coefficients (C = .14 versus C = .18) only a 
slight difference was obtained. 

Taken together, these analyses suggest that 
a student’s initial vocational choice is a sub- 
stantial control that usually incorporates the 
variance contributed by control variables such 
as grades and income. In addition, initial 
vocational choice incorporates and surpasses 


the contribution that the VPI makes to pre- 
diction (see earlier section on prediction). 

The institutional environment for each col- 
lege was characterized by the Environmental 
Assessment Technique (EAT) (Astin & Hol- 
land, 1961). The assignment of major fields 
to the six environmental models (Realistic, 
Intellectual, Social, Conventional, Enterpris- 
ing, and Artistic) was made according to the 
revised criteria (Holland, 1966b, pp. 122-— 
124). This particular version of the EAT is 
only slightly different from earlier versions. 
The source of the basic data was Wright 
(1966). 

The computation of EAT scores was as 
follows: For each college, the number of de- 
grees earned in each field were assigned, by 
the criteria indicated earlier, to one of the 
six environmental models. The absolute num- 
bers for each environmental model were con- 
verted to percentages of the total graduating 
population for that college. The resulting set 
of six percentages forms the environmental 
profile for a college. For example, a college 
of 200 graduating seniors might have the 
following profile: 


Type Number EAT score 
(percent) 
Realistic 20 10 
Tntellectual 8 4 
Social 12 6 
Conventional 128 64 
Enterprising 28 14 
Artistic 4 2 
Total 200 100 


Stability of Vocational Choice 


Several analyses were performed to test 
the hypothesis that a student is more likely 
to maintain his vocational choice if he lives 
in a “congruent” environment—that is, Real- 
istic types will be more stable in Realistic col- 
lege environments; Intellectual types in In- 
tellectual environments, etc. (Holland, 1966b, 
p. 73f). First, for each sex at each college, 
6 X 6 tables, based on the classification 
scheme summarized earlier, were formed to 
show the relation of first to last vocational 
choice for time intervals of 8 or 12 mo. for 
the fall and spring samples. Second, per- 
centages of stability for each vocational class 
at each college were computed. Third, a total 
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TABLE 28 


CORRELATION (RHO) BETWEEN THE SHAPE OF THE 
COLLEGE PROFILE (EAT CODE) AND THE STABILITY 
OF STUDENT VOCATIONAL CHOICES IN THE 
CORRESPONDING AREAS OF CHOICE 


Spring sample 

















College Men | Women 
A 03 200) 
B mii — 35 
Cc 84* 1.00 
D | .64 | — 

E 16 1.00* 
F 14 1.00* 
G .20 1.00 
H Al —.50 
I —.47 —.50 
J 94** 80 
K —.07 80 
L vA 50 
M 24 88 
N —.54 80 
O AT 1.00 
ie — .03 — .40 
Q 16 1.00 
R 14 95 
S — .03 80 
ot 26 80 
U — 1.00 
V — ils) 
Fall sample 
| 

College Men Women 
A ml 85 
B 03 05 
( 09 _ 
D .20 1.00* 
E 20 80 








Note.—The significance levels for women vary because the 
number of vocational classes at individual colleges varies from 
two to four. For men, the number of vocational classes always 
equals six so that all rhos are based on an N of six. 


percentage of stability was calculated for 
each college, that is, the percentage of all 
students who reported either identical choices 
or choices that belonged to the same voca- 
tional class on both occasions (no changers 
plus intraclass changers). Fourth, a college’s 
EAT scores were ranked from one through 
six or from highest to lowest percentage of 
graduating seniors in each of six areas of 
study. Finally, the percentages of stability 


for the six personality types and the six EAT 
scores were intercorrelated (rho) for each col- 
lege. The results of these analyses are shown 
in Table 28; a positive correlation means that 
high percentages of stability for a given 
type are associated with high percentages 
of students graduating in the same area. Con- 
versely, low percentages of stability are as- 
sociated with low percentages of students 
graduating in the same area. Put another way, 
the positive correlations suggest that a stu- 
dent is more likely to maintain his present 
vocational choice if he attends a college in 
which his choice is congruent with the most 
popular fields defined according to the present 
six-category scheme. In contrast, a student is 
more likely to change fields if he attends a 
college in which his choice is atypical or un- 
popular. 

The use of a scale of vocational stability 
(six percentages) and a scale of environments 
(six percentages) instead of a dichotomy of 
congruent and incongruent interactions is a 
comprehensive and severe test of the con- 
gruency hypothesis. Many percentages were 
based on small Ns and are markedly unre- 
liable; the expectation that these two sets of 
percentages will be ordered in identical fashion 
is indeed a grandiose hypothesis. 

In Table 28, only 5 out of the 50 correla- 
tions are significant beyond the 5% level, but 
41 out of the 50 are positive, an outcome 
that is statistically significant by sign test 
and congruent with the hypothesis. The mag- 
nitude of the relationship, however, is am- 
biguous, for the rhos are based on Ws of only 
six for men and two to four for women. At 
best, the results imply that the congruency 
hypothesis has some tenuous validity. 

To secure a more explicit test of the con- 
gruency hypothesis, we correlated the per- 
centages of vocational stability for each type 
with the rank of the corresponding environ- 
mental model (EAT variable) across col- 
leges. (The rank of a given EAT variable 
within a college was reranked across colleges. ) 
Because the between-college analyses in Table 
29 are concerned with only a single pair of 
student-college variables, they are more easily 
interpreted than the within-college analyses of 
Table 28, in which each correlation involves 
a simultaneous test of six pairs of student- 
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TABLE 29 
CORRELATION (RHO) BETWEEN THE RANK OF EAT ENVIRONMENTS AND THE RANK OF THE 


STABILITY OF CORRESPONDING VOCATIONAL CHOICES 








Spring sample Fall sample 

Men ip Women ai Men | if | Women | i 
Real E vs Real VCs 45* 20 — — 67 5 == = 
Int E vs Int VCs 0 20 =O07 = 19 A7 5 Sars) 4 
Soc E vs Soc VCs .29 20 — .06 21 70 5 1.00* 4 
Conv E ys Conv VCs no2 20 sol 9 36 5 14 4 
Ent E vs Ent VCs O1 20 a — .86 5 -- -— 
Art E vs Art VCs eS 20 aDD 21 08 5 81 - 











Note.—/’s equal number of colleges, not students. 
aE = environment. 
b VC = vocational choice. 


* pb < .05, one-tailed test. For N of 20, p < .05 = rho of .377. 


tie (1; 


college variables. Although only 4 out of the 
20 correlations (rho) in Table 29 are sig- 
nificant, all 4 are positive in direction. Taken 
together, all statistically significant findings 
in Tables 28 and 29 support the congruency 
hypothesis: students maintain their voca- 
tional choices when their college tends to be 
dominated by students whose vocational 
choices belong to the same general class. 

To test the hypothesis that homogeneous 
environments are ‘conducive to vocational 
stability” (Holland, 1966b, p. 67), the per- 
centage of stability for all students at a col- 
lege and the homogeneity of a college’s en- 
vironment (the percentage difference between 
the largest and smallest EAT scores) for sam- 
ples of males and females in the spring and 
fall samples were intercorrelated. The cor- 
relations (product-moment) in Table 30 are 
all positive and support the hypothesis, but 
only one out of the four correlations is sta- 
tistically significant. The results imply that 
students are more apt to maintain their orig- 
inal choice when they attend colleges with 
well-defined or spiked profiles, and that stu- 
dents are more apt to change their vocational 
aspiration when they attend colleges with flat 
profiles or colleges with about equal numbers 
of students in the six vocational classes de- 
fined earlier. 

The results in Table 30 can also be ex- 
plained as simply the outcome that would be 
expected from common knowledge about the 
size of an educational institution; that is, a 


student will tend to remain in his field of 
initial choice in a small college because there 
are fewer alternatives, and he will tend to 
change field in a large college because there 
are many alternatives. To test the validity of 
this rival hypothesis, stability of vocational 
choice at an institution was correlated with 
its size. These results are also shown in 
Table 30, and they imply that size is a less 
plausible hypothesis than homogeneity is. The 
single significant correlation supports the size 
hypothesis, but the remaining three correla- 
tions are not significant and in the wrong 
direction. In this regard, it should be noted 
that “homogeneity” is a complex construct 
that incorporates size as well as variability 
among environmental variables (EAT); that 
is, large and diverse institutions are apt to 
be “heterogeneous” while small institutions 
are apt to be “homogeneous.” 


TABLE 30 


CORRELATION (PRODUCT-MOMENT) OF STABILITY OF 
StuDENT VOCATIONAL CHOICE WITH HOMOGENEITY 
AND SIZE OF AN INSTITUTION 








Spring sample Fall sample 








Men Women Men Women 
(C= 920) CN — 21) (Ne 5) (N = 4) 

Homogeneity 22 .60** -68 .23 
Size 39 24 61 — ,86* 





Note.—Percentages were transformed by arcsin function 
to render distributions of percentages more normal. N = num- 
ber of colleges, not students. 

* b < .05, two-tailed test. 

*E b < .01, two-tailed test. 
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TABLE 31 


THE CORRELATION (RHO) BETWEEN THE SHAPE OF THE 
COLLEGE PROFILE (EAT) AND INITIAL AND 
FINAL SATISFACTION WITH COLLEGE 





























Men Women 
cole ——————— = Sanne 
Initial | Final Initial Final 
Spring sample 
A 2.0 ESon al 719 39 
B — 39 —.59 33 .26 
c Ae || 30 alls — 26 
D GI 64 .86* 
E — .09 09 — .26 09 
F —.43 —.03 — —- 
G —.37 — 37 — .37 04 
H 76 Poi —.20 —.31 
I .20 33 14 26 
J —.31 —.26 26 49 
K —.60 —.31 — .66 Fa5 
ib .63 39 81 29 
M 37 09 A7 .89* 
N 59 14 —.47 49 
O —.79 —.19 253 56 
P 09 —.37 44 56 
Q .03 —.31 .20 .89* 
R —.37 21 .16 61 
S .63 E50 59 —.49 
cs — — .66 .89* 
U _- -- 43 oil 
V aT —.81 44 62 
Fall sample 
A 43 ll oil ana 
B 03 —.47 43 .60 
¢ 24 3618) — 
D 10 .03 —.71 —.03 
E 03 81 .90** 39 
* p= 05 
Ky < 01 


Satisfaction with College 


To test the hypothesis that a student’s 
satisfaction with his college will be greater if 
his type is congruent with his college environ- 
ment (Holland, 1966b, p. 73), initial and 
final mean satisfaction scores for students of 
different types were correlated with the ranks 
of the environmental models of a college. 
More explicitly, initial and final mean satis- 
faction scores were computed for the six types 
at each college. Then the means for the six ini- 
tial and final satisfaction scores were ranked 


one through six. Last, ranks for initial and 
final satisfaction were correlated with the 
ranks for a college’s EAT (one through six). 
The rhos for these analyses are presented in 
Table 31. The results for men clearly negate 
the hypothesis—satisfaction with one’s col- 
lege is associated as much with incongruency 
of personality types and college environment 
as it is with congruency. 

The results for women appear to support 
the hypothesis—satisfaction with one’s col- 
lege increases if one’s personality type tends 
to be congruent with the more popular areas 
of study at the college. For women, 17 out of 
25 pairs of correlations for initial and final 
satisfaction increase over time—trends that 
suggest congruency of student and college 
lead to greater satisfaction. Reliance upon 
the increase in correlation between initial and 
final satisfaction serves as a useful control 
for initial level of satisfaction. 

In the final analysis, the average final satis- 
faction score for all students at a college was 
correlated with the homogeneity score for 
their college environment. Other things being 
equal, the results in Table 32 support the 
hypothesis that students are more likely to be 
satisfied in colleges characterized by their 
flat profiles—about equal percentages of per- 
sonality types. Conversely, students are likely 
to be less satisfied in colleges with a well- 
defined or homogeneous student body. These 
results fit the theory that a student is more 
likely to find friends if he attends a large, 
diverse institution (where there will be many 
students of diverse types) than if he attends 


TABLE 32 


CORRELATION (PRopUCT-MOMENT) OF HOMOGENEITY 
AND SIZE OF COLLEGE WITH 
STUDENT SATISFACTION 








Spring sample Fall sample 








Men Women Men Women 
CN = 20)" =21) 4 G5) Wea 

Homogeneity —.54** —.43* .00 —.40 
Size ma OL — .87** — .96** 





Note.—Percentages were transformed by arcsin function 
to render distributions of percentages more normal. N = num- 
ber of colleges, not students. 

* p< .05. 

ae le 
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a small, selective institution (where there will 
be fewer students of only a few types). 

The correlation between institutional size 
and satisfaction with college is also shown in 
Table 32. In general, size appears to be a 
slightly less reliable predictor of satisfaction 
than homogeneity. 

The findings in Tables 31 and 32 imply that 
a student’s satisfaction with his college fol- 
lows the following theoretical rules: 


1. Within the hierarchy of six types of stu- 
dents at a college, the higher a student’s 
choice stands in that hierarchy, the greater 
the likelihood of satisfaction. This is simply a 
technical way of saying that students are 
happier the more closely they resemble the 
majority of students. 

2. Students are more likely to find satisfac- 
tion in colleges that possess about equal num- 
bers of each personality type. The assumption 
here is that heterogeneous colleges have a 
diverse student population so that a student’s 
opportunity for finding congenial friends is 
greater than it would be at a college populated 
largely by a single kind of personality type. 

3. The most satisfied student should be a 
student who resembles the most popular type 
at his college and whose college has a hetero- 
geneous profile of types—the distribution of 
types is about equal. 

4. The most dissatisfied student should be 
a student who resembles the most unpopular 
type at his college and whose college has a 
homogeneous distribution of types—the dis- 
tribution of types is very unequal so that a 
single type dominates the college with only 
weak representation of the remaining types. 
Thus, his chances for finding congenial friends 
are less than they would be in a heterogeneous 
student body. 


Taken together, the applications of the the- 
ory to student-college interactions in Tables 
28-32 were a partial success. These analyses, 
however, clearly reveal the need for replica- 
tion in a larger number of colleges and for 
more complete student samples at individual 
colleges. Further, the findings that contradict 
the theory require clarification. 


DISCUSSION 


Although the results were obtained for rela- 
tively typical samples of college freshmen and 
sophomores, the hypotheses tested still need 
replication. The large sample loss in the fol- 
low-up assessments casts some doubt upon 
the findings despite the failure to find marked 
differences between the initial and final sam- 
ples. In addition, the need to generalize the 
findings to more diverse populations remains, 
although the typical college student is an im- 
provement over the typical National Merit 
finalist—the source of data for nearly all 
earlier investigations. 

In this regard, the present study, using 
typical college students, appears to replicate 
the major findings obtained earlier with sam- 
ples of superior scholastic aptitude (Holland, 
1962, 1963, 1963-64, 1964; Holland & 
Nichols, 1964). If anything, the results of 
the present study are more consistent with 
the hypotheses in the theory than were the 
results of earlier studies. For example, stu- 
dents characterized as different types more 
frequently show the presence of expected 
traits, and there is less contradictory over- 
lapping of traits than before. Of special im- 
portance, it is still possible to distinguish 
people on the third scale of a three-scale 
profile and this accords with the theory. This 
severe test of the theory implies that its con- 
cepts have considerable explanatory power. 

The results of the college-effects analyses 
in the present and earlier studies are also 
consistent with one another in that all studies 
reinforce the congruency hypothesis. But be- 
cause identical analyses were not performed— 
largely because they were not possible or ap- 
propriate—it is not possible to clearly indi- 
cate specific replications or contradictions. 
This problem is also compounded by the 
controversy about the best statistical method 
for the analysis of college influence. The need 
for replication of these analyses is great, for 
the findings about institutional influence are 
tenuous. In addition, large numbers of col- 
leges are required to test all of the main hy- 
potheses in the theory. In the present study, 
the small sample of colleges precluded the 
test of many hypotheses and it limited con- 
fidence in the outcomes. At the same time, 
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the general magnitude of the college effects 
found here are large, perhaps the largest, rela- 
tive to the institutional influences assessed 
by others. The results suggest that the theory, 
in conjunction with the Environmental As- 
sessment Technique (EAT), warrants further 
examination, for the results imply that the 
EAT provides a sensitive environmental as- 
sessment. 

To a limited degree, the results are con- 
sistent with a recent study by Astin (1965) 
in which two out of three positive findings 
revealed that stability of vocational choice is 
most closely associated with congruency of a 
student’s vocational choice and the corre- 
sponding EAT variable. For instance, Real- 
istic vocational choices were positively cor- 
related with the Realistic Orientation at a 
college. In another study by Brown (1966), 
stability of a student’s vocational choice was 
closely related to whether or not he lived in 
a dorm with students having similar major- 
field intentions. Brown’s experimental study 
is a precise and unequivocal study of the con- 
gruency hypothesis, but congruency was de- 
fined in terms of scientific and humanistic 
types rather than the present types. Simi- 
larly, he found that general satisfaction with 
one’s college was positively associated with 
living in a dorm where the majority of stu- 
dents had similar major-field choices. In his 
study of vocational plans in college, Davis 
(1965) also reports substantial evidence for 
the congruency hypothesis as well as the 
“birds of a feather” hypothesis. Taken to- 
gether, these studies support the present 
theory despite their differences in statistical 
design and definitions of vocational choice 
and college environment. 

There are several major theoretical implica- 
tions of the results. First, the definition of 
types by the classification of a student’s voca- 
tional choice yields predictions of subsequent 
choices that are superior to predictions ob- 
tained from the definition of types according 
to the VPI. In future studies, then, it seems 
advisable to examine the theory by using ex- 
pressed vocational choice as the main tech- 
nique for defining a person’s type. Similarly, 
it seems desirable to study “consistency of 
personality pattern” in terms of a person’s 
first two expressed vocational choices. Con- 
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sistency of personality pattern might be elab- 
orated as follows: (a) a person gives identical 
responses for his first two preferred vocational 
choices (when asked for his first two choices, 
he perseverates and names identical occupa- 
tions), (&) a person gives two choices in the 
same vocational class, (c) a person gives two 
choices in ‘‘consistent” classes, and (d) a 
person gives two choices in “inconsistent” 
classes. These degrees of consistency promise 
more positive findings. The substantial predic- 
tive efficiency of the analyses of students 
whose vocational choices belonged to the 
same or different vocational classes was a 
partial test of this idea. 

Second, the theory needs several revisions. 
A person’s preference for various vocational 
roles needs to be clarified, for the assignment 
of roles to types is often contradicted by the 
evidence. Another more extensive revision of 
the theory is needed to provide more con- 
sistent results in the case of women. The use 
of the special classification for women im- 
proved the prediction of their vocational 
choices, but many other analyses are still 
significant for men but not for women. 

Third, the success of the VPI as a clas- 
sifying tool for assigning occupations and 
fields to classes implies the value of both 
making the classification more comprehensive 
and studying more subclasses. The extension 
of the classification scheme (Holland, 1966a) 
can be performed simply by assessing repre- 
sentative samples of occupations with the 
VPI, computing means, and assigning the oc- 
cupation to the appropriate class in terms of 
its coded mean VPI profile. 

Last, the partial success of the theory as 
a way to interpret student-institutional inter- 
actions implies that employee-organizational 
interactions and outcomes such as tenure and 
job satisfaction may follow the same prin- 
ciples. The typological and environmental 
formulations of the theory would provide a 
way to control and interpret such interactions. 
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Poe BASIC INTEREST SCALES FOR THE STRONG 


VOCATIONAL INTEREST BLANK FOR MEN! 


DAVID P. CAMPBELL, FRED H. BORGEN, SUZANNE H. EASTES, 
CHARLES B. JOHANSSON, anp ROBERT A. PETERSON 


University of Minnesota 


22 homogeneous content scales were developed for the Strong Vocational Interest 
Blank (SVIB) by clustering items with high intercorrelations. The resulting scales 
were named: Adventure, Agriculture, Art, Business Management, Law/Politics, 
Mathematics, Mechanical, Medical Service, Merchandising, Military Activities, 
Music, Nature, Office Practices, Public Speaking, Recreational Leadership, Reli- 
gious Activities, Sales, Science, Social Service, Teaching, Technical Supervision, 
and Writing. Mean scores are presented on each of these scales for 113 occupational 
samples, and test-retest statistics are included for groups tested and retested over 2 
wk., 30 days, 3 yr., 22 yr., and 30 yr. Similar information is presented for several 
student samples including a group who were tested as freshmen, sophomores, 5 yr., 
and 15 yr. after graduation. The main purpose of these scales is to provide some in- 
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terpretation aids for the traditional occupational scales of the SVIB. 


The regular scales now in use for the Strong 
Vocational Interest Blank (SVIB) were de- 
veloped by comparing the responses of men in 
specific occupations with those of a men-in- 
general (MIG) group. Those items selected 
more often by the occupational sample serve 
as a scale for identifying the characteristic 
interests of the men in that occupation. These 
are usually termed “‘empirical” scales, and the 
scores provide an index of similarity between 
the individual’s interests and the interests of 
men in the specified occupations. A substantial 
amount of research (see, e.g., Campbell, 1966; 
Darley & Hagenah, 1955; Strong, 1943, 1955) 


1 The statistics reported in this paper were expensive 
to accumulate, and many agencies provided assistance. 
The overall project was funded by the National Insti- 
tutes of Health through Grant 1428-03, most of the 
computer time was furnished by the University Com- 
puter Center, University of Minnesota, and considerable 
clerical help was supplied by the Student Counseling 
Bureau, University of Minnesota. 
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has shown that these scores are useful in psy- 
chological assessment projects and in counsel- 
ing situations. 


Problems with the Empirical SVIB Scales 


The major advantage of these empirical 
scales is that they include all of the discrimi- 
nating items for each occupation in one scale; 
this makes interpretation for that one occu- 
pation fairly easy. With these scores, a counse- 
lor can say something like, ‘““You have interests 
similar to lawyers.’’ However, the nature of 
empirical scales makes further psychological 
interpretation difficult. If the individual asks 
the obvious, ‘‘What does it mean to have 
interests similar to lawyers?” the counselor 
must fall back on what he has learned about 
lawyers from other sources. Although the re- 
lated scales on the SVIB profile provide some 
general flavor, interpretation is still shallow 
except when done by well-trained sophisti- 
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TABLE 1 


SAMPLE SVIB ITEM INTERCORRELATIONS 
































% 500 Men In General 
Item 
Like | Indif-| Dis- | Total 
ferent | like 
Algebra 
Arithmetic 
Like 51 12 06 69 
Indifferent 03 10 06 19 63 
Dislike 01 01 10 12 
Total 55 23 22 100 
Geometry 
Chess 
Like 16 04 03 23 
Indifferent 24 12 06 42 A 
Dislike 15 11 09 oo 
Total 55 27 18 100 
Actor 
Listening to a 
story vs. 
Telling a 
story 
Like 09 19 24 52 
Indifferent 06 10 16 32 | —.19 
Dislike 08 04 04 16 
Total 23 33 44 100 














cated counselors who have had considerable 
experience in working with the SVIB and who 
have studied the research literature. 

A second major disadvantage is that there 
is no limit to the number of empirical scales; 
in theory one could be developed for each occu- 
pation—but there are over 20,000 occupations 
listed in the Dictionary of Occupational Titles. 
While one can, in only partial fantasy, visualize 
a computerized system where this information 
would be available for hundreds of occupations, 
this is still far beyond current capabilities. In 
recent practice, to compensate for this rela- 
tively narrow coverage of the occupational 
world, the counselor usually extrapolates from 
the available SVIB scales to other occupations 
not listed on the profile. For example, if a 
student inquires about geology, scores on the 
Engineer and Chemist scales are probably 
relevant. Some such extrapolation will be in- 
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evitable, no matter what the final method used, 
but it should be as easy and direct as possible. 

A third disadvantage of these empirical 
scales is that they are difficult to work with in 
research studies. If an investigator wishes to 
study men who have survived in a specific 
occupational setting versus those who have 
not, he finds it cumbersome to compare these 
two groups on all of the 60 current empirical 
scales. While he can, and usually does, calcu- 
late the mean differences on each of the scales, 
the resulting statistics do not offer either a 
parsimonious nor easily interpretable method 
of understanding what distinguishes between 
the interests of these two samples. 

The empirical success of the SVIB has 
created most of these problems. Because the 
scales are useful and do provide an easily inter- 
pretable score for a specific occupation, there 
has been considerable pressure to build more 
of them, further increasing the complexity of 
the profile. From all indications, this will con- 
tinue as more occupations feel the need for 
self-study. This is good, for this detailed 
psychometric information on a wide variety of 
jobs is essential, but this prospect makes it 
even more imperative to develop a simpler way 
to summarize the results. 

What both the counselor and researcher need 
is another system of scoring to supplement the 
occupational scales, a system containing rela- 
tively few scales, but scales which could be 
used to generalize beyond a single occupation. 

Clark (1961), in his research with the Minne- 
sota Vocational Interest Inventory, has shown 
that one way to do this is to work with mea- 
sures that are homogeneous in content. Each 
of his homogeneous scales (or area scales as 
they are called on the profile) reflects interests 
in one type of activity or in closely related 
activities ; thus, they are “‘pure” content scales. 

Such a set of scales has been constructed for 
the men’s form of the SVIB; this monograph 
is a report of their development and of their 
psychometric characteristics. Because this was 
an attempt to identify the important clusters 
of interests represented in the SVIB, these 
scales have been termed ‘‘basic interest” 
scales. 


Basic SCALE DEVELOPMENT 


To build these scales, an item intercorrela- 
tion matrix was generated for the SVIB and, 




















Basic INTEREST SCALES FOR THE SVIB 3 
TABLE 2 
ADVENTURE CLUSTER 
Item oi | 14 | 231 12 | 334 | 86 | Dl | 337 | 321 

211 Pursuing bandits in a sheriff’s posse 24 26 SY 23 At 

14 Airplane pilot 25 27 37 16 De, 
231 Climbing along edge of precipice 23 12 2B 19 10 

12 Auto racer 28 40 11 22 32 
334 Taking a chance 13 05 16 17 11 

86 Secret Service man 42 29 15 30 14 
277 Men who live dangerously 21 22 20 DS) 16 27 
337 Thrilling, dangerous activities® 36 18 26 28 62 25 31 
321 Airline pilot 18 51 13 23 12 18 2 30 





Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 


from this matrix, clusters of items with high 
intercorrelations were identified. The items in 
these clusters became the basic scales. 


Set 1 and Set 2 of the Basic Scales 


Actually, two sets of scales were constructed, 
the first using the 291 items common to all 
editions (1927, 1933, 1938, and 1966) of the 
SVIB booklet, the second using all 399 items 
of the 1966 revised form. 

Although the first set will usually not be 
used in current projects either for counseling 
or research, three factors dictated its develop- 
ment: first, practically all of the SVIB cri- 
terion groups were studied with either the 
1927, 1933, or 1938 booklets; second, many 
longitudinal studies have old data on an 
earlier booklet and current data on a later 
form; third, a more practical concern despite 


the efforts of the publisher and the scoring 
services, there inevitably are users who have 
not yet switched to the revised (1966) form. 
For all of these situations, a set of scales com- 
patible with both early and recent booklets is 
needed. 

The second set of basic scales is simply the 
first set expanded to include the items added 
to the SVIB in the 1966 revision. In this re- 
vision, a definite attempt was made to improve 
the coverage in areas relatively neglected in 
the original item pool such as art, religion, and 
music. As a result, scales for those areas are 
longer and more adequate in the second set. 
Unless confronted by one of the situations in 
the preceding paragraph, current users of the 
SVIB should use this second set. 

Because most of the normative data pre- 
sented here are from groups tested before 1966, 


TABLE 3 


AGRICULTURE CLUSTER 





Item 37 | 102 | 342 | 345 | 76 | 191 | 230 | 162 | 214 | 150 
37 Farmer 65 24 22 64 28 —34 
102 Agriculture 58 27 28 44 32 —18 
342 Outside work Dil 27 40 30 26 —25 
345 Physical activity 20 19 40 18 25 —10 
76 Rancher 67 54 26 18 29 —30 
191 Handling horses 28 28 20 16 35 —16 
230 Living in the city | —24 —17 —24 —18 —21 —06 
162 Horseback riding® 17 20 13 11 27 71 —01 
214 Forest ranger® 34 37 23 18 37 24 —19 21 
150 Camping out® 18 20 20 13 18 23 —13 24 39 








Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


a Item considered for inclusion in cluster, but discarded for various reasons, See text. 
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Fic. 1. Frequency distribution of item intercorrelations 


only the Set 1 scales are used unless specifi- 
cally noted otherwise. 


Eliminated Items 


Two restrictions were applied to the indi- 
vidual items before scale construction began. 
First, to eliminate the highly popular and un- 
popular ones, no item was used if any of the re- 


sponses, Like (L), Indifferent (I), or Dislike 
(D), had less than 15% response among a 
sample of MIG. Second, none of the items 
numbered 281-320 were included; those are 
the items grouped in tens where the respondent 
is to select the three he likes best, the three he 
likes least, and mark the remaining four in- 
different. This format is troublesome as 1 or 


TABLE 4 


Art CLUSTER 














Item 5 
5 Artist 49 45 53 
20 Cartoonist 54 S/O) 
43 Interior decorator 38 =. 26 38 
104 Art 65 46 38 
163 Art galleries 49 36 34 50 
194 Decorating with flowers shy Wil 2 BS) 
233 Looking at antiquefurniture} 30 15 26 33 
84 Sculptor Of VASE Sa Or 
141 Sketching pictures of wild 
animals® 4535502 are O 
39 Art museum director 48 38 39 48 
183 Magazines about art and 
music® 44 29 29 49 
253 Artistic men® 50 31 39 49 
263 Prominent artists® 44 31 20 39 
352 Music and art events vs. 
athletic events* SO ne2 Leo ae OO) 





| 20 | 4 [104 | 163 





194 | 233 | 84 | ua | 39 183 | 253 | 23 352 





S/o SO OOO 
28 21 227) 40 
26 42 35 46 
50 338 30 38 


SE COMO 
36 39 «33 
42 39 33 
Sao Omer 


Al 35529 42, 
0 PZ) 1598 38 


60) 349935 48737 46 
40 33 30 45 34 39 40 
AT 275360, 440 63044 | 50 e056, 


AZ SIS AS 38) 20-04 1 S022 





Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


* New item from the 1966 booklet, added to develop the Set 2 scales, See text. 
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Busrtness MANAGEMENT CLUSTER 





TABLE 5 





Item 


33 | 35.| 38 | 65 | 21 | 206 | 226 | sar | 196 | 42 | 8s | 96 [177 | ars 





33 Employment man- 
ager 
35 Factory manager 
58 Manufacturer 
65 Office manager 
81 Sales manager 
206 Meeting and direct- 
ing people 
226 Developing business 
systems 
347 Technical vs. super- 


47 
28 
57 
47 


33 


30 


visory responsibility | —30 


196 Interviewing men 
for a job 
42 Hotel manager 
85 Manager, Chamber 
of Commerce 
96 Travel bureau man- 
ager® 
177 Business methods 
magazines* 
278 Prominent business- 
men® 


50 
44 


48 


41 


34 


28 


44 


48 


42 


Zi, 


34 


—22 


31 
33 


27 


37 


31 


27 
49 


34 
ol 


18 
34 


—17 


33 


44 
41 
35 
42 
28 
39 
—28 


40 
38 


45 


35 


33 


30 


36 
34 
38 
50 
36 
41 
oii 


39 
38 


42 


44 33 34 
20 Ome 
2 oe oe 
28 «640043 
26 34 44 
0 eS eC) 
Commo) 
—17 —18 —09 
ye 
30 35 
30 = 38 
25. 43 39 
CA ee lees L 


Se oes 
32 ~=32) —10 
Zo S8h — 10 
330) 60) 24 
36 42 —24 

SOm is 
29 —22 

—28 —26 

45 33 —27 
S120 — 30) 
S222 
27 19) 17 
24 48 —28 
29 30 —24 


LS Ome 22 





21 


37 


Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 


2% of the respondents fill in this section in- 
correctly. Another 1% or so, aggravated by 
this forced-choice format, write some hostile 
comment in the margin and refuse to fill in any 
of the items. To solve these problems, and 
because these items are less efficient than the 





47 Judge 


Item 41 | 47 
41 Governor of a state Sl 
55 
48 Labor arbitrator 39 48 
51 Lawyer, criminal 45 50 
52 Lawyer, corporation 34 46 
72 Politician 62 48 
160 Electioneering for office? 50 33 
359 Dog trainer vs. parole officer® —25 —23 
323 Taxi driver vs. policeman? —16 —14 
356 Warden of a prison —01 —00 
59 High school principal® 38 = 44 


regular L-I-D items, this section will probably 
be removed in some future revision. In antici- 
pation of that move, these items were not used 
in these basic scales. Only a few items were 
affected by these restrictions. 


TABLE 6 


48 





42 
41 


35 
30 
44 


rs 


31 
38 
33 


52 
43 
34 
—23 
—23 
—06 
28 





Law/Pourtics CLUSTER 





52 | 7 | 160 | 359 | 328 | ase | 39 
35) 51 
48 34 
22 39 
42 36 
22 
34 
28 65 
—12 -—24 —18 
oS 17 
Os) SUk Sy, 10 10 
29 37 20° — 24° —12, 04 


Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
b Item considered for inclusion in cluster, but discarded for various reasons. See text, 
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TABLE 7 


MATHEMATICS CLUSTER 

















Item 101 | 103 | 107 | 115 | 120 | 128 | 148 | 122 | 24 
101 Algebra 63 61 67 75 36 
103 Arithmetic 65 48 51 79 as 
107 Calculus 62 47 56 59 44 
115 Geometry 67 50 62 65 50 
120 Mathematics 76 77 63 66 42 
128 Physics 50 38 55 53 48 
148 Solve mechanical puzzles*| 35 35 35 34 37 30 
122 Mechanical drawing® 26 31 31 35 36 35 41 
24 Civil engineer® 26 27 35 34 32 31 26 39 





Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 
1960s. Decimals omitted. 
« Item considered for inclusion in cluster, but discarded for various reasons. See text. 


TABLE 8.—I rem INTERCORRELATION MATRIX 














Item 19 | 56 | 94 | 186 | 189 | 190 | 122 | 132 | 148 | 187 | 121 | 185 | 188 
19 Carpenter Sl 4647 ~«=©6400S fC 6—C 4s ss) 55 2G 35) AO SOG 
56 Machinist 47 66 47 44 62 41 $5 34 S55 40," (45meeas 
94 Toolmaker 49 73 46 46 46 #40 52; 37 7 42, 30;RScmeOS 
186 Repairing a clock 35 45 49 48 48 39 42 44 58 36,  GOMNGO 
189 Cabinetmaking Si 435499 43 46 47 - 57 33 40 (Si seedOmieay 
190 Operating machinery “7, SEY 52 39 49 43 60 42 45 56 
122 Mechanical drawing 32 41 42 38 44 40 62, 41, 35 61 Soaese 
132 Shop work Sa oe So Zep SS) Gil Sy 34. 44 64 42 44 
148 Solving mechanical puzzles} 23 33 35 40 31 37 «33 #30 39. 32) 3Seaod 
187 Adjusting a carburetor 33 oy SS 66 8645 62 so 52 139 3/7, 9) 63.aOe 
121 Industrial arts AQ 738 41 930) 7445" 4059 5925) 3 37 39 
135" Makinga radioor Hi Fiset29! 938) 93998) 55.) 450 tor Ome | ee oe 64 


188 Repairing electrical wiring |} 32 45 47 62 53 60 39 SO 37 67 «30 #53 
368 Have mechanicalingenuity) 27 39 40 43 41 46. 37 44-36 45 34. | ddaeeeaG 


13 Auto mechanic 47 59 55 46 36 915399295 475 320 62) 327SGmeed 
32 Electrical engineer 29 49 49 41 38 40 36 36 38 44 28 47 48 
45 Inventor 24. 30) 325 307 33s 4 19 20S S02 es Oe 
55 Locomotive engineer 37 53, G40 23 eS OO ee moe 18 34 28 —200aes 
61 Mining superintendent 17. 28 30 19 [ie 230 2220525) 522) 275) (24 ee 
30 Draftsman 30, 42. 45 34 932) 33) 62: 5541s) -27, 33) 4 1 ees 
88 Shop foreman 33° 52° +54. 929 29) 942, 73355 42. 25° 40) 34 
24 Civil engineer 30 41 43° 24 3 38> 305 345 Sl 320" 3 se 
180 Popular mechanics maga- 
zines 32. 45° 45 45°) 39 |$0 “43 S1 938 49 42) -20ReEG 
98 Watchmaker 34 46 54 49 31 930 ~25 30 25 3679205 s0RRso 
128 Physics 17 24527 «= .28' | 20) 22 822) 19° 36 20 ae ee 
16 Designer, electronic equip- 
ment® 26 38 40° 33° 32 9831 929" 27 (39 34 | 265eRaSeeo 


22 Electronics technician® 29° 45- 46° 36 32 9.37 32 345) 36) 43" (27s O MD 
216 Looking at things in a 


hardware store® 33!) © 34 “365% 530; 10537 S925 5 S923 2) 29 ee el 
34 Geologist» 30!) 24.7 260° °20 9 27 Wis Pls 19%) Dil LO) 19S en 
87 Computer operator? 1S) ey = hil 19) 12 9824 “82015 135) 2155922) | 09 a 


166 Science fiction magazinesb | 09 21 Olea ama 20-19 12, 23.24. 15.25 mS 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 
8 New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
b Item considered for inclusion in cluster, but discarded for varjous reasons. See text, 
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Item Intercorrelations 


The next step in scale construction was to 
generate an item intercorrelation matrix. Using 
the responses from Strong’s group of 500 MIG, 
a Pearson product-moment correlation was 
calculated between each pair of items by as- 
signing the values +1, 0, and —1, respec- 
tively, to the Like, Indifferent, and Dislike re- 
sponses, then treating one item as the x-axis, 
the ovher as the y-axis. Examples of three such 
correlations are shown in Table 1, including a 
high-positive, a low-positive, and a_high- 
negative correlation. As there were almost no 
large negative correlations, except between 
forced-choice items where they were artificially 
created, —.19 qualifies here as “high.” 


FOR MECHANICAL CLUSTER 





The next step was to decide exactly how to 
build the scales. While the authors wanted to 
identify clusters of items with “high” inter- 
correlations, they initially had little idea of 
what to call high. To gain some knowledge 
here, a frequency distribution, using all of the 
correlations in the item intercorrelation matrix, 
was constructed; it is reproduced in Figure 1. 
To provide a replication, an analogous dis- 
tribution is included for the item intercorrela- 
tions from the women’s form of the SVIB. The 
two distributions are remarkably similar. 

Several conclusions can be drawn from the 
distributions in Figure 1: 

1. The item intercorrelations were generally 
low, mostly around 0, practically all between 
a0. 





368 | 13 | 32 | 45 | SD | 61 | 30 | 83 | 24 | 180 | 98 


| 128 | 16 | Dy | 216 | 34 | 87 | 166 





Some Ome S20 37 | 32 38) 42 32 
Somoomeras 9 3/57) 645 36%) 51 36 
SOmmmotmes9o) 35 40 51 40 57 33 
PSOne 33) (30) 82) 33. 30 29 
AieoOmeee29, 35 30 31° 40 39 34 
Seo O20 3737) 26) 32 28 
SOOM 2/25 30 55) 31 39 
Soe 33 30 36 45 43 Sl 
SOmeeimeeecOr 39 23 30) 33 930 26 
MVmemotmee se 30 35° 37 29 27 30 
Wem 24 29" 222A 40380 25 
Domine?) 36 30)" 297 30° 24 32 
Zico 29° SLR 27) 31h 29 25 

Ome Al 25582295 289 26 20 


35 36, «28 «445 #439 #27 @8©646 24 
33 40 44 40 41 34 42 56 
A252 «6(A0 50 teee 20 SS as 
Teele 29 «| 17 of RS 23 
tomo 30 25 28 SOMO 40 
Vimeo 17 31. 22 38 37 
Pomel ot 15 41- 44 33 29 


SUmeeoteeeor 34 299) 369) 39 30 

Pmeoomerot 29° 3157 25 33 ~36 28 
DoS Se 23 38°) 2498 St) 3A 24 
POM 53 1455 2588 180 14 38 


Simeone 0S) «45 22) 30). 34. 26 50 
208s) 12 295 288) 408 31 54 
ZomeecOmerct | 19 ~ 259" 217) 19 24 25 
Smet 3S) 3006 20) C321 15 44 


Viele ol) 13 9245-20) 31 35 22 
Zimeioem 24) 22, 19 18 22 16 =08 


2A Ss 0) 
Oe el) 
iter SR) 
SS eS 
31 39) 34 
38 27 34 
S2me DOMED 
SOmEZo SS 
3429 30 
Slo A MES 
33 AD yy 
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Wy BYE Si 
Oe mo, 
DAS) 2S 
ies 28 
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5 5 IA 
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ome! 
27 24 
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1960s. Decimals omitted. 
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TABLE 9 


MEDICAL SERVICE CLUSTER 














Item 29 | 67 | 69 | 93 | 130 | 136 | 192 | 256 | 264 | 49 | 356 | 64 
29 Dentist AN) O40) S45) “21 235 25 
67 Pharmacist 42 31 S00 23 ee oO 2d 
69 Physician Sommers 78 = 331 Wey eho; 
93 Surgeon AOL nO Zi 24% 3S 
130 Physiology 220 20 5D eos 34 8628 
136 Zoology 17 24 28 29 48 15 
192 Giving first aid assistance 49 46 47 46 41 38 
256 Physically sick people* LLG i ee mE 
264 Outstanding scientists* 11 1 So) ee oa ee On Oe 7 
49 Laboratory technician 25a o 2 NG) BR) 17) 25.40) 06,25 
356 Superintendent of hospital 12 10) 2220) eee 20 ee Same 2 en) Se) 7 
64 Psychologist® 145 159 25) 82077 §35) 9 S505 4 ee 





ote.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested i 


N 
1960s. Decimals omitted. 


a Item considered for inclusion in cluster, but discarded for various reasons. See text. 


2. The distributions were slightly positively 
skewed. This mild effect could be caused by 
any of several factors such as a slight response 
set or overrepresentation of some item-content 
areas. 

3. There were not many high correlations. 


In the initial matrix, the highest were .79 be 
tween the items Mathematics and Arithmetic 
and .78 between Surgeon and Physician, only 
5% were above .25, and only 1% above .35 
This strongly suggests that the items are rela. 
tively specific and that the respondents pay 2 


TABLE 10 


MERCHANDISING CLUSTER 














Item x0 | 99 st |a19| 18 | 42 | 85 |200] 65 | 92 | os | 346 96 | 177 [218 
80 Retailer 59 48 51 52 36 40 48 44 30 
99 Wholesaler 5 58 44 EAS SS 4 SO 
81 Sales manager 52 48 47 48 34 44 44 50 38 
219 Buying merchandise h , 
for a store 53. 49 50 S452 aS Oe S220 
18 Buyer of merchandise | 54 52 S51 58 36 33 44 44 32 
42 Hotel manager 369) 30) 35938) 936 ae) yl AI By 
85 Manager, Chamber of | 34 36 42 36 36 38 30 47 41 
Commerce 
220 Displaying merchan- 
dise in a store A139 F 34 9559 Ae Sess S206 
65 Office manager 37) 39) 42) 3605415 SS e427 35 
92 Stockbroker 2934 tS OOS ee? OF ae OME DO 


68 Public relations man® | 35 35 53 38 41 
346 Work in an import- 


export business® 28, 30) 30) 3135 
96 Travel bureau man- 
ager® 347 S31 329 325 33 
177 Business methods 
magazines® 32s O04 nod 
218 Looking at things in 
a clothing store» 24 S22 26 9 22 





AD OS ON te OL 

Sm) PRE DRS PA LH 

aBy Gh hi, SIS) ca 
2A S31 270) 33 Soo 4ea lo eee 


19° (20) 345), 190920) 628) 2295 20 





Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


® New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
b Item considered for inclusion in cluster, but discarded for various reasons. See text. 
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great deal of attention to the unique aspects of TABLE 11 
each item. MixiraAry Activities CLUSTER 

4, There were no large negative correlations. eS SS 
This indicates that, at the item level, there is Tew 4 | 123 | 151 | 210 | 237 
little tendency for people who like one item to 
all dislike another. (This is peculiar, for large 4 Military officer 49 42 59 
negative correlations certainly appear at the 123 Military drill 52 66 668 
scale level, ranging up to —.88 between the 15! Drilling in a military 
Social Science Teacher and Engineer scales.) 419 Drilling soldiers = e 74 es 
In constructing the basic scales, only a few 237 Military mena 69 Aa) 43 7 
items with negative correlations were included, 
even though a lower (absolute) value of cor- Note.—Correlations above diagonal from Strong’s 500 MIG 
Pe on nae accepted posted pete aden ehone Below diagonal based on sample tested in 

. . 1 omitted. 


After the matrix had been calculated, and it semreiaesaide the 1966 booklet, added to develop the Set 2 


TABLE 12 


Music CLUSTER 











Item 62 | 63 | 66 | 167 | 183 | S52 | 152 | 157 | 149 | 261 
62 Musician 50 57 40 
63 Music teacher 56 2 Zi 
66 Orchestra conductor 58 51 34 
167 Symphony concerts 39 (634 ~~ 47 
183 Magazines about art and music® 43 41 46 54 
352 Music and art events vs. athleticevents® | 34 Di 35 48 50 
152 Playing the piano* 49 oi 42 30 30 22 
157 Jazz concerts® 26 20 28 ZS) 30 08 29 
149 Religious music 28 30 26 34 oD 21 pal 13 
261 Musical geniuses* 43 34 44 39 46 =. 33 35 2S 25 


Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 
1960s. Decimals omitted. 
4 New item from the 1966 booklet, added to develop the Set 2 scales. See text. 


TABLE 13 


NATURE CLUSTER 











Item 37 | 50 | 102 | 106 | 125 | 136 | 147 | 193 | 230 | 326 | 194 | 76 | 214 
37 Farmer 40 65 33 35 26 34 40 —34 —14 19 64 
50 Landscape gardener 37 40D) S55 AsO 2 ema ol 17 220 435 33 
102 Agriculture 58 42 42 44 34 39 42 —18 —08 29 44 
106 Botany 2 2S ero 2 Lae) Omer 3) — (OM 10) 225 22 
125 Nature study 2S RSS nS SEL 47 64 45 —08 —16 32. 25 
136 Zoology IS} PR AN SES) A SO Ole — 055 9 215 24 
147 Bird watching DET EES LT 239 30 soe 4 48 —17 —20 30 28 
193 Raising flowers and vege- 
tables 33 44 40 28 30 16 24 —17 -—32 51 24 
230 Living in a city —24 —08 —17 00 —03 O01 —06 —03 25 —01 —30 
326 Selling things house to 
house vs. gardening —24 —34 —22 —19 —22 —12 —21 —26 13 —15 —1i1 
194 Decorating with flowers Se eS eS OS fe) eS 13 
76 Rancher Gy Bh Seb ey a Nee Sail aye als} 
214 Forest ranger® 5 AO) eS) 22 | 26 26 —19 —24 17 37 





Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 
1960s. Decimals omitted. 
a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
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TABLE 14 


OFFICE PRACTICES CLUSTER 











1s | 2 | 25 | 65 | 74 | 80 | 99 | 105 135 | 36 | sat | 208 | s89 








Item 
15 Bank teller 7S iil 
21 Cashier in bank 81 2a OO 
25 City or state employee 2526 19 
65 Office manager 39 40 22 
74 Private secretary SOS eee co nO 
80 Retailer 32 eo ee Sa 
99 Wholesaler 302 SOD es: 
105 Bookkeeping A003 jie el een 54: 
135 Typewriting 2) ee eS 
| 36 Income tax accountant® 43:0) (40 meni dno G4 
321 Airline pilot vs. airline 
ticket agent® —18 —20 —15 —13 
208 Making statistical 
charts» eS lyf EG 
389 Pay attention to details» |—07 —03 —11 —10 


39” S54 . 30" "40" “32 

SO snes /Meo a ASI EZ y) 

Zi OMe Seo ES 

48 44 47 40 21 

38 340 «63532 

19 9 eS Lie 

10 ESS: SO mms 

Lim eo! 39 

17 14a loe 23. 

2A 23) DLS. 
—08 —13 —04 —15 —07 —14 

10 S10, 09 27. SIS 32504 
—08 —06 —02 —14 —12 —13 14 —15 








Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
b Item considered for inclusion in cluster, but discarded for various reasons. See text. 


required 3 hr. with a CDC 1604 computer, 
various searching techniques were used, with 
the further aid of the computer when possible, 
to locate smaller matrices containing perhaps 
5-25 items, all having high intercorrelations. 
Using Figure 1 as a guide, high was defined as 
above .30 though it was sometimes necessary 
to drop lower. The absolute minimum was set 
at .20 but even that was violated occasionally, 
especially when the correlations were negative. 


Item Composition of the Basic Scales 


Twenty-two basic scales were finally con- 
structed; item intercorrelation matrices for 
each of them are listed in Tables 2-23. The 
items in the tables were sometimes abbreviated 
for easier presentation here. 

In these matrices, two correlations are pre- 
sented for each pair of items. Those above the 
diagonal were generated from Strong’s 500 
MIG, tested during the 1930s, and are the 


TABLE 15 


PuBLIC SPEAKING CLUSTER 











Item 131 | 199 
131 Public speaking 66 
199 Making a speech 67 
72 Politician 42 43 
26 Minister, priest, or rabbi 25 29 
41 Governor of a state 40 47 
160 Electioneering for office® 38 43 
75 Radio announcer*® 34 2 
68 Public relations man® 38 28 
176 Telling jokes> 26 22 
367 Prefer working alone to working 
on committees —20 —18 





—18 





72 | 26 41 | 160 | 75 | 68 | 176 | 367 
34 29 34 
39 31 29 
2D Sl 
21 23 
62 27 
65 18 50 
32 24 30 29 
39 21 34 35 39 
19 06 14 21 24 20 
—08 —15 -17 —13 -—18 —08 


Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
b Item considered for inclusion iv cluster, but discarded for various reasons. See text. 
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TABLE 16 


RECREATIONAL LEADERSHIP CLUSTER 





7 j 70 | 79 | x27 | a4s | ars 12 | 380 | 280 | 60 | asa | 174 





Item 

7 Athletic director Sou 44 
70 Playground director 55 40 

79 Reporter, sports page Sie 43 
127 Physical education O20 4240 
143 Boxing 322425 
171 Sports pages in newspapers 59 2m aS 
12 Auto racer 1309 14 
350 Playing baseball Se) els 
280 Athletic men*> 44 27 38 
60 Professional baseball player oOo 

352 Music and art events vs. athletic 

events? —41 —16 —30 
174 Skiing® ily SS 


CEES 2 A 26 25:5 34. 
27 Se? pam 22 ee) me 2.) 
2 Ome ae 205 24 ete23 
2 On ee Oe 2 One I 
32 33 205 20519 
Some) 2) ee 
10 24 04 20a LO 
31 LOO See 22 
AS? OOS Smee 
41 21 43 18S a 39. 
—31 —18 —49 —10 —07 —33 —37 
17 me Ol As Ss AO 6 Os 





Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 
& Inadvertently left out of Set 1 scales. 


b New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
¢ [tem considered for inclusion in cluster, but discarded for various reasons, See text. 


correlations used here to cluster the items into 
scales. The correlations below the diagonal are 
based on a recently tested sample of MIG who 
filled in the 1966 booklet. (Many of them were 
actually tested in 1964 and 1965, using a pre- 
publication version of this bookiet.) This 
second group, which also included 500 men 
from a variety of occupations, provided a 
replication to recheck the homogeneity of the 
selected clusters. In general, the intercorrela- 
tions remained high in the second group, indi- 
cating that these clusters are stable over time 
and from one group to another. 

The latter group also provided item inter- 
correlations for the new SVIB items added in 
the 1966 revision, and several items, those so 


noted, were added to the scales after reviewing 
these figures. The expansion of the scales with 
these new items constitutes Set 2 of the SVIB 
basic scales. 


Considerations in Scale Construction 


It is difficult to recapture exactly the 
methods used in searching out these sub- 
matrices; a combination of computer searching, 
art, hunches, and intuition was employed to 
decide which items should be grouped together, 
and it would be difficult to exactly replicate 
these techniques. To demonstrate some of these 
decisions, items at the bottom of some matrices 
are noted; these were considered for inclusion 
in the scale but were rejected for one reason or 


TABLE 17 


RELIGIOUS ACTIVITIES CLUSTER 














Item 26 | 100 | 250 | 124 | 149 | 161 | 169 | 181 
26 Minister, priest, or rabbi 52 46 
100 Worker in YMCA 30 36 
250 Religious people 35 30 
124 Bible history*® 46 27 45 
149 Religious music 39 30 42 46 
161 Going to church® 47 30 57 46 43 
169 Church young people’s group* 3 45 ay 46 46 59 
181 Reading the Bible? 39 26 51 61 52 56 Sil 





Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
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TABLE 18 


SALES CLUSTER 


1 | st | 77 | a0 | st | 90 | 95 | 197 | 326 | 23s | o8 | oo 








Item 
11 Auto salesman 43 49 
54 Life insurance salesman 43 49 
77 Real estate salesman 5a eZ 
80 Retailer S406 35) 41 
81 Sales manager 43 41 48 
90 Specialty salesman 430m SOte 5d 
95 Traveling salesman 43 44 45 
197 Interview prospects in selling 44 45 47 
326 Selling things house to house* SOMES Smee 29 
335 Definite salary vs. commission on 
what is done® —16 —15 —20 
68 Public relations man® S135. 40 
360 Appraise real estate 25 eee eS 


33 32) 42) 495 35 

33° 36° 41 42 41 

3941 50 44 44 

487° 39 38 30 

Se 47 48 54 

45 ov 54. 43 

Some OO) 51 

SON COME Oo e Oe 

LS soem 4 47 
—07 —32 —28 —26 —35 —32 

Son) Postemwaly 933, 940% 9217 
eee ee 20 18 28 20 —19 19 





Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
b Item considered for inclusion in cluster, but discarded for various reasons. See text. 


another, usually because of low intercorrela- 
tions but occasionally because of content. 

To guide these decisions, the following two 
standards were closely followed: first, each de- 
cision was based on statistical evidence so that, 
for example, no two items were clustered to- 
gether unless it was empirically defensible; 
second, the other major concern was with 


eventual interpretation. Thus, considerable 
attention was paid to item content. Occa- 
sionally, an item with relatively high correla- 
tions with a cluster of items was not included 
with them because the item simply seemed out 
of character. One such example was the item 
Sculptor in the Science cluster (Table 19). 
Although that item correlated fairly high with 




















TABLE 19 
SCIENCE CLUSTER 
Item 83 | 212 | 23 49 108 | 6 | 10 128 | 107 | 136 | 114 | 16 | 22 | 34 | 346 | 264 | 87 | 64 
83 Scientific research 
worker 67 54 50 Al 43 47 37 35 35 35 
212 Doing research 
work 67 42 39 36 29 41 41 34 32 27 
23 Chemist Sa 44 52 63 35 39 33 36 38 30 
49 Laboratory tech- 
nician 54 40 54 39 ih 44 31 36 31 26 
108 Chemistry 38 30 57 37 25 25 50 38 36 30 
6 Astronomer 40 36 41 31 26 37 29 30 31 34 
10 Author of technical 
book 46 46 38 Pee 26 30 29 32 23 20 
128 Physics 41 36 45 32 50 34 oS 44 30 32 
107 Calculus 33 29 34 22 40 30 30 aS 19 19 
136 Zoology 33 28 31 25 28 oe 30 27 21 41 
114 Geology 28 26 28 24 Qi 34 26 30 20 43 
16 Designer, elec- 
tronic equipment® 50 41 55 46 ili 38 39 Al 31 20 22 
22 Electronics tech- 
nician® 49 37 58 54 ey) 35) 32 35 25 19 21 68 
34 Geologist® 44 37 44 39 32 47 32 34 24 40 61 36 37 
346 Business vs. re- 
search laboratory* |} —50 —47 —35 —33 —27 —26 —31 -—28 —23 —20 —12 -—30 —31 —21 
264 Outstanding sci- 
entists*® ol 27 30 2S 26 24 32 28 17 28 24 DS 19 18 —20 
87 Computer opera- 
atorb 29 23. 21 34 18 12 16 18 17 06 09 30 36 17 —16 14 
64 Psychologist 22 15 12 14 08 20 20 12 08 19 12 10 06 17 —00 10 08 








Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
b Item considered for inclusion in cluster, but discarded for various reasons. See text. 
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TABLE 20 
SOCIAL SERVICE CLUSTER 
Item 89 | 100 | 133 | 184 | 228 | 229 | 359 | 64 | | 256 

89 Social worker 59 49 40 26 32 

100 Worker in YMCA 46 34 28 30 35 
133 Sociology 47 23 41 22 33 

184 Social problem movies 49 24 47 14 20 
228 Contributing to charities 22 20 21 24 51 
229 Raising money for a charity 34 28 De 27 50 
359 Dog trainer vs. parole officer* —29 -—13 —26 —29 —-15 —14 

64 Psychologist 39 18 48 36 09 10 —04 
256 Physically sick people 19 16 15 21 32 30 —10 05 





Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


® New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
b Item considered for inclusion in cluster, but discarded for various reasons. See text. 


the science items such as Scientific Research 
Worker (.38), Chemist (.30), and Astronomer 
(.37), it was not included with them because 
it just did not seem to belong. 

Another example is shown in Table 7 which 
has the item intercorrelations for the Mathe- 
matics scale. At the bottom of the table are 
several items which might have been included 
with this scale if the size of the intercorrela- 
tions had been the only consideration. How- 
ever, they were not added because they would 
have clearly diluted the homogeneity of the 
Mathematics cluster. Such decisions were not 
made often, nor lightly, but all of the decisions 
were definitely not left in the hands of the 
computer—most of them, though, were. 

Scanning the item intercorrelation clusters 
makes it clear that some item-content areas 


are better represented in the SVIB item pool 
than others. In the mechanical area, for 
example, many items were available, while in 
the Religious Activities area only three items 
were available for the Set 1 scales. Normally 
three items would not be considered sufficient 
to constitute a scale but because it was pos- 
sible to generate a longer scale for the Set 2 
scales such a scale was constructed. 

While most of the possibilities for such 
scales in the SVIB item pool have been ex- 
hausted, no claims are made to have exhausted 
the domain of interests. When working with 
item intercorrelations, it becomes very ap- 
parent that a cluster of interrelated items will 
emerge if enough items are available concerning 
any one type of activity. But if an activity is 
not represented in the SVIB item pool, then 


TABLE 21 


TEACHING CLUSTER 

















Item 82 | 27 

82 School teacher 68 

27 College professor 63 

63 Music teacher 34 26 
203 Teaching adults 48 44 
202 Teaching children 38 22 

53 Librarian 40 34 

59 High school principal 57 46 
387 Have patience when teaching others* Ze 22 
359 Dog trainer vs. parole officer> —23 —23 


| 63 | 203 | 202 | Do | 59 | 387 | 339 
48 38 36 40 
37 A 29 38 
18 21 39 
23 45 18 
24 53 Dis) 
35 18 17 
29 32 28 31 
13 34 26 06 18 
—02 —20 —-11 -—04 —24 -—-17 





Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


a New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
b Iter considered for inclusion in cluster, but discarded for various reasons. See text. 
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TABLE 22 


TECHNICAL SUPERVISION CLUSTER 





Item 35 | 58 | 61 | 88 | 347 
35 Factory manager 49 43 46 —10 
58 Manufacturer 54 37 33 —11 
61 Mining superin- 
tendent 40 38 50. ~=—-:10 
88 Shop foreman 45 35 44 09 
347 Technical vs. sup- 
ervisory responsi- 
bility —22 —17 O01 —0O2 





Note.—Correlations above diagonal from Strong’s 500 MIG 
tested in 1930s; those below diagonal based on sample tested in 
1960s. Decimals omitted. 


no scale to measure interests in that area can 
be constructed. This is probably not a signifi- 
cant problem because the SVIB item pool does 
have considerable diversity, especially in occu- 
pational activities. However, certain omissions 
can be quickly identified by scanning analogous 
intercorrelation matrices which have been de- 
veloped for the women’s SVIB. For example, 
one scale available for the women’s form but 
not the men’s has been labeled ‘‘Homemaking”’ 
and contains the following items: 


ITEM 
Caterer 
Cook 
Housekeeper 
Home Economics Teacher 


Doing your own laundry work 
Cooking 

Sewing 

Preparing dinner for guests 
Trying new cooking recipes 
Home Economics 


Whether or not such a scale would be useful 
on the men’s profile can only be an academic 
question here; because these items have never 
been included in the men’s booklet, this scale 
is not available for use in comparing male occu- 
pational groups. 

In constructing these scales, some difficult 
decisions arose over the issue of scale inter- 
relationships, the most difficult being that of 
scale independence. Most psychometricians, 
and probably all factor analysts, would likely 
argue that these scales should be as independ- 
ent as possible; certainly they would prefer 
that there be no overlapping items. While 
there might be some statistical advantages in 
such procedures, it is not at all clear that the 
resulting scales would be the most accurate re- 
flection of the structure of interests, as some 
interest dimensions are related, and some SVIB 
items tap more than one dimension. 

One such quandary appeared in the three 
scales: Sales, Merchandising, and Office Prac- 
tices. The correlations between these scales are 
high: .78, .63, and .79, respectively, and there 
is probably only one underlying dimension— 


TABLE 23 


WRITING CLUSTER 








Item 9 | ae | 71 | 31 | 53 | 78 | 119 | 40 | 112 | ae | 165 | 170 | 75 
9 Author of novel Oe cA] Oe OUT O Camm) Cinna 

57 Magazine writer 65 S10 OS i 43 OOO OO EAS Oe 

71 Poet SOO) Ai 4 33 ae) Oe 2 emer 

31 Editor 52 eee Ome 4S SSO SO ke he AS San29 

53 Librarian 28 5300 3 moe 51 SOF 20cm 20 umes 

78 Reporter, general aD 549 39 505399 28 2S ee One 
119 Literature Si 425 48 43 SO 40 26) Meo, 9240 

40 Foreign correspondent Aq 951) 033 eo ee er) 29 «18 

112 English composition 48 47 40 44 35 40 57 32 30 

172 Poetry AL 408 (6/937 985 S252 28h 

165 Writing a one-act play 58’. 648 559950 837 = 465 AS 45 AG a 44 

170 Biographies® 88° She 383.9032 1335 32s ADO 4am 4 eA) 

75 Radio announcer 23 G4 2200936 SSD eee 22s 4 ee Oe es Se 





Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 


8 New item from the 1966 booklet, added to develop the Set 2 scales. See text. 
> Item considered for inclusion in cluster, but discarded for various reasons. See text. 
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business. Yet, there were three fairly distinct 
clusters of items and, as they clearly merit dif- 
fering interpretations, they were retained as 
separate scales. Whenever such an arbitrary 
decision had to be made, the overwhelming 
consideration was to improve the final inter- 
pretability of the scales, not to satisfy inter- 
mediate psychometric requirements. The even- 
tual impact of these decisions can be seen in 
Table 31 where the scale intercorrelations are 
reported; some of the correlations were quite 
high—in the .70s—and that is undesirable. 
Still, one cannot have it both ways; either you 
take the clusters as they stand, or you arti- 
ficially eliminate overlapping items. The 
authors’ preference was for the former. 


“Personality” Scales 


The scales which appeared first in the scale- 
construction stage were those that researchers 
in interest measurement have learned to 
expect: Sales, Science, Mechanical, Social 
Service, and so forth. But the authors were 
also interested in constructing scales that 
would resemble the more traditional per- 
sonality dimensions such as aggressiveness or 
compulsivity. 

Several attempts to build such scales from 
the SVIB item pool were singularly fruitless. 
Neither computer algorithms nor intuition 
located clusters of items that held together 
statistically. To illustrate the difficulties, inter- 
correlations for items selected to represent the 
dimensions of autonomy and compulsivity are 
shown in Tables 24 and 25. The level of homo- 
geneity of these ‘‘clusters” falls far short of 
the minimal standards of statistical coherence 


which were followed in forming the occupa- 
tional oriented basic scales, and these per- 
sonality clusters were dropped. These failures 
might be attributed either to a lower cohe- 
siveness of personality clusters, or to the in- 
adequacies of the SVIB item pool in the per- 
sonality domain. 

Finally, to gain some expetience with such a 
personality scale, the Adventure scale was 
forced into existence. Items with a derring-do 
flavor were separated out, their intercorrela- 
tions were scanned, and those with at least 
mild positive correlations, generally larger than 
.10, were clustered together. As will be seen 
later, the resulting scale worked reasonably 
well—a group of astronauts scored highest, 
2.50 SDs above the lowest scoring occupations 
of school superintendents and mathematicians 
—but, still, the scale seems to have less occu- 
pational relevance than the other more homo- 
geneous scales. For example, on the other 
scales, scores increase with age, especially with 
occupational experience, but scores on the Ad- 
venture scale decrease with age. In general, 
attempts to build homogeneous personality 
scales were unsuccessful. 


Scale Reliability 


Relatively few items are available in the 
SVIB for any single content area; thus, most 
of these scales are short and one of the first 
concerns here was with scale stability. Building 
short scales, no matter how pure, could con- 
ceivably result in scales with unacceptable re- 
liabilities. To check this, several scales of 
varying lengths were constructed and their 
long-term reliabilities were compared. Six 


TABLE 24 


AUTONOMY CLUSTER 








Ttem 299 | 335 | 336 | 351 | 338 | 367 | 392 

299 Freedom in . . . doing work —16 13 —06 —06 

335 Commission on what is done ys. salary? | —14 —25 —04 24 

336 Work for yourself 18 —29 —01 —31 

351 Amusement alone. . 8 —12 —00 07 07 

338 Work for self in small business® —10 aS —48 05 

367 Prefer working alone 06 01 —00 —22 —01 

592 Dislike taking orders 08 —06 17 —09 —14 09 





Note.—Correlations above diagonal from Strong’s 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 
a Item with negative weight. 
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TABLE 25 
CoMPULSIVITY CLUSTER 
Item 105 | 223 | 227 | 334 | 341 | 344 | 380 | 362 | 366 | 373 | 385 | 389 
105 Bookkeeping 20 26 —0O1 —09 —02 17 
223 Methodical work 21 14 —10 —21 —02 30 
227 Saving money Oe) 14 —24 —03 —08 15 
334 Playing safe® —10 —14 —10 03 14 —06 
341 Work with many details* —09 —27 —03 06 —08 —19 
344 Similarity in work® —10 —10 —08 19 00 10 
380 Plan work in detail 12 28 18 —06 —18 —Ol 
362 Make decisions immediately 03 —11 —07 20 O09 O7 —10 
366 Keep detailed recordsofexpenses} 19 21 21 —15 —11 —13 29 —11 
373 Am always on timewithmywork| 06 14 11 —11 —01 —09 25 O07 13 
385 Am slow going and sure —01 10 08 —21 O1 —08 12 —29 06 —02 
389 Pay attention to details very little) -14 —26 —12 14 29 04 —37 17 —32 —13 —10 


Note.—Correlations above diagonal from Strong's 500 MIG tested in 1930s; those below diagonal based on sample tested in 


1960s. Decimals omitted. 
« Item with negative weight. 


content areas were selected and two test-retest 
samples were used. The first samples included 
102 members of an Army Reserve unit tested 
twice over a 30-day interval; the second in- 
cluded 191 Stanford University seniors tested 


TABLE 26 
RELIABILITIES FOR SOME EXPERIMENTAL 


Basic SCALES 


Test-retest 


Corrected reliabilit 
Content area No. split-half y, 
items eee 
reliability 
30 day | 22 yr. 

Science 8 82 .86 .66 
16 .89 91 A 

24 .90 88 .66 

32 92 91 73 

Mechanical 14 93 73 
21 94 70 

25 94 70 

Sales 8 89 47 
16 91 50 

Writing 4 85 .60 
i 83 59 

9 84 61 

20 88 67 

Public speaking 7 87 .67 
Mathematics 5 88 iD 








Note.—Median test-retest correlation for 53 regular SVIB 
scales: for 30 days, .91; for 22 yr., .67. 


first in 1927 and retested in 1949. Test-retest 
correlations for both of these samples for 
several experimental scales of differing lengths 
are reported in Table 26. Within each content 
area, the scales were formed from the “best” 
items available; thus, the 8-item Science scale 
has those items with the highest intercorrela- 
tions, the 16-item scale contains the initial 
8, and the next 8, and so forth. 

Several conclusions can be drawn from these 
reliability calculations: 


1. The relationship between scale length and 
reliability was approximately the same, 
whether split-half or test-retest reliability co- 
efficients were used. As the major concern with 
the SVIB is long-term stability, test-retest 
reliability is more important, and that sta- 
tistic was used for the other comparisons. 

2. In general, the expected relationship ap- 
peared between scale length and reliability— 
the longer scales were more reliable—but the 
relationship was by no means perfect. The 
shortest Mechanical scale (14 items) was more 
reliable over 22 yr. than the longest (25 items). 
The most reliable scale over the 22-yr. period 
was the 5-item Mathematics scale. 

3. There was considerable difference between 
the content areas in the reliability of scales of 
equal length. Thus, the 8-item Sales scale, 7- 
item Writing scale, 7-item Public Speaking 
scale, and 5-item Mathematics scale had 22-yr. 
test-retest reliabilities ranging over a con- 
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siderable span: .47, .59, .67, and .75, respec- 
tively. This range could be caused either by 
differing degrees of homogeneity within the 
scales, the nature of the interest clusters in- 
volved, the composition of the test-retest 
samples, or all three factors. 

4. In general, these reliabilities were roughly 
equivalent to those of the regular SVIB occu- 
pational scales. The loss due to scale shortness 
was apparently offset by the gain attributable 
to scale homogeneity. (The regular occupa- 
tional scales have roughly 60-75 items.) 


The general conclusion reached from this 
series of studies was that short scales, if their 
content is homogeneous, can be as reliable as 
long scales. Thus, the conflicting goals of purity 
of content, measured by item intercorrela- 
tions, and longest possible scale length were 
stressed in the scale construction. 

Further information on the stability of these 
scales is reported in a later section. 


Item W eights 


Once the items had been selected for each 
scale, scoring weights were established by 
assigning the weight of +1 to the Like re- 
sponse, and —1 to the Dislike response. In the 
few cases where the item intercorrelations were 
negative, these weights were reversed. 


Cranny’s Work in Factor Scales 


In a closely parallel but independent project, 
Cranny (1967) has factor analyzed the items 
of the SVIB; he described his project as 
follows: 


The responses of 500 men in general to each of the 288 
items were intercorrelated and the items arranged in 15 
clusters on the basis of their intercorrelations. Thur- 
stone’s (1947) multiple group method of factor analysis 
was followed. One factor was extracted from each cluster 
and the angular cosines among the 15 oblique factors ob- 
tained. Four factors were extracted from the 15 by 15 
matrix of angular cosines using Lawley’s (1940) maxi- 
mum-likelihood procedure. The residuals were reduced 
to less than .00005. 

The four factors were rotated obliquely and Wherry’s 
(1959) hierarchical factor solution was applied to yield 
four sub-general and 15 group factors. The factor load- 
ings on the 19 factors of each of the 288 items were com- 
puted and a 288 by 288 residual table was obtained. 

The items with loadings on each factor are listed and 
descriptive labels are suggested for each factor. One of 
the group factors was not sufficiently defined to permit 
interpretation [p. 1]. 


TABLE 27 


LisTING OF CRANNY’S VERSUS SVIB SCALES 











Cranny scales SVIB scales 


Art 
Social Welfare and Service 
Public Contact 


Art 
Social Service 
Public Speaking 


Writer Writing 
Military Military 
Farming Agriculture 
Natural and Social Science} Nature 

Science 
Mechanical Mechanical 
Undefined 
Medical Medical Service 
Mathematics Mathematics 
Business Management Business Management 
Sales Sales 
Musician Music 
Law Law/ Politics 








Note.—Scales with no Cranny counterpart: Technical Super- 
vision, Office Practices, Merchandising, Recreational Leader- 
ship, Adventure, Religious Activities, Teaching. 


There is considerable agreement between 
Cranny’s factor scales and the scales derived 
in the current project, which is interesting 
because the scale-construction techniques dif- 
fered somewhat. Cranny’s was essentially a 
statistical approach; in contrast, the methods 
of this study were much more intuitive. 

A comparison of the two sets of scales ap- 
pears in Table 27 where the scales are matched 
up by name. This comparison exaggerates the 
similarity slightly as some scales with the same 
names have slightly differing item content. 
Still, the resemblance is considerable and 
reassuring. 

Many of the differences were due to greater 
concern in this present study for eventual 
interpretability. The treatment of the business 
area is one example; Cranny has two scales: 
Business Management and Sales, while this set 
contains five, covering approximately the same 
item content: Business Management, Sales, 
Technical Supervision, Merchandising, and 
Office Practices. There is no simple way to 
determine which approach is better. Cranny’s 
scales are probably statistically more de- 
fensible; the present authors’ scales are 
probably easier to interpret—so the choice 
between them is largely a matter of taste. 

Cranny’s report, which is an impressive 
amount of work for a doctoral dissertation, 
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contains no normative or other psychometric 
data on his scales. 


Evaluation of the Basic Scales 


In developing these scales, both statistical 
precision and common sense were used. The 
success of these techniques must be deter- 
mined, not by studying the scale-building 
techniques themselves but by the efficacy of 
the resulting scales. To this end, a great deal 
of information follows on the reliability, va- 
lidity, and other characteristics of these Basic 
Scales. The results seem meaningful and, after 
studying these data, users of the SVIB should 
have more confidence in their interpretations, 
and researchers should be able to home in more 
precisely on the unanswered questions. 

One result of this work is that now some- 
thing more is known about the organization of 
interests. The clusters of items were not 
forced; for the most part they fell out of the 
intercorrelation matrix relatively cleanly and 
their makeup reflects something of the basic 
dimensions underlying the items. In this sense, 
some clusters that did not appear are note- 
worthy. The failure of personality clusters to 
appear has already been noted. Another inter- 
esting omission was anything resembling a 
“prestige” cluster—it might have contained 
items such as corporation president, governor, 
and perhaps college professor. This might indi- 
cate that status per se is not an important 
determinant of item choice. 

The clusters that were formed merit more 
extensive study as they have some psycho- 
logical integrity, so to speak. More is said about 
this in the section on construct validity. One 
observation can already be made; that is, item 
clustering tends to follow vertical, not hori- 
zontal, divisions in the occupational world. 
Thus, the Science scale includes high and low 
laboratory positions; the Office Practices scale 
includes high and low office positions. This 
strongly suggests that the unifying feature is 
the activity involved, not the more tangential 
aspects of the work. 


NorMs FOR THE Basic SCALES 


Once the basic scales were developed, norms 
had to be developed for interpretation; simply 
calculating the number of Likes that an indi- 
vidual marks in each scale would be of little 


value because the areas differ in popularity. 
To make the scores meaningful, they must be 
standardized in some manner that will permit 
ready comparisons between individuals and 
between scales. One common way to do this— 
and the method that has been employed here— 
is to convert the raw scores into standard 
scores with a known mean and standard devia- 
tion, usually 50 and 10, respectively. To do 
this, it is necessary to score some reference 
sample on each scale, then use the resulting raw 
score mean and standard deviation in a raw- 
score-to-standard-score conversion formula, 
The question as to which reference sample 
should be used is difficult to answer. The regu- 
lar occupational scales are normed against 
whatever occupation they are based on but 
that is not possible here as these scales are not 
specific to any one occupation. For these basic 
scales, there are various arguments for using 
a more general sample such as college freshmen, 
or high school seniors, or college-bound high 
school seniors, or perhaps a random sample of 
young adults, or even a stratified sample of the 
entire male population. Each would have its 
peculiar advantages and disadvantages. 


Description of Norm Sample 


The selection of a norming sample was again 
a result of concern with eventual interpreta- 
bility. The men in this sample have been 
tested twice, once as teenagers and once as 
adults. They were first tested by Strong in 1930 
when they were 16 yr. old, in his cross-sectional 
study of the relationship between age and 
interests (Strong, 1931). They have been re- 
tested in 1966-67 in a longitudinal study of the 
relationship between teenage interests and 


TABLE 28 


EDUCATIONAL LEVEL oF Norm Group 








Educational level n % 

Less than high school graduate 94 14 
High school graduate 139 ae 
Some college 170 26 
BA, BS 149 23 
MA, MS 50 8 
MD, DDS, LLB 26 4 
PhD, EdD 19 3 
Total 647 100 
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adult occupations. (When retested, their mean 
age was 51.8, with a standard deviation of 
1.50. To describe them as ‘‘16-yr.-olds’”? when 
first tested and ‘“‘52-yr.-olds’” when retested 
provides brief and essentially correct labels.) 
For this reason, they make an especially good 
norm group as norms can be established, using 
the same people, for both teenagers and adults. 
This advantage of being able to compare 
directly 16-yr.-olds with 52-yr.-olds, using the 
same sample, is offset slightly by the dis- 
advantage that the 16-yr.-olds were drawn 
from the 1930 population, not the current gen- 
eration, but this is a small price to pay for 
having directly comparable groups. 

A more precise description of the sample is 
not possible, for Strong never wrote down his 
specific 1930 sampling methods. Nevertheless, 
it is apparent that he did try to obtain a wide 
and diverse sample of teenagers, and it is not 
grossly inaccurate to consider this a cross- 


TABLE 29 


Jos SATISFACTION OF Norm Group 








Item %o ee 
ing item 
a. It is exactly what I have wanted to do. 18 
b. It is approximately what I have wanted 
to do. 28 
c. Itis something I entered, due to circum- 
stances more or less beyond my control, 
but I am now satisfied in it. 40 
d. Itisacareer which is tolerable but not 
really what I would like to do. 2 
e. It isan unsatisfactory and unrewarding 
career. 1 
f. Itisacareer that I strongly dislike and 
I wish I could leave for some other. 0.3 


Note.—The instructions were as follows: Please check one 
of the following which best describes how you feel about your 
job. 


section of males within the specified age ranges, 
with perhaps a mild underrepresentation of the 


TABLE 30 


TEENAGE AND ADULT MEAN SCORES FOR Basic SCALE NORM GROUP AND CORRELATIONS 
BETWEEN SET 1 AND SET 2 SCALES 



































Adult scores (age 52) Teenage scores (age 16)> 
Scale Set 1 Set 2 Set 1 r 
M SD M SD M SD 
Public Speaking —0.93 2.67 —1.41 3.94 46.6 9.2 94 
Law/Politics —0.82 3.39 —1.41 4.08 45.4 9.6 97 
Business Management 0.43 5.03 1.03 6.01 43.9 9.5 98 
Sales —2.58 4.15 —3.73 oe 46.1 8.7 .96 
Merchandising —0.87 4.89 —0.58 6.54 44.1 9.6 98 
Office Practices —1.63 3.63 —2./2 4.27 47.8 daE3 .98 
Military Activities —1.12 2.62 —1.02 3.04 50.9 9.4 98 
Technical Responsibility 0.56 2.08 0.56 2.08 44.5 11.3 1.00 
Mathematics 2.39 3.35 2.39 3:39 45.4 10.2 1.00 
Science 1.90 5.23 2.78 7.31 45.6 10.6 .98 
Mechanical 4.54 10.97 5.40 12.07 47.5 9.1 1.00 
Nature 2.67 5.06 3.08 5.44 43.3 10.3 .99 
Agriculture eZ 3.23 lee S78) 49.0 9.2 1.00 
Adventure —0.80 2.48 —0.23 3.53 Biee. died .96 
Recreational Leadership 0.80 3.66 1.92 4.86 52.9 9.7 .98 
Medical Service —0.02 3.28 —0.02 3.28 45.5 10.4 1.00 
Social Service —0.87 PBoifil —0.90 3.35 47.3 8.7 98 
Religious Activities —0.76 1.41 —1.18 3.79 46.9 9.2 89 
Teaching —0.29 2.91 —0.17 3.65 40.8 9.0 .96 
Music —0.28 2.37 —0.63 4.68 46.6 10.1 91 
Art —0.54 4.16 —1.28 6.63 46.0 9.3 .95 
Writing —0.66 5.09 —0.72 6.02 45.7 9.2 .98 





» Raw scores. (The adult norm group has a standard score M of 50, SD of 10 on all scales.) 


b Standard scores. 


TABLE 31—Basic SCALE INTERCORRELATIONS 
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301). 


Note.—Set 1. Correlations above diagonal calculated on sample of Stanford University seniors, 1927 (N 


Set 2. Correlations below diagonal calculated on cross-section sample of 52-yr-old men (N = 647). 


lower end of the educational and socioeconomic 
ladder. 

Some basic demographic data are available 
for the total group. Their educational level is 
listed in Table 28. Practically all (86%) are 
high school graduates; about two-thirds have 
had some college; over a third have degrees. 
One in eight have earned graduate degrees. 
Ninety-three percent were married in 1966. 
They had, on the average, 18-yr. experience 
in their jobs. Their feeling toward their work 
is reported in Table 29 which lists the percent- 
age checking various choices in answer to a 
question on job satisfaction. 


Raw-Score-to-Standard-Score Conversion 


The mean scores for this norm group on both 
sets of scales are listed in Table 30. For the 
adults, only the raw-score statistics are listed; 
these are the figures that are used to convert 
an individual’s raw scores into standard scores 
by using the following formula: 
<0 + 50 = individual’s standard score 
where X = individual’s raw score, M = adult 
norm group raw score mean, SD = adult norm 
group raw score standard deviation. This con- 
version, which is a simple linear transformation 
and does not change the shape of the distribu- 
tion, has been used on all the remaining scores 
reported in this report, including the teenager 
scores of the norm group. The net result is to 
convert all scores into a distribution where 
this adult norm group has a mean of 50 and 
standard deviation of 10. This permits an im- 
mediate comparison between the individual’s 
score and the mean score of the norm group, 
and makes other comparisons possible also— 
such as comparing a person’s score on the Law/ 
Politics scale with his score on the Science 
scale. 

The adult group was used here because both 
Set 1 and Set 2 scores were available for them; 
because the teenagers were tested in 1930 with 
the old booklet, only Set 1 scores are available 
for them, and the means are reported in 
standard score points. As can be seen, their 
means on almost all of the scales were slightly 
below the adult mean of 50. The only excep- 
tions were the Adventure and Recreational 
Leadership scales. 
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The correlations between the Set 1 and 2 
scales with the same name are reported in 
Table 30 also, and they indicate the scales are 
essentially identical, with the mild exception 
of Religious Activities and Music, but even 
their correlations (.89 and .91) are high enough 
so that the scales could be used interchangeably. 


SCALE INTERCORRELATIONS 


The intercorrelations between the scales are 
presented in Table 31. Those above the diago- 
nal are based on the Set 1 scales, using the 
scores of a sample of 1927 Stanford University 
graduates. The correlations below the diagonal 
are based on the Set 2 scales, using the scores 
of the adult norm sample. 

Using these intercorrelations as a guide, the 
scales have been ordered so that the adjoining 
ones tend to be related. 


Suggested Profile 


The profile shown in Figure 2 has been pre- 
pared as an aid in interpreting these scores. 
The order of the scales was taken from the 
intercorrelation table. The heavy vertical line 
at 50 represents the mean score of the adult 
norm group; the lighter line, the teenage mean. 
The highest and lowest possible scores for each 
scale for both Set 1 and Set 2 are also marked. 


ScALE RELIABILITIES 


Scores are available for several groups that 
have been tested and retested with the SVIB 
over varying lengths of time. Test-retest cor- 
relations and means and standard deviations 
for both test and retest administrations are re- 
ported in Table 32. All of these comparisons 
use the Set 1 scales; as the Set 2 scales are 
usually longer, their reliabilities might be 
slightly higher. 

The 2-wk. test-retest sample contains mostly 
sophomores at the University of Minnesota, all 
volunteers from the University’s introductory 
psychology course. 

The 30-day sample includes the officers and 
men from an Army Reserve Unit at Fort 
Snelling, Minnesota. Almost all of them had 
college degrees. Their occupations were widely 
scattered, with a mild concentration in the ad- 
vertising and mass media fields. 

The Harvard students, who were tested in 
the spring of their freshmen year and retested 


in the fall of their senior year, were participants 
in an extensive longitudinal study being con- 
ducted by Stanley King and Bruce Finnie. 

The YMCA Secretaries were a group first 
tested by Strong in 1927-30 when they were 
40 yr. old, and retested by Verburg in 1951 
when they were 63 and, for the most part, re- 
tired (Verburg, 1952). 

Further information on these retest groups, 
except the YMCA Secretaries, is available in 
the current SVIB manual (Campbell, 1966). 

The median test-retest correlation of these 
basic scales for each of these groups is given at 
the bottom of Table 32, and the analogous 
figure is reported for the regular SVIB occupa- 
tional scales. As can be seen, the basic scales 
are slightly but consistently less stable over 
time than the occupational scales; the longer 
length of the latter is undoubtedly the reason. 
The difference between the reliabilities of the 
two types of scales is probably trivial; the 
length of the test-retest interval is still far more 
important in determining level of reliability 
than is type of scale. 

Test-retest reliabilities for the basic scales 
are available from another set of men who were 
studied several times by E. K. Strong, Jr. He 
tested them originally as freshmen at Stanford 
in 1930, again the following year as sopho- 
mores, once more in 1939, 5 yr. after they had 
graduated, and, finally, in 1949, 15 yr. after 
they had graduated from Stanford (Strong, 
1955). Table 33 has the mean profiles for the 
group at each testing and the test-retest cor- 
relations over the various intervals. The 
number tested varied from one time to another 
and is reported at the bottom of the table for 
each comparison. From these figures one can 
draw the following conclusions: 


1. On most of the scales, the mean scores of 
this diverse group of students resembled the 
norm group’s mean of 50. As freshmen, their 
most significant deviations were their low 
scores on the Nature and Teaching scales, 43 
and 45, respectively, and their high score of 56 
on the Adventure scale. 

2. Nineteen years later, when retested, their 
interests had shifted mainly in these same three 
areas and, on the Nature and Adventure scales, 
they closely resembled the norm group. On the 
Teaching scale, however, the shift had carried 
them beyond the norm group to a score of 53. 


TABLE 32 
Basic SCALE MEANS, STANDARD DEVIATIONS, AND TEST-RETEST CORRELATIONS FOR 


SEVERAL SAMPLES OVER VARYING TIME PERIODS 


Basic INTEREST SCALES FOR THE SVIB 
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ising 
Recreational Leadership 


Medical Service 
Social Service 


Religious Activities 
Teaching 
Music 


Technical Supervision 
Art 


Public Speaking 
Law/ Politics 
Business Management 
Sales 

Merchandi 

Office Practices 
Military Activities 
Mathematics 
Science 
Mechanical 
Nature 
Agriculture 
Adventure 
Writing 


for 53 regular SVIB 


Madn test-retest r 
scales 


Man test-retest r 
for 22 basic scales 


102; 3-yr. sample N = 189; 22-yr. sample N = 47; 30-yr. sample N = 48. 


30-day sample N = 


Note.—2-wk. sample N = 140 
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TABLE 33 


BAsts SCALE MEANS, STANDARD DEVIATIONS, AND TEST-RETEST CORRELATIONS FOR STANFORD 
STUDENTS TESTED AS FRESHMEN, SOPHOMORES, 5 YEARS AFTER GRADUATION, 
AND 15 YEARS AFTER GRADUATION 















































C C C c c 
ase 1930- met 1930~ | 1931- — 1930- | 1931- | 1930- a 
Scale 31 39 39 |_| 49 49 49 ————- 
b © vd ye f 
ven) Steyn) Ma Sm ee i DOR Spe? i Mm | sD 
Public Speaking Dee 9.8 79 52.8 9.8 so in as 9.5 .54 59 .66 52a 9.7 
Law/ Politics S22 9.7 .79 53.8 10.1 54 .56 53.6 91 ay .56 .66 53.6 9.8 
Business Management 49.2 | 10.3 .76 S10 Os 54 sol 52.4 | 10.5 A8 oo .78 51.9 | 10.2 
Sales 47.9 8.7 .70 48.9 9.3 A8 49 47.8 9.3 46 50 81 48.5 9.6 
Merchandising 50.3 10.2 wha yh) 10.7 Be cd S17 10.8 aod 56 81 S135 10.6 
Office Practices ia 10.7 .69 S250 Ae 48 roy 51:3 10.2 46 54 re 49.7 10.5 
Military Activities 51.9 9.9 .78 50.4 9.8 .63 .67 49.5 10.0 56 -62 «74 49.0 9.6 
Technical Supervision 47.8 | 11.8 68 49.1 | 10.6 Bore Roe ot.4) |) ai au 50 69 51.4 | 11.8 
Mathematics Sled 9.9 84 51.2 9.9 me edd 52.7 9.2 Ske 70 .87 SZ 9.5 
Science 49.2 10.4 .82 49.1 10.0 64 Te 52.4 | 10.0 65 aff .76 51.6 9.4 
Mechanical 47.1 10.5 85 46.7 10.3 WEE Baie 48.6 | 11.2 sda .70 85 49.1 11.4 
Nature 43.3 10.4 81 43.4 10.6 -62 -68 47.2 10.2 52 53 nie. 50.1 10.3 
Agriculture 47.0 9.8 mae 46.9 9.8 61 .67 47.5 9.8 -60 -60 .76 49.1 10.5 
Adventure 56.1 a2 nD Difiee 11.4 54 noe 52.0 114 59 0 68 50.1 10.8 
Recreational Leadership | 51.3 8.8 as 51.9 91 aol .68 49.9 9.5 63 .67 .79 49.1 10.2 
Medical Service 48.7 10.4 .80 48.3 9.9 -66 66 49.8 i cid 56 61 76 50.6 10.0 
Social Service 49.2 9.7 .70 48.6 9.7 34 44 48.2 9.7 38 A9 Sd 49.0 9.9 
Religious Activities 45.5 9.3 68 44,3 9.5 .36 ak 44,7 9.7 31 31 56 46.1 | 10.2 
Teaching 44.8 9.6 .69 46.6 9.9 AS 48 50.7 8.9 40 Eo) -62 S2ah 9.2 
Music 48.5 8.9 i 49.0 9.2 58 ol Saks 8.1 .53 51 .63 51.8 9.0 
Art 47.9 9.3 .80 48.7 9.9 59 .60 51.5 9.8 56 61 aie S152 10.1 
Writing 51.4 9.8 -79 53.5 10.0 -67 ate 53.9 10.2 64 nie 81 54.9 9.6 
Mdn test-retest r See 56 oe 54 56 76 
aN = 253. 
bN = 175, 
oN = 159, 
aN = 202. 
eN = 178, 
CN = 137. 


3. If one assumes that a 3-point shift—which 
is roughly one-third SD—is the smallest dif- 
ference worth attending to, the following shifts 
occurred over time within this sample: 


GAINS 
Teaching 8 points 
Nature 7 points 
Music 4 points 
Writing 4 points 
Art 3 points 
Business Management 3 points 
Technical Supervision 3 points 
LOSSES 
Adventure 6 points 
Military Activities 3 points 


““Middle-aging intellectual responsibility” is 
the apparent trend. 

4. Their scores on the remaining scales over 
this 19-yr. period did not shift even this mild 
amount. Interests in areas such as Public 
Speaking, Mathematics, Science, and Religious 
Activities were, as measured by these scales, 
surprisingly constant. 

5. The test-retest correlations reported at the 
bottom of Table 33 provide a measure of the 


stability of the rank order of the individuals 
within the total sample. The most intriguing 
finding here is that the stability over the 
10-yr. adult span, 1939-1949, was equal to 
the 1-yr. freshmen-sophomore period—further 
support for the familiar finding that interests 
are remarkably stable after age 25. 


Similar results for another group of college 
students are reported in the following section 
on validity. 


VALIDITY 


There is increasing awareness among psy- 
chologists that there is no single way of es- 
tablishing the validity of any test or inventory 
as no one index can adequately represent the 
relationship between test behavior and real- 
world behavior. Even in those situations where 
the test is designed for a specific purpose, for 
example, the Medical School Aptitude Test, it 
is not clear which aspect of medical school per- 
formance should be used as the criterion. For 
measures of interests, the situation is even more 
confusing as there are no criteria available that 
have even the relative clarity of medical school 
accomplishments. How can the validity of, say, 


TABLE 34 


Basic INTEREST SCALES FOR THE SVIB 25 


BRIEF DESCRIPTION OF THE OCCUPATIONAL SAMPLES WHOSE BASIC SCALE 
MEAN Scores APPEAR IN TABLES 35-56 


Accountant 
Advertising Men 
Air Force Officer 


Animal Husbandry Prof. 


Anthropologist 
Army officer 


Architect 

Artist 

Astronaut 
Authors 

Banker (’34) 
Banker (64) 
Biologist 

Bus. Ed. Teacher 
Carpenter 

CPA (’44) 

CPA (65) 

Cham Comm Exec 


Chemist 

Comm Rec Director 
Computer Programmer 
Corp Pres (’35) 

Corp Pres (’65) 

Credit Manager 
Danforth Fellow 


Dentist 

Economist 

Elem Teacher 
Engineer 

Farmer (36) 

Farmer (’67) 

Forest Service Men 
Funeral Director 
Generals and Admirals 


Governor 
Guidance Counselor 
Interior Decorator 


Judge 
Lawyer (’27) 
Lawyer (49) 
Legislator 


Librarian 
Mathematician 
Math-Science Teacher 
McKnight Fellow 


Minister (’27) 


Year 
tested 


1931 
1964 
1964 
1935 
1965 
1958 
1966 


1932 
1967 
1965 
1928 
1936 
1967 
1936 
1945 
1965 


1965 
c1950 
1967 


1967 
1927 
1949 
1967 


1959 
1929 
1936 
1966 


1927 





Investigator 


Winter 


CIMR» 

Rossmann & Lips 

Army Personnel Re- 
search Office 

Strong 

Strong 

CIMR 

Strong 

Strong 

CIMR 

Lindsay 

Bacon 

Strong 

Strong 

Rhode 

Strong 


Strong 

Roys 

Perry & Cannon 
Strong 

CIMR 

Strong 

Rossmann & Bentley 


Strong 
Rossmann & Lips 
Winkle 
Strong 
Strong 
Krause 
Strong 
Strong 
CIMR 


CIMR 
Brown 


CIMR 


CIMR 
Strong 
Berdie & Hagenah 
Willow 


Winters 
Strong 
Strong 
CIMR 


Strong 


Brief description 


Junior accountants 


Air Force officers training at Wright Patter- 
son Air Force Base 

University faculty members 

University faculty 

Above average officers, mostly field grade, all 
West Point graduates 

Architects, mostly from California 


Astronauts, mostly military officers 
Roughly 60% journalists, 40% authors 


All listed in American Men of Science 


Senior Certified Public Accountants 

Minnesota Certified Public Accountants 

Chamber of Commerce and Trade Associa- 
tion executives 

Industrial chemists 

Community recreation directors 


Corporation presidents 
Corporation presidents 


Danforth Foundation Fellows—aspiring col- 
lege professors 


University faculty members 
Minnesota elementary teachers 


Farmers, mostly from Pacific Northwest 
South Dakota farmers 


3- and 4Star Army Generals and Navy 
Admirals 
Retired state governors 


Members, American Institute of Interior 
Design 

Federal District Judges 

California lawyers 

Minnesota lawyers 

Senators and Representatives from Minne- 
sota Legislature 


High school mathematics-science teachers 
McKnight Fellows—actors in Tyrone Guth- 
rie Theater in Minneapolis 


a CIMR = Center for Interest Measurement Research, University of Minnesota. 
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TABLE 34— (Continued) 


Brief description 























Members, Minneapolis Symphony Orchestra 
Mainly performing musicians 


Members, National Institute of Arts and 


Above average naval officers 
Newspaper reporters and television news 
broadcasters 


Members, American Optometric Association 


Texas petroleum engineers 

New York pharmacists 

Mostly news photographers 
journalists 


and _photo- 


Random sample of the American Medical 
Association 
Representative sample of those boarded in 
designated specialty 
Representative sample of those boarded in 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 
Representative sample 
designated specialty 


of those boarded in 
of those boarded in 
of those boarded in 
of those boarded in 
of those boarded in 
of those boarded in 


of those boarded in 


Strong’s original aviator group 
Police officers from several cities 
University faculty members 
Minnesota priests 


Veterans Administration clinical psycholo- 


Ninety percent of APA membership 
Experimental psychologists, subset of the 
total 1949 group 


Sample N ted Investigator 
Minister (’65) 96 | 1965 | CIMR 
Mpls. Symphony 53 | 1966 | CIMR 
Musician 450 | 1952 | Strong 
Music Teacher 493 | 1952 | Strong 
NIAL Member 42 | 1966 | CIMR 
Letters 
Navy Officer 645 | 1965 | Stephenson & Abrahams 
Newsmen 283 | 1967 | CIMR 
Office Worker 326 | 1928 | Strong 
Optometrist 405 | 1963 | Acree 
Osteopath 585 | 1939 | Strong 
Personnel Director 147 | 1927 | Strong 
Petroleum Engineer 385 | 1965 | Alford 
Pharmacist 309 | 1947 | Schwebel 
Photographer 253 | 1967 | CIMR 
Physical Therapist 350 | 1957 | Strong 
Physician (727) 175 | 1927 | Strong 
Physician (49) 534 | 1949 | Strong & Tucker 
Internist 209 | 1949 | Strong & Tucker 
Neurological Surgeon 47 | 1949 | Strong & Tucker 
Orthopedic Surgeon 71 | 1949 | Strong & Tucker 
Pathologist 154 | 1949 | Strong & Tucker 
Pediatrician 96 | 1949 | Strong & Tucker 
Physiatrist 460 | 1966 | Athelstan 
Psychiatrist 404 | 1949 | Strong & Tucker 
Radiologist 111 | 1949 | Strong & Tucker 
Surgeon 188 | 1949 | Strong & Tucker 
Urologist 84 | 1949 | Strong & Tucker 
Physicist (’27) 192 | 1929 | Strong 
Physicist (’67) 64 | 1967 | CIMR 
Pilot 510 | 1941 | Strong 
Policemen 254 | 1933 | Strong 
Political Scientist 177 | 1967 | Rossmann & Lips 
Priest 234 | 1966 | Lepak 
Printer 270 | 1936 | Strong 
Production Manager 216 | 1935 | Strong 
Psychologist (’31) 174 | 1931 | Strong 
Psychologist (’47) 149 | 1947 | Kelly 
gists 
Psychologist (’49) 1045 | 1949 | Kriedt 
Psychologist (Exp) 256 | 1949 | Kriedt 
Public Admin 550 | 1941 | Strong 


Public administrators from a variety of pub- 
lic agencies 
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TABLE 34—(Continued) 


Sample N bie Investigator 
Pulitzer Prize 84 | 1965 | CIMR 
Purch Agent 219 | 1931 | Strong 
Rehab Counselor 272 | 1949 | Acree 
Salesmen, Auto 116 | 1929 | Strong 
Salesmen, Computer 190 | 1964 | Stein 
Salesmen, Encyclopedia 49 | 1966 | CIMR 
Salesmen, Life (’31) 310 | 1931 | Strong 
Salesmen, Life (’66) 76 | 1966 | CIMR 
Salesmen, 3M 100 | c1960 | Kirchner 
Salesmen, 3M applicants] 100 | c1960 | Kirchner 
Salesmen, PG&E 179 | 1939 | Strong 
Salesmen, Real Estate 243 | 1932 | Strong 
Salesmen, Steel 61 | 1966 | Berdie 
Sales Manager 228 | 1932 | Strong 
School Super (730) 190 | 1930 | Strong 
School Super (’65) 153 | 1965 | CIMR 
Soc Sci Teacher 217 | 1936 | Strong 
Social Worker (’53) 400 | 1953 | McCornack 
Social Worker (’67) 54 | 1967 | CIMR 
Sociologist 198 | 1965 | Rossmann & Lips 
Student Personnel 192 | 1961 | Clark 
Unitarian Minister 113 | 1950 | Strong 
Vet (49) 310 | 1949 | Hannum 
Vet (66) 478 | 1966 | Hannum & Alsip 
YMCA PD 216 | 1927 | Strong 
YMCA Sec 113 | 1927 | Strong 
YMCA Staff 184 | 1961 | Seashore 








Brief description 


Pulitzer Prize winners, mostly journalists and 


novelists 

Purchasing agents 

Vocational rehabilitation counselors from the 
Veterans Administration 

Automobile salesmen 

Control Data Corporation computer sales- 
men 

Outstanding World Book encyclopedia sales- 
men 


$2,000,000 life insurance salesmen 

Salesmen on staff of 3M 

Men applying for sales positions at 3M 
company 

Appliance salesmen, Pacific Gas & Electric 
company 


Structural steel salesmen 


School superintendents 
School superintendents 
High school social science teachers 


University faculty members 
Members, American College Personnel Asso- 
ciation 


Iowa veterinarians 
Towa veterinarians 
YMCA phyiscal director 


YMCA staff (comparable to combination of 
original physical directors and secretaries) 





a measure of interests in Religious Activities 
be studied? 

The establishment of validity for such mea- 
sures, it is the authors’ opinion, depends on 
the collection and organization of a vast 
amount of data covering several aspects of the 
inventory. All psychological tests and inven- 
tories have some potential validity—for 
persons do not give random answers—but until 
the relationships between the test behavior 
and an individual’s actions are thoroughly 
studied, using many diverse approaches, the 
value of the test remains obscure. 

With this orientation, a substantial amount 
of information is presented here, showing that 
responses to these SVIB scales are related to 
the occupation that the individual chooses. 
The concept of occupational membership was 


used here because it has proven valuable in 
earlier work with the regular SVIB scales, 
because it is objective, because—as E. K. 
Strong frequently commented—hardly anyone 
finds fault with it, and because it is closely re- 
lated to the intended purpose of the SVIB, 
that is, as an aid in vocational counseling. 
The information that follows has been 
organized in three traditional areas: content, 
concurrent, and predictive validity. 


Content Validity 


Content validity refers to the actual content 
of the scales. For these basic scales, a certain 
amount of content validity is assured as each 
item has survived both a statistical and a 
“common sense” screening to determine that, 
within each scale, the items are related. Con- 
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TABLE 35 
MALE OccuPATIONS MEAN SCORES ON THE ADVENTURE SCALE 
M stand- s 1 
ard score ae 
70 
69 
68 
67 
66 Astronaut 
65 
64 
63 
62 
61 
60 Salesmen, 3M applicants 
59 Salesmen, 3M/ Policemen/ Salesmen, Computer 
58 Air Force Officer/ Navy Officer 
mil Salesmen, Steel/ Pilot 
56 McKnight Fellow/ YMCA Staff/ Petroleum Engineer 
55 Photographer/ Elem Teacher/ Comm Rec Director/ Computer Programmer 
54 CPA 65/ Army Officer/ Legislator/ Newsmen/ Salesmen, Life 66/ Social Worker 67 
53 Cham Comm Exec/ Salesmen, Auto/ Forest Service Men/ Physical Therapist 
52 Vet 66/ YMCA PD/ Optometrist/ Minister 65/ Orthopedic Surgeon 
51 Salesmen, Encyclopedia/ Generals and Admirals/ Danforth Fellow/ CPA 44/ Corp Pres 65/ Neuro- 
logical Surgeon 
50 Priest/ Sociologist/ Credit Manager/ Salesmen, PG&E/ Printer/ Carpenter/ School Super 65 
49 Radiologist/ Urologist/ Farmer 36/ Surgeon/ Mpls Symphony/ Osteopath/ Math-Science Teacher/ 
Student Personnel 
48 Psychologist 47/ Salesmen, Real Estate/ Public Admin/ Pediatrician/ Chemist/ Biologist/ Banker 
64/ Physiatrist/ Bus Ed Teacher/ Farmer 67/ Musician/ Physicist 67/ Anthropologist 
47 Accountant/ Social Worker 53/ Psychologist 31/ Office Worker/ Unitarian Minister/ Economist/ 
Production Manager/ Psychologist (Exp)/ Personnel Director/ Funeral Director/ Physician 49/ 
Psychiatrist/ Animal Husbandry Prof/ Artist/ Vet 49 
46 Pharmacist/ Soc Sci Teacher/ Physician 27/ Engineer/ Purch Agent/ Political Scientist/ Sales Man- 
ager/ Librarian/ Dentist/ Lawyer 49/ Lawyer 27/ Pathologist/ Authors/ Interior Decorator/ 
Governor/ Advertising Men 
45 Rehab Counselor/ Architect/ YMCA Sec/ Salesmen, Life 31/ Guidance Counselor/ Pulitzer Prize/ 
Minister 27/ Psychologist 49/ Music Teacher/ Internist 
44 Judge 
43 Banker 34/ Corp Pres 35/ Physicist 27 
42 
41 School Super 30 
40 Mathematician/ NIAL Member 
39 
38 
aii 
36 
35 





Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


sequently one has some confidence that when 
a person answers, for example, Like or Dislike 
to the items, Algebra, Arithmetic, Calculus, 
Geometry, Mathematics, and Physics, he is 
telling something about his interests in 
mathematics. 

However, although content validity is quite 
helpful in scale interpretation, by itself it 


reveals nothing about the relationship between 
an individual’s interests and his actual be- 
havior. For such information, one needs to 
turn to concurrent and predictive validity. 


Concurrent Validity 


Concurrent validity refers to the power of 
the test to distinguish between specified groups 
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TABLE 36 


MALE OccuPATIONS MEAN SCORES ON THE AGRICULTURE SCALE 











M stand- 
ard score 


70 
69 
68 
67 
66 
65 
64 
63 
62 
61 
60 
59 
58 
SM 
56 
55 
54 
53 
Sy 
51 
50 
49 


48 


47 


46 


45 


44 


Sample 











Farmer 67 


Farmer 36/ Vet 49 
Vet 66 
Forest Service Men/ Animal Husbandry Prof 


Pilot 

Army Officer/ Generals and Admirals 

Orthopedic Surgeon 

Salesmen, Encyclopedia/ Policemen/ Comm Rec Director/ Physical Therapist/ Astronaut/ Governor 
YMCA PD/ Salesmen, PG&E/ Biolcgist/ Urologist/ Legislator/ Osteopath 

Public Admin/ Radiologist/ Carpenter/ Surgeon/ Salesmen, Life 66/ Petroleum Engineer/ Neuro- 
logical Surgeon 

Salesmen, Real Estate/ YMCA Sec/ Salesmen, Steel/ Salesmen, Life 31/ YMCA Staff/ Dentist/ 
Banker 34/ Minister 27/ Salesmen, Computer/ Funeral Director/ Air Force Officer/ Navy Officer/ 
Physiatrist 

Salesmen, 3M applicants/ Salesmen, 3M/ Soc Sci Teacher/ Pediatrician/ Physician 27/ Credit Man- 
ager/ Photographer/ Elem Teacher/ Sales Manager/ CPA 44/ Banker 64/ Corp Pres 65/ Physician 
49/ Artist 

Chemist/ Engineer/ School Super 30/ Unitarian Minister/ Production Manager/ Personnel Director/ 
Salesmen, Auto/ Corp Pres 35/ Bus Ed Teacher/ Pathologist/ Math-Science Teacher 

Rehab Counselor/ Priest/ Cham Comm Exec/ Architect/ Social Worker 53/ Office Worker/ Printer/ 
Guidance Counselor/ McKnight Fellow/ Optometrist/ Computer Programmer/ Lawyer 49/ School 
Super 65/ Psychiatrist/ Social Worker 67/ Music Teacher/ Musician/ Internist/ Anthropologist 
Pharmacist/ Accountant/ Psychologist (Exp)/ Purch Agent/ Lawyer 27/ Newsmen/ Minister 65/ 
Advertising Men/ Physicist 27/ Psychologist 49/ Authors 

Psychologist 31/ Mpls Symphony/ Danforth Fellow/ Physicist 67/ Interior Decorator/ NIAL Mem- 
ber/ Student Personnel 

Psychologist 47/ Sociologist/ CPA 65/ Librarian/ Pulitzer Prize/ Judge/ Mathematician 

Economist 

Political Scientist 





Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


at the current moment. For example, can these 
scales discriminate between the interests of 
men who are currently salesmen or scientists? 
This is in contrast to predictive validity where 
the issue is whether the scales can identify dif- 
ferences between boys who will become sci- 
entists or salesmen. 


The relevant data for concurrent validity 
are presented in Tables 34 through 56. For 
each of the basic scales, the means of 113 occu- 
pational samples are rank ordered, thus per- 
mitting at a glance an evaluation of the power 
of the scale to separate occupations. 

These occupational samples represent the 
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TABLE 37 


MALE OccuPpATIONS MEAN SCORES ON THE ART SCALE 











M stand- : 
; Sample 
ard score 
70 
69 
68 
67 
66 
65 Interior Decorator 
64 Architect/ McKnight Fellow 
63 Artist 
62 
61 
60 Photographer 
59 Mpls Symphony/ NIAL Member 
58 Pulitzer Prize 
57 Unitarian Minister/ Librarian/ Danforth Fellow/ Musician 
56 Advertising Men/ Physiatrist/ Music Teacher/ Anthropologist/ Neurological Surgeon 
3 Psychologist 47/ Psychologist 31/ Elem Teacher/ Minister 27/ Minister 65/ Orthopedic Surgeon/ 
Authors 
54 Priest/ Sociologist/ Social Worker 53/ Pediatrician/ Biologist/ Economist/ Psychologist (Exp)/ 
Dentist/ Newsmen/ Psychiatrist 
53 Salesmen, 3M/ YMCA Sec/ Urologist/ Personnel Director/ Political Scientist/ Judge/ Physical 
Therapist/ Psychologist/ 49/ Social Worker 67/ Physicist 67/ Pathologist 
52 Salesmen, 3M applicants/ YMCA PD/ Radiologist/ Physician 27/ Salesmen, PG&E/ Carpenter/ 
Surgeon/ Salesmen, Encyclopedia/ Comm Rec Director/ YMCA Staff/ Computer Programmer/ 
Generals and Admirals/ Physicist 27/ Physician 49/ Internist/ Student Personnel 
51 Rehab Counselor/ Cham Comm Exec/ Public Admin/ Office Worker/ Engineer/ Printer/ Corp Pres 
35/ Optometrist/ Bus Ed Teacher/ Osteopath/ Astronaut/ Mathematician 
50 Accountant/ Soc Sci Teacher/ Chemist/ Credit Manager/ Salesmen, Life 31/ School Super 30/ 
Guidance Counselor/ Funeral Direc/ Policemen/ Army Officer/ Legislator/ School Super 65/ Salesmen, 
Life 66/ Math-Science Teacher 
49 Salesmen, Real Estate/ Salesmen, Steel/ Air Force Officer/ Production Managers/ Salesmen, Auto/ 
Forest Service Men/ Pilot/ Sales Manager/ Salesmen, Computer/ APA 44/ Navy Officer/ Lawyer 27/ 
Governor 
48 Pharmacist/ Farmer 36/ Purch Agent/ Lawyer 49/ Animal Husbandry Prof/ Petroleum Engineer 
47 Vet 66/ CPA 65/ Banker 34/ Banker 64/ Corp Pres 65 
46 Vet 49 
45 Farmer 67 
44 
43 
42 
41 
40 
39 
38 
37 
36 
35 








Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 


following samples indicate appropriate year. 


majority of adult men who have been tested 
for research purposes with the SVIB since it 
was first published in 1927. Further informa- 
tion on each sample is listed in Table 57. 
Most of these occupational samples include 
men with at least 3 yr. of experience who say 


they like their work. More extensive informa- 
tion on many of the samples can be found in 
the SVIB manual (Campbell, 1966). 

One large block of these samples came from 
Strong’s files, including those used to build the 
occupational scales, another block came from 
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TABLE 38 


MALE OccuPATIONS MEAN SCORES ON THE BUSINESS MANAGEMENT SCALE 











M stand- 
ard score Sample 

70 

69 

68 

67 

66 

65 

64 

63 

62 Salesmen, 3M applicant 

61 Salesmen, 3M/ Credit Manager 

60 Cham Comm Exec/ Sales, Encyclopedia/ Bus Ed Teacher 

59 YMCA Sec/ Personnel Director/ Salesmen, Computer 

58 Salesmen, Steel/ Comm Rec Director 

57 Rehab Counselor/ Accountant/ Office Worker/ Salesmen, PB&E/ Sales Manager/ YMCA Staff/ 
School Super 65 

56 Guidance Counselor/ Purch Agent/ Legislator 

55 Public Admin/ Corp Pres 35/ Funeral Director/ CPA 65/ Production Manager/ Army Officer/ 
CPA 44 

54 Soc Sci Teacher/ Social Worker 53/ Salesmen, Life 31/ Air Force Officer/ Salesmen, Auto/ Navy 
Officer/ Banker 64/ Governor 

53 Salesmen, Real Estate/ School Super 30/ Banker 34/ Generals and Admirals/ Corp Pres 65/ Salesmen, 
Life 66 

52 YMCA PD/ Petroleum Engineer/ Student Personnel 

51 Pharmacist/ Elem Teacher/ Forest Service Men/ Social Worker 67/ Music Teacher 

50 Policemen/ Optometrist/ Lawyer 49/ Minister 65/ Math-Science Teacher/ Advertising Men 

49 Psychologist 47/ Priest/ Librarian/ Computer Programmer/ Minister 27/ Physical Therapist 

48 Vet 66/ Engineer/ Printer/ Unitarian Minister/ Carpenter/ Farmer 36/ Pilot/ Interior Decorator 

47 Economist/ Physiatrist/ Psychologist 49/ Orthopedic Surgeon/ Vet 49 

46 Pediatrician/ Chemist/ Urologist/ Lawyer 27/ Newsmen/ Psychiatrist/ Farmer 67/ Osteopath/ 
Astronaut 

45 Sociologist/ Radiologist/ Judge/ Animal Husbandry Prof/ Musician 

44 Psychologist 31/ Surgeon/ Political Scientist/ Dentist/ Neurological Surgeon 

43 Architect/ Psychologist (Exp)/ Physician 49/ Internist 

42 Biologist/ Danforth Fellow 

41 Physician 27/ Photographer/ Mpls Symphony/ Pathologist 

40 McKnight Fellow/ Physicist 27/ Physicist 67/ Mathematician/ Authors 

So Anthropologist 

38 Pulitzer Prize 

37 

36 Artist/ NIAL Member 

35 





Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


The mean scores in Tables 35 through 56 are 


the work of the present authors’ research in- 
stitute at Minnesota, and the remainder came 
from various other investigators working with 
the SVIB. Practically everyone in the latter 
group who was asked for his data was co- 
operative—if the information had not been 
lost or discarded—and the authors would like 
to publicly acknowledge gratitude for their 
help. 


reassuring. Occupations which should score 
high did so; military officers scored highest on 
the Military Activities scale, ministers scored 
highest on the Religious Activities scale, sci- 
entists scored highest on the Science scale and 
so on—there were virtually no surprises. In 
fact, the results are so straightforward as to 
lead one to that conclusion voiced frequently 
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TABLE 39 
MALE OccuPATIONS MEAN SCORES ON THE LAW/PoLtTIcs SCALE 
M stand- 
Sample 
ard score 
70 
69 
68 
67 
66 
65 
64 
63 
62 Judge/ Legislator 
61 Salesmen, 3M applicants/ Salesmen, 3M/ Political Scientist/ Salesmen, Computer/ Governor 
60 Cham Comm Exec 
59 Laywer 49 
58 Social Worker 53/ Credit Manager/ CPA 65/ Economist/ Navy Officer/ Lawyer 27/ Newsmen/ 
Salesmen, Life 66/ Student Personnel 
37 Rehab Counselor/ Unitarian Minister/ Air Force Officer/ Salesmen, Encyclopedia/ Comm Rec Di- 
rector/ Army Officer/ School Super 65/ Social Worker 67 
56 Priest/ Sociologist/ Salesmen, Steel/ Guidance Counselor/ Personnel Director/ YMCA Staff/ Gen- 
erals and Admirals/ Bus Ed Teacher/ Danforth Fellow 
55 Public Admin/ Social Sci Teacher/ School Super 30/ Elem Teacher/ Minister 65/ Astronauts 
54 Psychologist 47/ Policemen/ Optometrist/ Computer Programmer/ Corp Pres 65/ Petroleum Engi- 
neer 
53 YMCA Sec/ Salesmen, PG&E/ McKnight Fellow/ CPA 44/ Physiatrist/ Physical Therapist/ Psy- 
chiatrist/ Psychologist 49 
52 Accountant/ Pediatrician/ Office Worker/ Salesmen, Life 31/ Salesmen, Auto/ Librarian/ Banker 34/ 
Minister 27/ Orthopedic Surgeon/ Anthropologist/ Neurological Surgeon 
51 Vet 66/ Salesmen, Real Estate/ Psychologist 31/ Radiologist/ Urologist/ Psychologist (Exp)/ Pulitzer 
Prize/ Banker 64 
50 Printer/ Production Manager/ Purch Agent/ Pilot/ Sales Manager 
49 Pharmacist/ YMCA PD/ Corp Pres 35/ Biologist/ Photographer/ Surgeon/ Forest Service Men/ 
Animal Husbandry Professor/ Music Teacher/ Osteopath/ Internist/ Math-Science Teacher 
48 Funeral Director/ Mpls Symphony/ Advertising Men/ Physician 49/ Musician/ Physicist 67/ Authors 
47 Engineer/ Farmer 36/ Dentist/ Pathologist/ Vet 49 
46 Physician 27/ Chemist/ Carpenter/ Mathematician/ Interior Decorator/ NIAL Member 
45 Architect/ Farmer 67 
44 Physicist 27 
43 
42 Artist 
41 
40 
39 
38 
Sil 
36 
35 








Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate vear. 


by 


students 


in introductory psychology studying these tables of means, both to learn 


courses: ‘‘You had to do all that research just 
to find that out?” 

Clearly, scores on these scales are related, in 
a highly significant fashion, to the occupations 
of adult men. Anyone who would understand 
these scales thoroughly must spend some time 


more about the relative levels of scores on each 
scale, and to learn more about each occupation. 

These rank-ordered means provide a firmer 
foundation for ascertaining the importance of 
scores at various levels. Scanning these tables 
suggests that scores above 58 or 59 are high 
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TABLE 40 


MALE OccuPATIONS MEAN SCORES ON THE MATHEMATICS SCALE 














M stand- Sample 
ard score 

70 

69 

68 

67 

66 

65 

64 

63 

62 

61 Astronauts 

60 Physicist 27/ Physicist 67/ Mathematician 

59 

58 Air Force Officer/ Computer Programmer/ Salesmen, Computer/ Petroleum Engineer 

oT, Engineer/ Army Officer 

56 Chemist/ CPA 65/ Economist/ Generals and Admirals/ Corp Pres 65 

55 Psychologist (Exp)/ Optometrist/ CPA 44/ Navy Officer/ Animal Husbandry Prof 

54 Psychologist 31/ Danforth Fellow/ Physiatrist/ Math-Science Teacher 

53 Accountant/ Pediatrician/ School Super 30/ Biologist/ Urologist/ Production Manager/ Elem 
Teacher/ Pilot/ School Super 65/ Psychologist 49/ Pathologist/ Orthopedic Surgeon/ Neurological 
Surgeon 

2 Architect/ Radiologist/ Credit Manager/ Salesmen, Steel/ Corp Pres 35/ Forest Service Men/ Banker 
64/ Psychiatrist 

OL Salesmen, 3M/ Public Admin/ Office Workers/ Personnel Director/ Surgeon/ Sales Manager/ Lawyer 
49/ Physician 49/ Internist 

50 Psychologist 47/ Vet 66/ Pharmacist/ Sociologist/ Salesmen, PG&E/ Guidance Counselor/ Carpen- 
ter/ Farmer 36/ Purch Agent/ Judge/ Legislator/ Physical Therapist/ Bus Ed Teacher/ Student Per- 
sonnel 

49 Rehab Counselor/ Physician 27/ Salesmen, Auto/ Banker 34/ Lawyer 27/ NIAL Member/ Governor 

48 Salesmen, 3M applicant/ YMCA PD/ Policemen/ Political Scientist/ Mpls Symphony/ Comm Rec 
Director/ Dentist/ Minister 27/ Osteopath/ Anthropologist/ Vet 49 

47 Cham Comm Exec/ YMCA Sec/ Salesmen, Life 31/ Printer/ Unitarian Minister/ Funeral Director/ 
Salesmen, Encyclopedia 

46 Priest/ Salesmen, Real Estate/ Photographer/ McKnight Fellow/ Librarian/ YMCA Staff/ Minister 
65/ Salesmen, Life 66/ Music Teacher/ Musician 

45 Soc Sci Teacher/ Social Worker 53/ Social Worker 67/ Farmer 67/ Advertising Men 

44 

43 Newsmen 

42 Pulitzer Prize/ Artist/ Interior Decorator 

41 

40 Authors 

39 

38 

37 

36 

35 








Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


enough to be important from an occupational 
standpoint; scores below 42 or 43 indicate im- 
portant areas of rejection, at least among 
adults. Thus, the present authors’ recommen- 
dation is that scores above 58 be considered 
high, those below 42, low. Some sharpening of 


this interpretation will obviously be necessary, 
especially with teenagers and especially with 
the Adventure scale where one knows high 
scores will likely decrease over time, but these 
figures can be used as initial landmarks. 
Though these data are positive and meaning- 
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TABLE 41 


MALeE OccuPATIONS MEAN SCORES ON THE MECHANICAL SCALE 

















M stand- Sample 
ard score 

70 

69 

68 

67 

66 

65 

64 

63 

62 

61 

60 

59 

58 

57 

56 Air Force Officer/ Carpenter/ Orthopedic Surgeon 

55 

54 Radiologist/ Pilot/ Army Officer/ Farmer 67/ Neurological Surgeon 

53 Engineer/ Urologist/ Physicist 27/ Petroleum Engineer 

Dy Psychologist (Exp)/ Computer Programmer/ Physical Therapist/ Astronaut/ Math-Science Teacher 

oii Chemist/ Printer/ Production Manager/ Surgeon/ Forest Service Men/ Policemen/ Navy Officer/ 
Physiatrist/ Physicist 67 

50 Pediatrician/ Salesmen, PG&E/ Farmer 36/ Optometrist/ Salesmen, Computer/ Generals and 
Admirals/ Pathologist 

49 Biologist/ Dentist/ CPA 44/ Psychiatrist/ Osteopath 

48 Architect/ YMCA PD/ Psychologist 31/ Credit Manager/ Personnel Director/ Elem Teacher/ 
Physician 49/ Psychologist 49 

47 Vet 66/ Public Admin/ Accountant/ Salesmen, Auto/ Purch Agent/ Comm Rec Director/ Minister 
27/ Banker 64/ Corp Pres 65/ Animal Husbandry Prof/ Bus Ed Teacher/ Internist/ Mathematician/ 
Vet 49 

46 Salesmen, 3M applicants/ Salesmen, 3M/ YMCA Sec/ Physician 27/ Salesmen, Steel/ Office Worker/ 
Unitarian Minister/ Guidance Counselor/ Corp Pres 35/ Mpls Symphony/ YMCA Staff/ Musician/ 

45 Psychologist 47/ Rehab Counselor/ Pharmacist/ Social Worker 53/ CPA 65/ Photographer/ Social 
Worker 67 

44 Priest/ Cham Comm Exec/ Soc Sci Teacher/ School Super 30/ Funeral Director/ Sales Manager/ 
Legislator/ Banker 34/ Minister 65/ School Super 65/ Music Teacher/ Artist 

43 Salesmen, Real Estate/ Sociologist/ Economist/ Librarian/ Anthropologist / Student Personnel 

42 Danforth Fellow 

41 Salesmen, Encyclopedia/ McKnight Fellow/ Lawyer 49/ Advertising Men/ Governor 

40 Salesmen, Life 31/ Lawyer 27/ Interior Decorator 

39 Judge/ Newsmen 

38 Political Scientist/ Pulitzer Prize/ Salesmen, Life 66/ Authors/ NIAL Member 

37 

36 

35 





Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


ful, one disappointing aspect—at least for the 
senior author—was the magnitude of differ- 
ences between extreme groups. On most of the 
scales, the range from the highest mean to the 
lowest was about 20 standard score points, or 
2 SDs. While that is a substantial separation, 
especially when viewed against results from 


other tests (if reported in percentiles, that 
would be the difference between the fifteenth 
and eighty-fifth percentiles), still it is less than 
the 3 or 4 SDs found between extreme scores 
on the regular SVIB occupational] scales. To 
achieve purity of content, and thus easier 
interpretation, empirical validity has suffered. 
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TABLE 42 


MAtLeE Occupations MEAN SCORES ON THE MEDICAL SERVICE SCALE 











M stand- 














ard score Sample 

70 

69 

68 

67 

66 

65 

64 Dentist/ Physical Therapist 

63 Vet 66/ Urologist/ Physiatrist/ Osteopath/ Vet 49 

62 Pharmacist/ Radiologist/ Pediatrician/ Surgeon/ Physician 49/ Neurological Surgeon 

61 Physician 27 

60 Optometrist/ Psychiatrist/ Pathologist/ Internist/ Orthopedic Surgeon 

59 YMCA PD/ Animal Husbandry Prof 

58 Biologist 

57 Psychologist 47/ Math-Science Teacher 

56 Psychologist (Exp)/ Comm Rec Director/ YMCA Staff 

oS Psychologist 31/ Psychologist 49 

54 Salesmen, 3M applicants/ Rehab Counselor/ Priest/ Social Worker 53/ Chemist/ Elem Teacher/ 
Policemen/ Minister 27/ Social Worker 67/ Astronaut 

53 YMCA Sec/ Unitarian Minister/ Guidance Counselor/ Funeral Director/ Salesmen, Computer/ 
School Super 65/ Student Personnel 

52 Salesmen, 3M/ Soc Sci Teacher/ Credit Manager/ Air Force Officer/ Mpls Symphony/ Army Officer/ 
Computer Programmer/ Legislator/ Danforth Fellow/ Minister 65/ Salesmen, Life 66/ Petroleum Engi- 
neer 

st Cham Comm Exec/ Public Admin/ Salesmen, PG&E/ Forest Service Men/ Pilot/ Navy Officer/ 
Musician/ Governor 

50 Sociologist/ Salesmen Steel/ Office Worker/ School Super 30/ Personnel Director/ Farmer 36/ Sales- 
men, Auto/ Librarian/ CPA 44/ Music Teacher/ Bus Ed Teacher/ Anthropologist 

49 Salesmen, Life 31/ Printer/ CPA 65/ Production Manager/ Photographer/ McKnight Fellow/ Gen- 
erals and Admirals/ Physicist 67 

48 Accountant/ Economist/ Carpenter/ Salesmen, Encyclopedia/ Sales Manager/ Pulitzer Prize/ 
Judge/ Lawyer 49/ Newsmen/ Banker 64/ Physicist 27/ NIAL Member 

47 Salesmen, Real Estate/ Engineer/ Corp Pres 35/ Purch Agent/ Banker 34/ Lawyer 27/ Corp Pres 65/ 
Artist 

46 Architect/ Political Scientist/ Advertising Men/ Mathematician/ Author/ Interior Decorator 

45 Farmer 67 

44 

43 

42 

41 

40 

39 

38 

37 

36 

35 








Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


Predictive Validity 


The predictive validity of these scales can be 
demonstrated by reanalyzing some SVIBs 
collected by Berdie (1960) and Schletzer 
(1963). Berdie identified students who gradu- 
ated from the University of Minnesota in cur- 


ricula that are highly predictive of eventual 
occupations, specifically, accounting, dentistry, 
journalism (which is less predictive than the 
others), law, mechanical engineering, and 
medicine. Each of these students had com- 
pleted the SVIB as high school seniors, and 


36 CAMPBELL, BoRGEN, EASTES, JOHANNSON, AND PETERSON 


TABLE 43 


Mate OccuPpaTIONS MEAN SCORES ON THE MERCHANDISING SCALE 








M stand- 
ard score 


Sample 





70 
69 
68 


62 Salesmen, 3M 
61 Salesmen, 3M applicants/ Bus Ed Teacher 


59 Credit Manager 


58 Cham Comm Exec/ Salesmen, PG&E/ Salesmen, Encyclopedia 
57 Salesmen, Steel/ Office Worker/ Salesmen, Computer 
56 Pharmacist/ YMCA Sec/ Funeral Director/ Salesmen, Auto/ Purch Agent 


55 Comm Rec Director 


54 Rehab Counselor/ Accountant/ Soc Sci Teacher/ Guidance Counselor/ CPA 65/ Sales Manager/ 


YMCA Staff/ Interior Decorator 


53 Salesmen, Real Estate/ Air Force Officer/ Legislator/ School Super 65 
52 Salesmen, Life 31/ Personnel Director/ Elem Teacher/ Army Officer/ CPA 44/ Navy Officer/ Banker 





64/ Salesmen, Life 66 


51 YMCA PD/ Corp Pres 35/ Optometrist/ Banker 34/ Petroleum Engineer/ Math-Science Teacher 
50 Vet 66/ Public Admin/ Social Worker 53/ Advertising Men 
49 Production Manager/ Librarian/ Computer Programmer/ Generals and Admirals/ Physical Therapist/ 


Music Teacher 


48 Priest/ School Super 30/ Printer/ Urologist/ Policemen/ Lawyer 49/ Minister 65/ Corp Pres 65/ 


Physiatrist/ Student Personnel/ Governor 


47 Radiologist/ Carpenter/ Farmer 36/ Pilot /Minister 27/ Animal Husbandry Prof/ Social Worker 67/ 





Farmer 67/ Osteopath/ Orthopedic Surgeon/ Astronaut/ Vet 49 


46 Psychologist 47/ Pediatrician/ Unitarian Minister/ Forest Service Men/ Musician 
45 Sociologist/ Chemist/ Economist/ Dentist/ Psychiatrist/ Psychologist 49/ Neurological Surgeon 
44 Judge/ Psychologist 31/ Engineer/ Surgeon/ Mpls Symphony/ Lawyer 27/ Newsmen/ Physician 49/ 


Internist 
43 Architect/ Biologist 


42 Physician 27/ Psychologist (Exp)/ McKnight Fellow/ Political Scientist 


41 Photographer/ Pathologist/ Mathematician 


40 Danforth Fellow/ Physicist 67 

39 Physicist 27/ Authors/ Anthropologist 
38 Pulitzer P-ize/ Artist 

Sih 

36 NIAL Member 

35 


Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 


following samples indicate appropriate year. 


Berdie’s report indicates that there were sub- 
stantial relationships between their high school 
SVIB profile and the curriculum they selected. 
Schletzer, for her doctoral dissertation, lo- 
cated and retested these same students approxi- 
mately 4 yr. after they graduated from the 
University of Minnesota, roughly 8-10 yr. after 
the initial testing. Although these groups were 
small, approximately 30 in each of the six 


curricula, the results are very important as no 
other occupational samples have been tested 
before college and retested several years later 
after entry into the occupation. 

The relevant data are presented in Tables 57 
and 58. In Table 57 are listed, for the total 
group, test and retest means, standard devia- 
tions, and test-retest correlations. For each of 
the occupational subgroups, the test and retest 


M stand- 
ard score 


70 
69 
68 
67 
66 
65 
64 
63 
62 
61 
60 
59 
58 
57 
56 
55 
54 


53 


Se 


51 


50 
49 


48 
47 


46 


45 
44 
43 
42 
41 
40 
39 
38 
37 
36 
So 
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TABLE 44 


MAtE OccuPaATIONsS MEAN SCORES ON THE MiILirary SCALE 








Sample 





Army Officer/ Generals and Admirals 


Policemen 
Air Force Officer 


YMCA PD/ Pilot 


Navy Officer/ Astronaut/ Governor 

Comm Rec Director 

Salesmen, 3M applicants/ Rehab Counselor/ Public Admin/ Credit Manager/ Office Worker/ Sales- 
men, PG&E/ YMCA Staff/ Legislator/ Salesmen, Computer/ Physical Therapist 

Salesmen, 3M/ Vet 66/ Cham Comm Exec/ Engineer/ Personnel Director/ Salesmen, Auto/ Optome- 
trist/ CPA 44/ Farmer 67/ Osteopath/ Vet 49 

Accountant/ Soc Sci Teacher/ Salesmen, Steel/ Production Manager/ Farmer 36/ Purch Agent/ 
Funeral Director/ Lawyer 27/ School Super 65/ Banker 64/ Bus Ed Teacher/ Petroleum Engineer/ 
Math-Science Teacher 

Salesmen, Real Estate/ Pharmacist/ Salesmen, Life 31/ Guidance Counselor/ Carpenter/ Salesmen, 
Encyclopedia/ Elem Teacher/ Forest Service Men/ Sales Manager/ Dentist/ Animal Husbandry 
Prof/ Orthopedic Surgeon/ Advertising Men 

Architect/ YMCA Sec/ Physician 27/ School Super 30/ Printer/ CPA 65/ Urologist/ Banker 34/ 
Lawyer 49/ Physician 49/ Physiatrist/ Salesmen, Life 66/ Music Teacher/ Student Personnel 

Judge/ Priest/ Social Worker 53/ Chemist/ Surgeon/ Computer Programmer/ Corp Pres 35/ News- 
men/ Psychiatrist/ Social Worker 67/ Neurological Surgeon 

Psychologist 31/ Radiologist/ Biologist/ Psychologist (Exp) / Minister 27/ Pathologist/ Authors 
Psychologist 47/ Librarian/ Minister 65/ Physicist 27/ Psychologist49/ Musician/ Internist/ Interior 
Decorator 

Pediatrician/ Unitarian Minister/ Economist/ Photographer/ Political Scientist/ Pulitzer Prize/ 
Artist/ Mathematician 

Sociologist/ Corp Pres 65/ Physicist 67/ Anthropologist 

Mpls Symphony/ Danforth Fellow/ NIAL Member 


McKnight Fellow 





Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


means are presented, and the mean changes Table 57, Table 58 was prepared. In this table 
larger than 4 points are identified. Four points, are listed for each subgroup: 
which is four-tenths of a standard deviation, ; ’ 
is probably the smallest practical difference 1. Their three highest scores as high school 
worth attending to among these small samples. seniors. 

As an aid to interpretation of the data in 2. Their three highest scores on retest, when 
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TABLE 45 


MALE OccuPpATIONS MEAN SCORES ON THE Music SCALE 











M stand- 
ard score 


70 
69 
68 
67 
66 
65 
64 
63 
62 
61 
60 
59 
58 
Sy 
56 
oS 


54 
53 


52 


51 


Sample 


Music Teacher 


McKnight Fellow/ Musician/ Mpls Symphony 


Danforth Fellow 
NIAL Member 


Librarian/ Minister 65 

Unitarian Minister/ Minister 27/ Physiatrist 

Psychologist 47/ Pediatrician / Economist/ Photographer/ Political Scientist/ Pulitzer Prize 

Priest/ Sociologist/ Social Worker 53/ Elem Teacher/ Newsmen/ Psychiatrist/ Interior Decorator/ 
Anthropologist 

YMCA Sec/ Psychologist 31/ Biologist/ Social Worker 67/ Artist/ Pathologist/ Internist 

Radiologist/ Psychologist (Exp)/ Computer Programmer/ Physical Therapist/ Psychologist 49/ 
Physicist 67/ Orthopedic Surgeon/ Neurological Surgeon/ Student Personnel 

Salesmen, 3M applicants/ Cham Comm Exec/ Architect/ Soc Sci Teacher/ YMCA PD/ Credit Man- 
ager/ Printer/ Urologist/ Surgeon/ Optometrist/ Comm Rec Director/ Salesmen, Computer/ School 
Super 65/ Physician 49/ Bus Ed Teacher/ Mathematician / Math-Science Teacher/ Advertising Men 
Public Admin/ Accountant/ Chemist/ Office Worker/ Guidance Counselor/ Personnel Director/ 
Salesmen, Encyclopedia/ YMCA Staff/ Legislator/ CPA 44/ Physicist 27/ Salesmen, Life 66/ Osteo- 
path 

Rehab Counselor/ Salesmen, 3M/ Pharmacist/ Salesmen, Steel/ Salesmen, PG&E/ School Super 30/ 
CPA 65/ Carpenter/ Salesmen, Auto/ Army Officer/ Generals and Admirals/ Judge/ Navy Officer/ 
Corp Pres 65/ Animal Husbandry Prof/ Astronaut/ Authors 

Physician 27/ Salesmen, Life 31/ Engineer/ Production Manager/ Policemen/ Pilot/ Dentist/ Fu- 
neral Director/ Air Force Officer/ Banker 64 

Vet 66/ Farmer 36/ Purch Agent/ Forest Service Men/ Sales Manager/ Banker 34/ Lawyer 49/ 
Corp Pres 35/ Lawyer 27/ Petroleum Engineer/ Vet 49 

Salesmen, Real Estate/ Governor 


Farmer 67 





Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


they were roughly 3-4 yr. beyond their final Although, again, there is no one index to de- 


degree. 


: termine the predictive efficiency of these scales, 


3. The three largest gains for each sample, the results are reassuringly meaningful. The 
which frequently were not among the highest — scales that each group scored highest on, both 
scores on either testing. at test and retest, are, with a few exceptions, 
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TABLE 46 


MALE OccuPATIONS MEAN SCORES ON THE NATURE SCALE 








M stand- Sample 
ard score 

70 

69 

68 

67 

66 

65 

64 

63 

62 

61 

60 

59 Farmer 36/ Animal Husbandry Prof 

58 Forest Service Men 

Oe Vet 49 

56 Vet 66/ Biologist 

55 Orthopedic Surgeon 

$4 Radiologist/ Urologist/ Physiatrist 

53 Pediatrician/ Surgeon/ Minister 27/ Physical Therapist/ Farmer 67/ Osteopath/ Pathologist/ 
Neurological Surgeon 

52 Public Admin/ YMCA Sec/ Dentist/ Physician 49/ Psychiatrist/ Artist/ Internist/ NIAL Nember 

Si YMCA PD/ Physician 27/ Comm Rec Director/ Generals and Admirals/ Math-Science Teacher 

50 Rehab Counselor/ Architect/ Chemist/ School Super 30/ Unitarian Minister/ Guidance Counselor/ 
Mpls Symphony/ School Super 65/ Anthropologist/ Governor 

49 Psychologist 47/ Soc Sci Teacher/ Social Worker 53/ Psychologist 31/ Credit Manager/ Psychologist 
(Exp)/ Personnel Director/ Carpenter/ Elem Teacher/ Policemen/ Librarian/ Pulitzer Prize/ Army 
Officer/ Banker 34/ Legislator/ CPA 44/ Psychologist 49/ Music Teacher/ Interior Decorator 

48 Pharmacist/ Sociologist/ Accountant/ Engineer/ McKnight Fellow/ Pilot/ Optometrist/ Lawyer 49/ 
Corp Pres 35/ Funeral Director/ Minister 65/ Physicist 27/ Bus Ed Teacher/ Musician/ Physicist 67/ 
Astronaut/ Mathematician/ Authors/ Student Personnel 

47 Priest/ Office Worker/ Salesmen, Life 31/ Salesmen, PG&E/ Printer/ Production Manager/ Photog- 
rapher/ Sales, Encyclopedia/ Judge/ YMCA Staff/ Computer Programmer/ Corp Pres 65/ Banker 64/ 
Social Worker 67/ Petroleum Engineer 

46 Salesmen, Real Estate/ Economist/ Sales Manager/ Danforth Fellow/ Navy Officer 

45 Cham Comm Exec/ Salesmen, Steel/ CPA 65/ Purch Agent/ Political Scientist/ Air Force Officer/ 


44 
43 
42 
41 
40 
39 
38 
37 
36 
35 


Lawyer 27/ Newsmen/ Advertising Men 
Salesmen, Life 66 

Salesmen, Auto/ Salesmen, Computer 
Salesmen, 3M 

Salesmen, 3M applicants 








Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


those most related to their work. The excep- 
tions were almost all due to the Adventure 
scale; this was among the three highest scores 
for five of the samples at test, and remained 
among the highest for three at the retest. This 
personality scale just does not operate as do 
the vocational interests scales. Other than this 


scale, the other high scores were appropriate. 
The accountants scored highest on the busi- 
ness-oriented scales: Office Practices, Sales, 
and Merchandising; the dentists and physi- 
cians scored highest on the Medical Service 
and Science scales, and so forth. 

The patterns of high scores on both the test 
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TABLE 47 
MALE Occupations MEAN SCORES ON THE OFFICE PRACTICES SCALE 
M stand- 
ard score neue 
70 
69 
68 
67 
66 
65 Bus Ed Teacher 
64 
63 
62 
61 Baker 34/ Banker 64 
60 
59 Office Worker 
58 Accountant/ Soc Sci Teacher/ Credit Manager 
57 
56 
55 Rehab Counselor/ Guidance Counselor/ Comm Rec Director/ Math-Science Teacher 
54 Salesmen, 3M/ YMCA Sec/ Salesmen, PG&E/ CPA 65/ CPA 44/ School Super 65 
53 Public Admin/ Purch Agent/ Policemen/ Legislator/ Funeral Director 
52 Salesmen, 3M applicants/ Pharmacist/ Farmer 67/ Governor 
51 Vet 66/ Cham Comm Exec/ YMCA PD/ School Super 30/ Printer/ Carpenter/ Elem Teacher/ 
Salesmen, Auto/ Librarian/ Army Officer/ YMCA Staff/ Minister 27/ Air Force Officer 
50 Salesmen, Real Estate/ Social Worker 53/ Salesmen, Steel/ Personnel Director/ Farmer 36/ Forest 
Service Men/ Optometrist/ Navy Officer/ Minister 65/ Music Teacher 
49 Priest/ Salesmen, Encyclopedia/ Sales Manager/ Computer Programmer/ Lawyer 49/ Generals and 
Admirals/ Physical Therapist/ Petroleum Engineer/ Vet 49 
48 Salesmen, Life 31/ Production Manager/ Salesmen, Computer/ Corp Pres 35/ Physiatrist/ Animal 
Husbandry Prof/ Osteopath/ Interior Decorator/ Student Personnel 
47 Urologist/ Social Worker 67 
46 Psychologist 47/ Sociologist/ Radiologist/ Pediatrician/ Unitarian Minister/ Pilot/ Dentist/ Mathe- 
matician 
45 Judge/ Psychologist 31/ Chemist/ Lawyer 27/ Corp Pres 65/ Psychologist 49/ Musician/ Orthopedic 
Surgeon 
44 Biologist/ Economist/ Political Scientist/ Mpls Symphony/ Physician 49/ Psychiatrist/ Advertising 
Men 
43 Engineer/ Psychologist (Exp)/ Newsmen/ Physicist 27/ Salesmen, Life 66/ Internist/ Astronaut/ 
Neurological Surgeon 
42 Physician 27/ Surgeon/ Pathologist 
Al Architect/ McKnight Fellow/ Pulitzer Prize/ Physicist 67 
40 Danforth Fellow/ Authors/ Anthropologist 
39 Photographer/ NIAL Member 
38 
37 Artist 
36 
35 





Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


and retest administrations again make it very 
clear that the scales have a substantial rela- 
tionship to the individual’s occupational 
choice. 

The gains reported in Table 58 are worthy 
of note also as they indicate areas of interests 
showing the greatest increase over the college 


years, irrespective of initial level of score. In 
general, the changes should please university 
faculties. The largest gain for all six groups was 
on the Teaching scale, ranging from 7 points 
among the lawyers to 10 points among the 
dentists, a full standard deviation. All of the 
groups also showed increased scores on the Art, 
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Music, and Writing scales, areas particularly 
important to a liberal arts college faculty. 
Interestingly, this tendency was less pro- 
nounced among the accountants who were pri- 
marily students drawn from the Business 
School. However, the other non-liberal-arts 
sample—the mechanical engineers—showed 
the same increases as the other students. 

These increases are very similar to those 
found among the Stanford students in the re- 
liability study reported earlier, showing that 
interests in teaching and in cultural activities 
increase during the college years. The same 
pattern, though less pronounced, appeared in 
the Harvard test-retest profile in Table 32. 
These highly selected students had higher 
scores in these areas when initially tested, and 
their scores did not increase as much as the 
Minnesota and Stanford students. 


Salsemen and Scientists 


Further predictive validity information has 
been developed by studying two subsets of in- 
dividuals in the norm group. As these men have 
been tested twice, once in 1930 as 16-yr.-olds 
and again in 1966 as 52-yr.-olds, it has been 


SALESMEN 


SALES 


SCIENCE 


The test (age 16) and retest (age 52) SVIB 
basic scale profiles for these two groups are 
presented in Fig. 3. 

As the mean profiles indicate, the Sales and 
Science scales separated these groups fairly 
well at age 16, and by age 52 the difference had 
increased. At age 16, the differences were 
roughly 1 SD, increasing to about 1.5 or 2 SDs 
between the adults. The larger differences be- 
tween the adult samples were created because 
the salesmen increased more on the sales- 
oriented scales, the scientists on the science- 
oriented scales, not because either of them had 
decreasing scores. Whether these increases 





possible to study how their test results relate 
to their adult occupations. 

Of the 1,943 boys tested in 1930, 1,214 (63%) 
have been retested in 1966-67. Twenty-five 
percent could not be located currently, 4% 
refused to cooperate in this study, and 8% 
were deceased. (When these basic scales were 
normed, only 647 retests were available.) 

When retested as adults, these men were 
asked to fill in a short checklist describing 
their jobs. This checklist contained “semantic 
differential” type items, that is, pairs of state- 
ments reflecting opposite extremes, and each 
man was asked to indicate where his job fell on 
a line between these pairs. Two of the items 
were “sales versus nonsales” and ‘‘science 
versus nonscience.” From the total sample, two 
subgroups of individuals were identified, using 
their answers to these two items. The first 
included 101 men who marked the opposing 
extremes, ‘‘sales, nonscience,” as descriptive of 
their jobs; the other sample included 42 men 
who marked the other extremes, “‘science, non- 
sales.”” The following diagram shows graphi- 
cally how these groups were selected and re- 
ports the number in each group: 


SCIENTIST 


NONSALES 


NONSCIENCE 


came before or after the occupational ex- 
perience cannot be determined from these 
data; the latter seems more plausible. 

These mean differences on the basic scales, 
though large and statistically far beyond the 
chance level, were not as large as the differ- 
ences between these two samples on the regular 
SVIB occupational scales. For example, on the 
SVIB Chemist scale, the samples had the fol- 
lowing test and retest means—test: Scientist 
M = 39, Salesman M = 26, Du = 13; retest: 
Scientist M = 43, Salesman M = 17, Dy = 26. 
On the SVIB Sales Manager scale, the samples 
had the following test and retest means—test : 
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TABLE 48 


MALE OccuPATIONS MEAN SCORES ON PUBLIC SPEAKING SCALE 





ee el 
M stand- 
ard score 


70 
69 
68 
67 Legislator 

66 Governor 

65 Unitarian Minister/ Minister 65 
64 


Sample 


63 Priest/ Cham Comm Exec/ Salesmen, Real Estate/ Political Scientist/ Salesmen, Computer 

62 Salesmen, 3M applicants/ Minister 27/ School Super 65/ Salesmen, Life 66 

61 Salesmen, 3M/ Comm Rec Director/ Danforth Fellow/ Social Worker 67 

60 YMCA Sec/ Social Worker 53/ YMCA Staff/ Judge/ Newsmen/ Student Personnel 

59 Sociologist/ Economist/ Elem Teacher/ Lawyer 49 

58 Soc Sci Teacher/ Salesmen, Steel/ Guidance Counselor/ Generals and Admirals/ Navy Officer/ 


Astronaut 


Si Rehab Counselor/ Credit Manager/ School Super 30/ Air Force Officer/ McKnight Fellow/ Lawyer 27 
56 Public Admin/ CPA 65/ Optometrist/ Corp Pres 65/ Music Teacher/ Bus Ed Teacher 

55 Salesmen, Life 31/ Librarian / Pulitzer Prize/ Army Officer/ Physiatrist/ Animal Husbandry Prof 

54 Psychologist 47/ YMCA PD/ Salesmen, PG&E/ Personnel Director/ Computer Programmer/ Phy- 


sical Therapist/ Psychologist 49/ Petroleum Engineer/ Neurological Surgeon 


53 Vet 66/ Pediatrician/ Sales Manager/ Psychiatrist / Orthopedic Surgeon/ Anthropologist 
aD, Psychologist 31/ Funeral Director/ Biologist/ Urologist/ Salesmen, Auto/ Physicist 67/ Internist/ 


Math-Science Teacher/ NIAL Meml-er 





oi Salesmen, Real Estate/ Accountant/ Radiologist/ Photographer/ Psychologist (Exp)/ Surgeon/ 
Policemen/ CPA 44/ Banker 64/ Osteopath/ Authors/ Interior Decorator/ Advertising Men 
50 Office Worker/ Printer/ Production Manager/ Purch Agent/ Mpls Symphony/ Banker 34/ Phy- 


sician 49/ Musician/Pathologist 
49 Pharmacist/ Farmer 36/ Pilot/ Vet 49 


48 Physician 27/ Corp Pres 35/ Forest Service Men/ Dentist/ Mathematician 


47 Chemist/ Engineer/ Physicist 27 
46 Architect/ Carpenter 
45 Farmer 67 

44 Artist 

43 

42 

At 

40 

39 

38 

37 

36 

35 





Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34), Thus numbers 


following samples indicate appropriate year. 


Salesman M = 35, Scientist M = 27, Dy = 8; 
retest: Salesman M = 41, Scientist M = 20, 
Dy = 21. 

These larger differences, exceeding 2 SDs 
among the adult samples, can probably be 
traced to the relative heterogeneity of the 
occupational scales compared to the basic 
scales. Each of these two occupational scales, 
Chemist and Sales Manager, contains both 


science and sales components. In the Chemist 
scale, these components are weighted posi- 
tively and negatively, respectively; in the 
Sales Manager scale, the reverse is true. Thus 
the occupational scales take advantage of both 
components for maximum separation, while 
the basic scales sacrifice some of this differ- 
entiation for easier interpretability. Note again 
that there were differential increases (and, 
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TABLE 49 
MALE OccuPATIONS MEAN SCORES ON THE RECREATIONAL LEADERSHIP SCALE 
M stand- Sample 
ard score 
70 
69 
68 
67 
66 
65 
64 
63 
62 
61 YMCA PD 
60 
59 Salesmen, 3M applicant/ Comm Rec Director 
58 Salesmen, 3M/ YMCA Staff/ Physical Therapist 
Sf 
56 
35 Policemen 
34 Salesmen, Steel/ Elem Teacher/ Salesmen, Auto/ Salesmen, Computer 
53 Soc Sci Teacher/ Salesmen, Life 66/ Social Worker 67 
52 Credit Manager/ CPA 65/ Pilot/ Army Officer/ Legislator/ Air Force Officer/ Math-Science Teacher 
ol YMCA Sec/ Salesmen, PG&E/ Forest Service Men/ School Super 65/ Osteopath/ Petroleum Engi- 
neer/ Orthopedic Surgeon/ Astronaut/ Vet 49 
50 Vet 66/ Cham Comm Exec/ Printer/ Guidance Counselor/ Urologist/ Computer Programmer/ Gen- 
erals and Admirals/ CPA 44/Newsmen/ Bus Ed Teacher 
49 Rehab Counselor/ Social Worker 53/ Office Worker/ Salesmen, Life 31/ Personnel Director/ Salesmen, 
Encyclopedia/ Optometrist/ Dentist/ Navy Officer/ Minister 65/ Banker 64/ Physiatrist/ Neurologi- 
cal Surgeon 
48 Priest/ Salesmen, Real Estate/ Pharmacist/ Public Admin/ Accountant/ Radiologist/ Pediatrician/ 
Production Manager/ Carpenter/ Farmer 36/ Surgeon/ Purch Agent/ Sales Manager/ Animal Hus- 
bandry Prof/ Farmer 67/ Student Personnel/ Governor 
47 School Super 30/ Unitarian Minister/ Lawyer 49/ Minister 27/ Funeral Director/ Lawyer 27/ Corp 
Pres 65/ Physician 49/ Psychiatrist/ Musician 
46 Psychologist 47/ Physician 27/ Chemist/ Banker 34/ Advertising Men 
45 Sociologist/ Engineer/ Danforth Fellow/ Corp Pres 35/ Music Teacher/ Internist 
44 Psychologist 31/ Photographer/ Mpls Symphony/ Judge/ Pathologist 
43 Architect/ Biologist/ Economist/ McKnight Fellow/ Pulitzer Prize/ Psychologist 49/ Artist/ Authors 
42 Psychologist (Exp)/ Political Scientist/ Librarian 
41 Physicist 67/ Anthropologist 
40 Physicist 27 
39 Mathematician/ Interior Decorator 
38 NIAL Member 
37 
36 
30) 








Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34), Thus numbers 
following samples indicate appropriate year. 


contrary to the findings on the basic scales, 
actual decreases on the nonappropriate scale) 
on these scales for these two differing samples. 


Construct Validity, Interest Constellations, and 
Implications 


All three types of validity information re- 
ported here, content, concurrent, and predic- 


tive, support the conclusion that these scores 
are related to occupational choice. Collectively, 
they also argue for the existence of the fourth 
type of validity that psychologists have found 
useful: construct validity. Construct validity 
refers to the concepts that underlie test be- 
havior, concepts which may not be directly ac- 
cessible for measurement but which provide 


M stand- 


ard score 


75 
74 
73 
72 
71 
70 
69 
68 
67 
66 
65 
64 
63 
62 
61 
60 
59 


58 
Sib 
56 
55 
54 
53 


a 
51 


50 
49 


48 
47 
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TABLE 50 


MALE Occupations MEAN SCORES ON RELIGIOUS ACTIVITIES SCALE 








Sample 


YMCA Sec/ Minister 27 
Minister 65 
YMCA Staff 


Priest 
YMCA PD 


Unitarian Minister 


Comm Rec Director/ Legislator 

Rehab Counselor/ Phsyical Therapist 

Salesmen, 3M/ Cham Comm Exec/ Social Worker 53/ Guidance Counselor/ School Super 65/ 
Student Personnel 

Salesmen, 3M applicant/ Credit Manager/ Salesmen, Encyclopedia/ Elem Teacher/ Danforth Fellow/ 
Salesmen, Life 66/ Social Worker 67/ Music Teacher 

Soc Sci Teacher 

Salesmen, Steel/ Optometrist/ Banker 64/ Physiatrist/ Bus Ed Teacher 

Sociologist/ School Super 30/ CPA 65/ Funeral Director/ Farmer 67 

Vet 66/ Librarian/ Salesmen, Computer/ Math-Science Teacher/ Governor 

Computer Programmer/ Judge/ Lawyer 49/ Air Force Officer/ Animal Husbandry Prof/ Petroleum 
Engineer 

Salesmen, Life 31/ Policemen/ Mpls Symphony/ Navy Officer/ Astronaut/ Vet 49 

Public Admin/ Radiologist/ Office Worker/ Personnel Director/ Salesmen, Auto/ Banker 34/ Polit- 
ical Scientist/ Newsmen/ Generals and Admirals/ Orthopedic Surgeon/ Neurological Surgeon 
Pharmacist/ Accountant/ Pediatrician/ Biologist/ Urologist/ Economist/ Pulitzer Prize/ Army 
Officer/ CPA 44/ Corp Pres 65/ Musician/ Osteopath 

Psychologist 47/ Salesmen, PG&E/ Printer/ Production Manager/ Farmer 36/ McKnight Fellow/ 
Sales Manager/ Physician 49/ Interior Decorator 

Carpenter/ Purch Agent/ Corp Pres 35/ Psychologist 49/ Mathematician/ NIAL Member 
Salesmen, Real Estate/ Engineer/ Photographer/ Surgeon/ Forest Service Men/ Dentist/ Lawyer 27/ 
Physicist 27/ Psychiatrist 

Pilot/ Physicist 67/ Pathologist/ Internist 

Architect/ Physician 27/ Chemist/ Authors/ Anthropologist/ Advertising Men 

Psychologist 31/ Psychologist (Exp) 

Artist 








Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34), Thus numbers 
following samples indicate appropriate year. 


some explanatory power for the parallel and to suggest a mechanism to explain an indi- 
features of test responses and actual behavior. vidual’s answers to these items. 

What follows here is a brief attempt to make The underlying concepts here are the related 
psychological sense of these clusters of interests _ constellations of interests. Somehow these con- 
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TABLE 51 


MALE OccuPpATIONS MEAN SCORES ON THE SALES SCALE 


M stand- 
ard score 


70 

69 

68 

67 Salesmen, 3M applicants 

66 Salesmen, 3M 

65 

64 Salesmen, PG&E/ Salesmen, Encyclopedia 
63 Salesmen, Auto/ Salesmen, Computer 

62 


Sample 


61 Salesmen, Steel/ Salesmen, Life 31/ Salesmen, Life 66 


60 


59 Salesmen, Real Estate/ Sales Manager/ Bus Ed Teacher 


58 
Bi 
56 YMCA Sec/ Credit Manager/ Legislator 


55 Pharmacist/ Cham Comm Exec/ Funeral Director 


54 Office Worker/ Guidance Counselor 


53 Rehab Counselor/ Soc Sci Teacher/ YMCA Staff/ School Super 65 
52 YMCA PD/ Corp Pres 35/ Air Force Officer/ Purch Agent/ Comm Rec Director/ Banker 34/ Ad- 


vertising Men/ Banker 64/ Farmer 67 


Sil Vet 66/ Corp Pres 65/ Petroleum Engineer/ Vet 49 

50 Accountant/ Social Worker 53/ CPA 65/ Production Manager/ Personnel Director/ Carpenter/ Op- 
tometrist/ Physical Therapist/ Music Teacher/ Interior Decorator/ Math-Science Teacher/ Governor 

49 School Super 30/ Farmer 36/ Elem Teacher/ Policemen/ Minister 27/ Navy Officer/ Minister 65 

48 Priest/ Printer/ Army Officer/ CPA 44/ Physiatrist/ Social Worker 67/ Osteopath 

47 Public Admin/ Radiologist/ Unitarian Minister/ Urologist/ Forest Service Men/ Pilot/ Computer 
Programmer/ Lawyer 49/ Animal Husbandry Prof/ Musician/ Orthopedic Surgeon 

46 Psychologist 47/ Dentist/ Generals and Admirals/ Astronaut/ Neurological Surgeon/ Student Per- 


sonnel 


45 Pediatrician/ Chemist/ Engineer/ Surgeon/ Librarian/ Physician 49/ Psychiatrist/ Psychologist 49 
44 Sociologist/ Mpls Symphony/ Judge/ Newsmen/ Lawyer 27/ Internist 

43 Architect/ Psychologist 31/ Physician 27/ Biologist/ Economist/ Photographer 

42 Psychologist (Exp)/ McKnight Fellow/ Political Scientist/ Danforth Fellow/ Pathologist/ Mathe- 


matician 


41 Pulitzer Prize/ Physicist 27/ Physicist 67/ Artist/ Authors 


40 Anthropologist 
39 NIAL Member 
38 
Si 
36 
35 


Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 


following samples indicate appropriate year. 


stellations dictate, at least to some degree, the 
individual’s feeling toward activities which 
fall within the bounds of the constellation. It 
might be fruitful in future research to consider 
an individual’s answer to a specific item (and 
thus his feeling toward a specific behavior) as 
a combination of his perception of which con 
stellation an item belongs to and of his attrac- 
tion toward that constellation generally. Thus, 
the two salient variables are his evaluation of 
constellation membership for an item and his 


attraction for the activities within the constel- 
lation—in a convenient shorthand, Perception 
x Attraction. 

A clarifying example: A salesman would 
answer Like to the item “Sell Fuller brushes 
door-to-door” because he would perceive that 
as a sales item and he likes sales activities; a 
scientist would mark it Dislike because he also 
perceives it as a sales item and dislikes those 
activities. But if confronted with the item, 
“Work for the National Academy of Sciences, 
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TABLE 52 


MALE OccuPATIONS MEAN SCORES ON THE SCIENCE SCALE 


M stand- 
ard score 


62 Physicist 27/ Physicist 67 
61 Chemist 
60 Biologist/ Psychologist (Exp)/ Pathologist 








Sample 


59 Psychologist 31/ Animal Husbandry Professor/ Neurological Surgeon 
58 Pediatrician/ Urologist/ Internist/ Mathematician/ Math-Science Teacher 
ST Radiologist/ Engineer/ Air Force Officer/ Physiatrist/ Psychiatrist/ Psychologist 49/ Orthopedic 


Surgeon/ Astronaut 
56 Psychologist 47/ Surgeon/ Anthropologist 


55 Physician 27/ Economist/ Optometrist/ Computer Programmer/ Danforth Fellow/ Physician 49/ 


Physical Therapist/ Petroleum Engineer 
54 Army Officer 


39 Pilot/ Dentist/ Generals and Admirals/ Osteopath 
52 Vet 66/ Pharmacist/ Public Admin/ Architect/ Sociologist/ School Super 30/ Mpls Symphony/ Sales- 


men, Computer/ CPA 44/ Navy Officer 


51 Rehab Counselor/ YMCA PD/ Unitarian Minister/ Production Manager/ Personnel Director/ Elem 
Teacher/ Librarian/ Minister 27/ NIAL Member/ Student Personnel 

50 Accountant/ Guidance Counselor/ Corp Pres 35/ Farmer 36/ Forest Service Men/ Political Scientist / 
Policemen/ School Super 65/ Corp Pres 65/ Musician/ Artist/ Vet 49 

49 YMCA Sec/ Social Worker 53/ Credit Manager/ Salesmen, PG&E/ Printer/ Photographer/ Judge/ 


Laywer 49 


48 Salesmen, 3M/ Priest/ Salesmen, Steel/ Office Worker/ CPA 65/ Purch Agent/ Pulitzer Prize/ Comm 
Rec Director/ Lawyer 27/ Music Teacher/ Bus Ed Teacher 
47 Soc Science Teacher/ Carpenter/ McKnight Fellow/ Sales Manager/ YMCA Staff/ Legislator/ 


Minister 65/ Social Worker 67/ Governor 


46 Salesmen, 3M applicant/ Cham Comm Exec/ Advertising Men/ Banker 64/ Author 
45 Salesmen, Encyclopedia/ Salesmen, Auto/ Newsmen 
44 Salesmen, Life 31/ Funeral Director/ Banker 34 


43 Salesmen, Real Estate 


42 Salesmen, Life 66/ Farmer 67/ Interior Decorator 


Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see;Table 34), Thus numbers 


following samples indicate appropriate year. 


appearing before Congress, private founda- 
tions, and other prestige organizations, selling 
the idea that science must be broadly sup- 
ported by our society,” they both might 
answer Like as one would perceive it as a sales 
activity, the other as science. In this illustra- 
tion, their perceptions might be modified sub- 
stantially by substituting other verbs for 


“selling,” such as persuading or, better yet, 


educating or informing. 

This formulation of organizing constellations 
as the constructs underlying interest measure- 
ment is undoubtedly too simple, and only par- 
tially correct. Yet it does provide a way to 
begin to understand more than the empirical 
facts, to lead us out of the wilderness of means 
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TABLE 53 


MALE OccuPATIONS MEAN SCORES ON THE SOCIAL SERVICE SCALE 








Sample 


Rehab Counselor/ Priest/ Unitarian Minister/ Guidance Counselor 


Salesmen, 3M applicants/ Sociologist/ Elem Teacher/ Danforth Fellow/ School Super 65/ Music 


Cham Comm Exec/ Credit Manager/ Librarian/ Physiatrist/ Salesmen, Life 66/ Math-Science 
Pediatrician/ Salesmen, Life 31/ Policemen/ Political Scientist/ Mpls Symphony/ Funeral Director/ 
Pharmacist/ Public Admin/ Salesmen, Steel/ Personnel Director/ Optometrist/ Lawyer 49/ Psychol- 
Vet 66/ Salesmen, PG&E/ CPA 65/ Economist/ McKnight Fellow/ Newsmen/ Banker 64/ Farmer 
Judge/ Office Worker/ Printer/ Carpenter/ Salesmen, Auto/ Banker 34/ Salesmen, Computer/ 


Accountant/ Radiologist/ Urologist/ Photographer/ Forest Service Men/ Computer Programmer/ 
Corp Pres 35/ Air Force Officer/ Physician 49/ Orthopedic Surgeon/ Interior Decorator/ NIAL 


Salesmen, Real Estate/ Production Manager/ Psychologist (Exp)/ Farmer 36/ Sales Manager/ 
Pulitzer Prize/ Lawyer 27/ Corp Pres 65/ Anthropologist/ Neurological Surgeon 

Psychologist 31/ Physician 27/ Biologist/ Surgeon/ Purch Agent/ Dentist/ Navy Officer/ Animal 
Husbandry Prof/ Petroleum Engineer/ Pathologist/ Advertising Men 


Architect/ Chemist/ Engineer/ Pilot/ Generals and Admirals/ Physicist 27/ Physicist 67/ Astro- 


48 
M stand- 
ard score 
70 
69 YMCA Sec 
68 
67 Social Worker 53/ YMCA Staff/ Minister 65 
66 Social Worker 67 
65 YMCA PD/ Minister 27 
64 
63 
62 Soc Sci Teacher 
61 Psychologist 47/ Comm Rec Director/ Student Personnel 
60 Salesmen, 3M/ Legislator/ Physical Therapist 
59 
Teacher 
58 Salesmen, Encyclopedia/ Bus Ed Techer 
57 School Super 30 
56 
Teacher/ Governor 
55 
Psychiatrist 
54 
ogist 49/ Musician/ Vet 49 
Bie 
67/ Osteopath 
52 
CPA 44/ Internist 
$1 
Member 
50 
49 
48 Army Officer/ Mathematician 
47 
naut/ Authors 
46 Artist 
45 
44 
43 
42 
41 
40 
39 
38 
37 
36 
35 


Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


and correlations, and it has some immediate agreement as to what an item means should be 
implications for the techniques of measurement: very high. This can be achieved in several 


ways: by using short familiar items; by using 


1. If item perception is important, then item longer very specific items; or by using powerful 
clarity is a paramount concern. Interindividual stereotypes such as occupational titles. 
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TABLE 54 


MALE OccuPpATIONS MEAN SCORES ON THE TEACHING SCALE 


M stand- 
ard score 


70 

69 

68 

67 

66 Music Teacher 
65 

64 

63 Elem Teacher 


Sample 


62 Librarian/ Danforth Fellow/ Minister 65/ School Super 65 
61 School Super 30/ Unitarian Minister/ Guidance Counselor/ Minister 27/ Bus Ed Teacher/ Student 


Personnel 


60 Rehab Counselor/ Soc Sci Teacher/ Social Worker 67 
59 Priest/ Sociologist/ Mpls Symphony/ Comm Rec Director/ YMCA Staff/ Physiatrist 
58 Social Worker 53/ Biologist/ McKnight Fellow/ Political Scientist/ Mathematician/ Math-Science 


Teacher/ Neurological Surgeon 


57 Psychologist 47/ Psychologist 31/ Pediatrician/ Legislator/ Physical Therapist/ Psychologist 49 
56 YMCA Sec/ Economist/ Psychiatrist/ Animal Husbandry Prof/ Musician/ Physicist 67/ Pathologist/ 


Internist/ Anthropologist 


oD YMCA PD/ Psychologist (Exp)/ Judge/ Salesmen, Encyclopedia/ Optometrist/ Pulitzer Prize/ 
Generals and Admirals/ Navy Officer/ Physicist 27/ Orthopedic Surgeon/ NIAL Member 

54 Salesmen, 3M applicant/ Cham Comm Exec/ Credit Manager/ Air Force Officer/ Surgeon/ Computer 
Programmer/ Lawyer 49/ Salesmen, Computer/ Newsmen 

53 Salesmen, 3M/ Radiologist/ Photographer/ Army Officer/ Salesmen, Life 66/ Astronaut 

52 Vet 66/ CPA 65/ Urologist/ Personnel Director/ Corp Pres 65/ Physician 49/ Governor 

51 Public Admin/ Chemist/ Petroleum Engineer/ Interior Decorator 


50 Physician 27/ Salesmen, Steel/ CPA 44 


49 Accountant/ Lawyer 27/ Banker 64/ Osteopath 
48 Salesmen, PG&E/ Printer/ Policemen/ Dentist/ Vet 49 
47 Pharmacist/ Architect/ Office Worker/ Salesmen, Life 31/ Production Manager/ Advertising Men/ 


Authors 


46 Engineer/ Corp Pres 35/ Carpenter/ Farmer 36/ Salesmen, Auto/ Forest Service Men/ Pilot/ Artist 


45 Sales Manager 

44 Funeral Director/ Banker 34/ Farmer 67 
43 Salesmen, Real Estate/ Purch Agent 
42 

41 

40 

39 

38 

37 

36 

35 





Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 


following samples indicate appropriate year. 


2. It suggests that forced-choice items are 
too complicated. To ask a person to sort out 
his perceptions of three items in a triad, then 
to determine how he feels about each of these 
and compare those feelings is creating an un- 
necessarily complex task for him. Worse, it 
means the same pattern of answers could result 
from several different psychological viewpoints. 
While that is inevitable, no matter what 


method is used, it should be minimized. 
(Forced-choice items may have some ad- 
vantages in other contexts—but they have yet 
to be demonstrated empirically.) 

3. The Perception X Attraction theory em- 
phasizes that, to this point in history, little 
attention has been paid to the perception por- 
tion in interest measurement—all of the em- 
phasis has been placed on the attraction aspect 
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TABLE 55 


MALE OccuPATIONS MEAN SCORES ON TECHNICAL SUPERVISION SCALE 





M stand- 
ard score 


70 

69 

68 

67 

66 

65 

64 

63 

62 

61 

60 

59 

58 

57 Air Force Officer/ Petroleum Engineer 
56 Production Manager 
55 Chemist/ Army Officer 


Sample 


54 Engineer/ Carpenter/ Pilot/ Salesmen, Computer 
53 Personnel Director/ Navy Officer/ Orthopedic Surgeon/ Astronaut 


52 Printer/ Neurological Surgeon 


51 Public Admin/ Radiologist/ Salesmen, Steel/ Salesmen, PG&E/ Corp Pres 35/ Urologist/ Computer 
Programmer/ Corp Pres 65/ Math-Science Teacher 
50 Salesmen, 3M applicant/ Salesmen, 3M/ Purch Agent/ Policemen/ Generals and Admirals/ CPA 44/ 


Physical Therapist/ Farmer 67 


49 Vet 66/ Accountant/ Pediatrician/ Credit Manager/ Optometrist/ Comm Rec Director/ Legislator/ 


Physiatrist/ Bus Ed Teacher 


48 Rehab Counselor/ Cham Comm/ Social Worker 53/ Office Worker/ CPA 65/ Psychologist (Exp)/ 
Surgeon/ Physicist 27/ Psychiatrist/ Psychologist 49/ Animal Husbandry Prof 

47 YMCA Sec/ YMCA PD/ Psychologist 31/ Guidance Counselor/ Economist/ Farmer 36/ Elem 
Teacher/ Salesmen, Auto/ Sales Manager/ YMCA Staff/ Physicist 67/ Osteopath/ Pathologist 

46 Psychologist 47/ Pharmacist/ Soc Sci Teacher/ Unitarian Minister/ Forest Service Men/ Dentist/ 
Lawyer 49/ Funeral Director/ School Super 65/ Banker 64/ Physician 49/ Social Worker 67/ Internist/ 


Vet 49 
45 Architect/ Biologist/ Student Personnel 


44 Priest/ Sociologist/ Physician 27/ Salesmen, Encyclopedia/ Librarian/ Banker 34/ Minister 27/ 


Minister 65/ Musician/ Mathematician 


43 Salesmen, Real Estate/ School Super 30/ Political Scientist/ Mpls Symphony/ Anthropologist 


42 Music Teacher/ Governor 


At Salesmen, Life 31/ Photographer/ Lawyer 27/ Advertising Men 
40 Danforth Fellow/ Newsmen/ Salesmen, Life 66/ Interior Decorator 


39 Judge/ Artist 

38 McKnight Fellow/ NIAL Member 
37 Pulitzer Prize/ Authors 

36 

35 


Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 


following samples indicate appropriate year. 


by requiring the Like, Indifferent, or Dislike 
response. More should be done on perception, 
at least enough to determine if this way of 
thinking has any merit. 

4, If the perception factor does prove useful, 
it might help explain the changes over time 
within a single individual as it would allow for 
two types of changes, one in perception, the 


other in actual attraction to the activities. 
Such a possibility is particularly appealing 
when studying teenage-to-adult samples as 
some of their differences may be traced to the 
teenagers’ difficulty with the vocabulary. Per- 
ception certainly includes within it under- 
standing; if a person does not understand an 
item, his perception will be affected. When 
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TABLE 56 


MALE OccuPATIONS MEAN SCORES ON THE WRITING SCALE 


M stand- 


ard score Sample 

70 

69 

68 

67 

66 

65 

64 Unitarian Minister/ Librarian/ Pulitzer Prize/ Newsmen 

63 McKnight Fellow/ Authors 

62 Political Scientist/ Danforth Fellow/ NIAL Member 

61 Minister 65 

60 Psychologist 47/ Judge/ Photographer/ Minister 27/ Sociologist 

59 Anthropologist/ Neurological Surgeon/ Social Worker 63 

58 Economist/ Elem Teacher/ Legislator/ Social Worker 67/ Music Teacher/ Student Personnel/ Ad- 
vertising Men/ Cham Comm Exec/ Psychologist 31 

yt Rehab Counselor/ Priest/ Mpls Symphony/ Lawyer 49/ School Super 65/ Physiatrist/ Psychiatrist/ 
Psychologist 49/ Governor/ YMCA Sec/ Pediatrician 

56 School Super 30/ Psychologist (Exp)/ Salesmen, Encyclopedia/ Lawyer 27/ Musician 

55 Salesmen, 3M applicants/ Salesmen, 3M/ Printer/ Guidance Counselor/ Biologist/ Personnel Di- 
rector/ Comm Rec Director/ Army Officer/ Generals and Admirals/ Artist/ Internist/ Interior Dec- 
orator/ Public Admin/ Soc Sci Teacher 

54 Credit Manager/ YMCA Staff/ Salesmen, Life 66/ Bus Ed Teacher/ Physicist 67/ Orthopedic Sur- 
geon/ Radiologist 

53 Urologist/ Surgeon/ Computer Programmer/ Salesmen, Computer/ Navy Officer/ Physical Thera- 
pist/ Pathologist/ YMCA PD 

52) Salesmen, Steel/ Office Worker/ Salesmen, Life 31/ Salesmen, PG&E/ Air Force Officer/ Optometrist/ 
CPA 44/ Physician 49/ Astronaut/ Mathematician/ Architect 

Dill Corp Pres 35/ CPA 65/ Sales Manager/ Animal Husbandry Prof/ Accountant/ Chemist 

50 Forest Service Men/ Policemen/ Pilot/ Corp Pres 65/ Osteopath/ Petroleum Engineer/ Math-Science 
Teacher/ Salesmen, Real Estate/ Physician 27 

49 Engineer/ Production Manager/ Salesmen, Auto/ Dentist/ Banker 34/ Physicist 27/ Pharmacist 

48 Vet 66/ Funeral Director/ Farmer 36/ Purch Agent/ Banker 64 

47 Carpenter 

46 Vet 49 

45 

44 

43 Farmer 67 

42 

Al 

40 

39 

38 

37 

36 

39 


Note.—In some cases occupational groups were sampled and tested in different years (e.g., Banker, see Table 34). Thus numbers 
following samples indicate appropriate year. 


understanding comes, his answer may change 
even though his attraction to that activity has 
not. 


This brief theoretical side trip illustrates one 
possibility of further research in clustering 
interests, and in understanding the results. 
After 40 yr. of fact gathering, we must have 


some organizing theory, especially since the 
computer has made it possible to carry empiri- 
cism to ridiculous extremes. 


UsE OF THE Basic SCALES 


This paper has been almost solely concerned 
with the construction and evaluation of scales 
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to measure specific areas of vocational interests. 
An important next step is to learn something 
about the application of these scales. 


For Counseling 


Counseling textbooks abound with view- 
points as to how test information should be 
used with individual clients. Research on these 
viewpoints is much less common, and there are 
very few data to support answers to even the 
most obvious questions: ‘“‘It costs about one 
dollar to administer and score a student’s 
Strong—is it worth it?” or ‘Should an indi- 
vidual be shown his scores?” In general, 
opinions here vary inversely in intensity with 
the amount of available relevant information. 


For Selection 


The situation with the use of interest in- 
ventories in employee selection is not greatly 
different—the basic questions on how to apply 
the methods have not been answered, and the 
only acceptable applications currently depend 
on the presence of a skilled, well-trained 
professional. 

While much more research on the effects of 
various approaches is necessary, in the absence 
of such information the authors suggest the 
following guideline: 


These scales are reasonably accurate measures of an 
individual’s interest in the specified areas. Anyone con- 
fronted with a choice of any of these activities, whether 
for training or employment, would be aided by this 
relevant information about himself. Professional psy- 
chologists faced with selection or placement decisions 
should find the results helpful in determining what acti- 
vities an individual might find satisfying. 


Though the basic scales should be useful in 
evaluating an individual’s responses to the 
SVIB, they are intended to supplement, not 
supplant, the regular occupational scales. The 
regular scales still permit greater diversity and 
complexity in profile patterns than do the basic 
scales, as the unique items remaining in the 


SVIB, not included in any of the basic scales, 
are important. It would be a serious mistake 
to restrict measurements to these 22 factors in 
the belief that the valid variance in the interest 
domain has been covered, or even the valid 
variance in the SVIB item pool. Further re- 
search must push ahead to find other ap- 
proaches, always with the intent of finding 
powerful yet parsimonious explanations of the 
organization of an individual’s choices. 
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